Sandbox is great?
From time to time I have the opportunity to run training sessions about the Hadoop ecosystem. The main requirement is a good Linux/Mac laptop that can run:
- MRUnit tests
- Map Reduce in local mode
- PigUnit
For Windows machines it's more difficult, as it requires Hadoop binaries.
But most important is that the laptop should run the Hortonworks Sandbox VM smoothly, and that VM is updated every half year and requires more and more resources. At this moment 6 GB of RAM is the minimum and 8 GB is recommended just for the VM, and it will still be slow.
During trainings students also use Eclipse/IntelliJ IDEA and browse the internet, so in practice a 16 GB laptop is needed. This is a problem, because most people don't have brand new hardware with those parameters and can't attend the courses.
So for training and private use I have created a very small cluster on Amazon EC2, consisting of two machines.
Below is a short instruction on how to install HDP 2.2.4.2 with Ambari 1.7.
References
This instruction is a mix of many guides, but most of the credit goes to the two below:
- http://hortonworks.com/blog/deploying-hadoop-cluster-amazon-ec2-hortonworks/
- http://hortonworks.com/kb/ambari-on-ec2/
EC2 setup
The cluster consists of two machines:
- m3.xlarge hdpmaster1
- m3.xlarge hdpmaster2
I have created the first instance based on CentOS 6 with a 100 GB SSD.
During the instance creation process there is an option to create a security group; below is a short list of the port exceptions I use (an equivalent AWS CLI sketch follows the list). If anything is missing, it is easy to fix later:
ICMP rules:
- all, open to ALL
TCP rules:
- 0 – 65535: your subnet
- 22 (SSH): 0.0.0.0/0
- 7180: 0.0.0.0/0
- 8080 – 8100: 0.0.0.0/0
- 50000 – 50100: 0.0.0.0/0
UDP rules:
- 0 – 65535: your subnet
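I clicked this together in the AWS console, but for the record, a hedged sketch of the same rules with the AWS CLI; the group name hdp-training and the 10.0.0.0/16 subnet are my own placeholders, adjust them to your VPC:
aws ec2 create-security-group --group-name hdp-training --description "HDP training cluster"
# ICMP open to everything (ping between nodes and from outside)
aws ec2 authorize-security-group-ingress --group-name hdp-training --protocol icmp --port -1 --cidr 0.0.0.0/0
# all TCP and UDP traffic inside the cluster subnet
aws ec2 authorize-security-group-ingress --group-name hdp-training --protocol tcp --port 0-65535 --cidr 10.0.0.0/16
aws ec2 authorize-security-group-ingress --group-name hdp-training --protocol udp --port 0-65535 --cidr 10.0.0.0/16
# SSH, Ambari and the Hadoop web UIs from anywhere
aws ec2 authorize-security-group-ingress --group-name hdp-training --protocol tcp --port 22 --cidr 0.0.0.0/0
aws ec2 authorize-security-group-ingress --group-name hdp-training --protocol tcp --port 7180 --cidr 0.0.0.0/0
aws ec2 authorize-security-group-ingress --group-name hdp-training --protocol tcp --port 8080-8100 --cidr 0.0.0.0/0
aws ec2 authorize-security-group-ingress --group-name hdp-training --protocol tcp --port 50000-50100 --cidr 0.0.0.0/0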
I've downloaded the training-cluster.pem key and saved it securely (the .ssh directory is great for that).
Now I can log in using this key:
ssh -i .ssh/training-cluster.pem root@52.6.33.48
There are some things to install and turn off:
vi /etc/sysconfig/selinux (set SELINUX=disabled)
yum -y install ntp
chkconfig ntpd on
chkconfig iptables off
chkconfig ip6tables off
/etc/init.d/iptables stop 2> /dev/null > /dev/null
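Two small additions of mine that are not in the original steps: chkconfig only takes effect on the next boot, so it may also be worth starting ntpd right away, and SELinux can be relaxed without a reboot:
service ntpd start   # start NTP now; chkconfig ntpd on only applies from the next boot
setenforce 0         # switch SELinux to permissive for the current session (the config change applies after reboot)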
Now we can save our image (without reboot) as a private AMI and launch the second instance (hdpmaster2).
To make passwordless login work from Ambari (running on hdpmaster1) to the second node, we have to upload our key.
scp -i .ssh/training-cluster.pem .ssh/training-cluster.pem root@ip-of-hdpmaster1:
ssh -i .ssh/training-cluster.pem root@ip-of-hdpmaster1
mv training-cluster.pem .ssh/id_rsa
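One thing worth checking (my addition, not part of the original steps): ssh refuses private keys that are readable by other users, and scp does not necessarily preserve the original permissions, so tighten them just in case:
chmod 600 .ssh/id_rsa   # ssh ignores private keys with permissive modes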
To make it easier I modify /etc/hosts:
127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 localhost6.localdomain6
<private ip of hdpmaster1> hdpmaster1
<private ip of hdpmaster2> hdpmaster2
I distribute the hosts file to the second node; it doesn't ask for a key or password:
scp /etc/hosts root@hdpmaster2:/etc/hosts
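A quick check (my own addition) that both the hosts entries and the passwordless login work the way Ambari will expect:
ssh root@hdpmaster2 hostname   # should print the hostname without asking for a password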
At this point all the machines are ready for the HDP installation.
Ambari setup
I set up the Ambari repo, install the server and start it, everything with default values.
yum install wget
wget http://public-repo-1.hortonworks.com/ambari/centos6/1.x/updates/1.7.0/ambari.repo
cp ambari.repo /etc/yum.repos.d
yum install epel-release
yum repolist
yum install ambari-server
ambari-server setup
ambari-server start
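If anything goes wrong at this stage, a hedged way to check whether the server is actually up (standard Ambari commands and log location, but verify on your version):
ambari-server status                                   # should report a running server and its PID
tail -n 50 /var/log/ambari-server/ambari-server.log    # default server log location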
Now I can log in (admin/admin) at hdpmaster1-public-ip:8080. I change the password!
Cluster setup
I choose the HDP 2.2 stack and paste the hostnames of my huge cluster:
hdpmaster1
hdpmaster2
and watch the progress bars, which finish quite smoothly. I set up all the services I wanted, assign DataNode and NodeManager to both machines, and watch the progress bars again.
There are some issues that I fix later (see the Problems section below).
User creation
I add a sample user; this is also a good snippet for adding more users later:
groupadd hadoopusers
useradd -g hadoopusers app_user
passwd app_user
sudo -u hdfs hdfs dfs -mkdir /user/app_user/
sudo -u hdfs hdfs dfs -chown -R app_user:hadoopusers /user/app_user/
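A small sanity check (my addition) that the new user can actually use their HDFS home directory:
sudo -u app_user hdfs dfs -put /etc/hosts /user/app_user/hosts-test   # write a test file
sudo -u app_user hdfs dfs -ls /user/app_user                          # it should be listed and owned by app_user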
Smoke tests
I test all the functionality - Hive, Tez, MapReduce, HBase - and fix the problems listed below.
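For reference, a minimal hedged version of the smoke tests I run; the example jar path is the usual HDP 2.2 location, but double-check it on your nodes:
sudo -u app_user hadoop jar /usr/hdp/current/hadoop-mapreduce-client/hadoop-mapreduce-examples.jar pi 2 10   # simple MapReduce job
sudo -u app_user hive -e "show databases;"       # Hive connectivity
echo "status" | sudo -u app_user hbase shell     # HBase shell reachable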
Problems
1) During MapReduce jobs we get this error: "File does not exist: hdfs://…../hdp/apps/2.2.4.2-2/mapreduce/mapreduce.tar.gz". From hdpmaster1 do:
sudo -u hdfs hdfs dfs -mkdir -p /hdp/apps/2.2.4.2-2/mapreduce
sudo -u hdfs hdfs dfs -put /usr/hdp/current/hadoop-client/mapreduce.tar.gz /hdp/apps/2.2.4.2-2/mapreduce/
2) During MapReduce jobs we get this error: "File does not exist: hdfs://…../hdp/apps/2.2.4.2-2/tez/tez.tar.gz". From hdpmaster1 do:
sudo -u hdfs hdfs dfs -mkdir -p /hdp/apps/2.2.4.2-2/tez
sudo -u hdfs hdfs dfs -put /usr/hdp/2.2.4.2-2/tez/lib/tez.tar.gz /hdp/apps/2.2.4.2-2/tez/
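A quick check (my addition) that both archives ended up where YARN looks for them:
sudo -u hdfs hdfs dfs -ls /hdp/apps/2.2.4.2-2/mapreduce/ /hdp/apps/2.2.4.2-2/tez/   # both tarballs should be listed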
3) Despite assigning a 100 GB SSD to each machine, CentOS reports an 8 GB disk size. Here is a great fix for that: http://stackoverflow.com/a/24030938/4368212 - I just follow the instructions.
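From memory, the gist of that answer is to grow the root partition and then the filesystem; a hedged sketch, assuming the root device is /dev/xvda with partition 1 (check yours with df -h and fdisk -l first):
yum -y install cloud-utils-growpart   # provides the growpart tool (from EPEL) on CentOS 6
growpart /dev/xvda 1                  # extend partition 1 to use the whole disk
reboot                                # let the kernel pick up the new partition table
# after the reboot:
resize2fs /dev/xvda1                  # grow the ext4 filesystem to fill the partition
df -h                                 # should now show the full 100 GB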
Previous day's struggles
On the previous day I tried to install the cluster on RHEL 7 - it doesn't work at the moment. Some issues worth noting:
- The Ambari agent has some issues with rpms: https://issues.apache.org/jira/browse/AMBARI-10201, which of course I've encountered :) There is a fix for that, but it is not yet released
- “Cannot register host with not supported os type, hostname={host private DNS}, serverOsType=redhat7, agentOsType=redhat7” http://hortonworks.com/community/forums/topic/failure-registering-hosts/
- Ambari requires root access by default. To enable it on RHEL there is a nice instruction: http://stackoverflow.com/a/18047873/4368212
Cloudbreak
One could ask: why not use SequenceIQ Cloudbreak? I tried it yesterday and it creates a cluster very nicely. But when I logged in, I didn't know how to use it - how to run Hive, a Pig script, HBase or a MapReduce application. I need to read some articles about it and come back to it, because it automates everything I have written above.
Costs
This setup isn't expensive as long as I remember to shut down the cluster after use. The Amazon calculator estimates it should cost $18 for storage and 50 cents per hour for running the cluster.
I hope this instruction will help someone.