Gene PA14_00510 details

Gene Information

Locus tag: PA14_00510
Symbol:
ID: 4384760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 42934
End bp: 53265
Gene Length: 10332 bp
Protein Length: 3443 aa
Translation table: 11
GC content: 69%
IMG OID: 639322596
Product: putative hemagglutinin
Protein accession: YP_788197
Protein GI: 116053762
COG category:
COG ID:
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
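
The start, end, gene length and protein length listed above agree with each other, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(42934, 53265)
print(length)                  # 10332
print(protein_length(length))  # 3443
```

Both values match the "Gene Length" and "Protein Length" fields of this record.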


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.443452
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATCC GCAGCCCGCT GAACCAGTGC ATCGCCCTGT CCCTGGCCGG CATCCTGTTC 
CTCAACCCGA TCGTCGCCGC GGCGGCGGGG CTGGCGCTGG ACAAGGCCGC CGGCGGCAAC
ACCGGCCTGG GCCAGGCGGG CAACGGCGTG CCCATCGTCA ATATCGCCAC GCCCAACGGC
GCCGGGCTGT CGAACAACCA TTTCCGCGAC TACAACGTCG GCGCCAACGG GCTGATCCTC
AACAACGCCA CCGGCAAGAC CCAGGGTACC CAGCTCGGCG GGATCATCCT CGGCAACCCC
AACCTCAAGG GCCAGGCGGC GCAGGTGATC CTCAACCAGG TCACCGGCGG CAACCGCAGC
ACCCTGGCCG GCTACACCGA GGTGGCCGGG CAGTCGGCGC GGGTGATCGT CGCCAACCCG
CACGGCATCA CCTGCCAGGG CTGCGGCTTC ATCAACACGC CGCGCGCGAC CCTCACCACC
GGCAAGCCGA TCATGGACGG CCAGCGCCTG GAGCGCTTCC AGGTGGACGG CGGCGACATC
GTCGTCGAAG GCGCCGAACT GAACGTCGGC AACCTCGAAC AGTTCGACCT GATCACCCGC
AGCGCCAAGC TCAACGCCAA GCTCTACGCG AAGAACCTCA ACATCGTCAC CGGCCGCAAC
GACGTCCAGG CCGACAGCCT GCAGGCCACG CCGCGCGCCG CCGATGGCAG CGAGAAGCCA
CAGCTGGCGA TCGACAGCTC GGCGCTGGGC GGGATGTACG CCGGGGCGAT CCGCCTGGTC
GGCACCGAGC AGGGCGTGGG GGTGAAGCTG GCCGGCGACA TGGCCGCCAG CGGCGGCGAC
ATCCGCATCG ACGCCAGCGG CAAGCTGAGC CTGGCCCAGG CCTCCAGCCA GGGCGACCTG
AAGATCGCGG CCCAGGCCGT GGAGCTGAAC GGCAAGACCT ACGCCGGTGG CAGCGCCGAG
ATCCGCAGCG CGGAGGAACT GGTCAACCGG CAGAGCCTGG CGGCGCGCGA ACGCATCGCG
CTGGAGGCGG CGCATATCGA CAACGCCGGG GTGATCGAAG CCGGCGTCGA GCCAGACGAG
CGGCGCAACG CGCGCGGCGA CCTCGAGCTG CGCAGCGGCA CCCTGCGCAA CGCCGGCAGC
CTGGTGGCCA GCCGCGCGCT GGAAGCGAAG GCGAGCCAGG CGCTGGACAA CCAGGGCGGC
AGCCTGAAGG GGGCGACCGT CCGGGTCGAC GCCGGGCACC TGGACAACCG TGGCGGCAAG
CTGCTCGCCG AGGGCGAACT GCGGGTCGAG GCGAGCAGCC TGGACAACCG CCAGGACGGC
CTGTTGCAGA GCCGGGACCG CGCCGTGGTC AAGACCCGTG GCGATCTCGA CAACCGTGGC
GGCCAGGTGA TCGGCCTGAA CGACCTGGAG GTCGGCGCGG CGACGCTCGA CAACGGCCAG
CAAGGCCTGC TCGGCAGCCA GCAGTCCACC CGCGTCAGCG CCCAGGCGCT GGTCAACCGG
GGGGACGGCG AAGTCTCCGG CAAGCGCGTC GAGGCCCGCG TCGGCAGCCT CGACAATCGC
GGCGGCAAGC TGATCGGCGA CGACCTGCTG GTGGTCGCCA GCGGTGCCAT CGACAACCGC
CTCGGCTTGT TCTCCGCAGC CAACCGCCTC GACCTGCGGG CGCGCAGCCT GGACAACAGC
GGCAAGGGCA CGCTGAGCAG CCGGGGCGGC CTGGAGGTCA GCCTCGGCGG CCTGCTGGAC
AACCGCGATG AAGGCAACCT GCTCAGCCAG GGCGCGCAGC GCGTGACGGT GGGGCAACTG
GACAACCGCG CCGGCGGCCT GCTGTCGAGC CGCAGCGAGT TGAACGTCCA CGGCGCCAGC
CTGGACAACC GTGGCGGCGT GCTGGTAGCC GACGCCGGCC TGAGCGCCAC GGGAGGCGCC
TTCGACAACC GCGACGGCGG CAGCGCCAGC GGCAAGGCTG GCGTGCGCGT GGAGGTCGCC
AGCCTGCGCA ACGACCAGGG TGGCAAGCTG CTCAGCGATG GCCGCCTGGA CCTCGCAGCG
AACGCCGTCG GCAACGCCGG AGGGCGTATC GCCGCCAAGG GCGACCTGCA GGCGACGCTT
GGCAGCCTGG CCCAGCAAGG TGGCGAACTG GTCAGCGAAA AGACCCTGAA GGTCGCGGCC
GACACGCTCG ACAACAGCCA GTCCGGGCTG ATCGCCGCGA ATGGCGACAT CGCTATCGAG
GCGCGGCAGG TCGACAACCG CGCCGGCGAG ATTTCCAGCA CCTCGAAGGT CGCCGTGAAC
GCCCGCGAGC AACTGGACAA CCGCGGCGGC AAGGTCATCG GCGACAGCGG CCTGCGCCTC
ACCGTGCAGC GCCTGCTGAA CCAGGCCAAG GGGGTGCTGG CCGGGCGCGA CGGCCTGAGT
CTGGACGGCG GCGAACTGTT CAACGGCGAC GGCGGTCGGC TCGACAGCCA GAACAGCCTG
AGCGTGAGCC TCGGCGGCGT GCTGGACAAC CAGGGCGGCG CGCTGGTCAG CGAAGGCAGC
CTGACGGCGC GCGCCGCGCG CCTGGACAAC CGTGGCGGAA CCTTCTCCAG CGCCGGTGCG
CTGGCGCTGA CCAGCCAGGC CGCGCTGGAC AACCAGGGCG GCAGGCTGCT CAGCGATGCC
GGCGTGACGC TGAAGGGCGC CAGCCTCGAC AACAGCCGTT CCGGCGTGAT CAGCGCCAAG
GGCGCGGTGG ATATCCGCAC CGGCGTGCTG GACAACAGCC GCAACGGCGG CATCGGCAGC
AACGCCGGCA TCACCCTGGT GGCCGCCCGG CTGGACAACG GCCAGCAGGG CCGGGTCAGC
GCCAAGGGCC TGCTCGACGC CAACCTGAAA GGCCTCGACC AGCGCGGAGG CGGCGTCCTG
GTCAGCGAAA CCGGCGTCAC CCTCGACCTC AATGGCGGCA CGCTGGTCAA CCGCGACGGC
GGCCTGATCG CCACGCCCGG CGCGCTGCTG CTGCGCCAGC TCGGCGCGGT GGACAACGGC
GTCGGCGGGG AAATCTCCAG CGACCGCGCC TTCACCCTCG CCGCCGCCAG CCTGGACAAC
CGCGGCGGGC GCCTGATCGG CGCCGACAGC CTGACCCTGC GCATCGCCCA GGCCCTGGAC
AACAGCCTGG CCGGGGTGAT CTCCGGCGCC GCCGGCCTGG ACATCGCGGC CGCTCGCCTG
GACAACAGCG CCAAGGGCAC CCTGGCCAGC CGCGCCGGCA TCGACCTGCG CGTCGACGGC
GCGCTGGACA ACCACGCCGA AGGCACCGTT TCCGGCGCCC GCCTGACGCT CGCCAGCGCC
TCGCTGGACA ACAGCGGCAA GGGCCTGCTC TCCGGCAACG CCGGCCTGAG CGTCGCCACT
GGCGCGCTGG ACAACGCCGA GGGTGGCCAG TTGACCAGCC AGGGCGTGCT GGACGTCAGC
AGCGCCGACC TCGACAACCG TGGCGGCGCC CTCAGTGGCA AGCAGTCGCT GCGCCTGAGC
GCCGCCAACC TGGACAACCG TGGCGGCCTG CTCACCAGCG ACGGCGAACT GGAACTGACG
GCAGGGCGCG TCGATTCCGC CGACGGCGGC GAAATCTCCG CCCGGGGCGA CCTGCGCCTG
ACGGTCGAGC GCCTGGTGCA ACGCCAGGGC CGGCTGGTCG GCGAGCGCGG CGTCAGTCTC
GACCTGCGAG GCGGCGACCT GGACAACCAG GGCGGCCTGA TCAGTGCCCG CGGCCCGCTG
AGCATCGAGC GGCTGAACGT CCTCGACAAC CGCCAGGGCG GCGAGATTTC CAGCCAGCAG
GGCTTCGAGC TGCTGGCCAG GCGCATCGAC AACGGCCAGC AGGGGCGCAT CATCAGCGCC
GGGAAACTGC GCCTGGACGC CGACGCGCTG GGCAACGCCG GCGCCGGCCT GCTCTCCGGA
TGGCAGGGCC TGACGGTGAC AGGCGGGAGC CTGGACAACA GCGCCGGCGG TACCCTTTCG
AGCAAGGACG GCGAGCTGGC CATCAGCCTC GGCGGCGCGC TGGACAACCA CGGCCAGGGC
GCGCTGGTCA GCAAGGGCGC GCAACGGATC GACGCCGCCA GCCTGGATAA CGCCCAGGGC
ATCGTCTCCG GCGAAAGCGA CGTGACCCTG AGCATCGCCG GGAAGCTGGA CAACGGCCAG
GGCGGCCTGG TCTCGGCGCA GCGCGCGCTG AGCTTCGAGC GCGACGATAC GCTGCTGAAC
AACGCCGGCG GCCGGATCAA CGGCGGCAGC CTGCTGCTCA AGGGCGCCAG CCTGGATAAC
AGCGACGGCC AGTTGATCAG CCAGGGCCGG CTCGACGCCA TCCTCGGCGG CGCCCTGGTC
AACACCGGCG CGGCGCGCCT GGCCAGCGGC GGCGACCTGC TGCTGCGCAG CGCCAGCGTC
GACAACCGCG GCGGCAAGCT GGTCAGCCAG GGGCTGCTGG AGATCAGCGC CGGCAGCCTC
GACAACAGCG CCTCCGGCAC CCTCGCCAGC CAGGCCGACA TGAGCCTGCG CCTGGGCGGC
GGCGCCCTGC GCAACCAGCA GGACGGCCTG ATCTTCAGCC AGGCCGGCGC CCTCGAGGTG
CAGGCCGGCA GCCTGGACAA CCGCCAGGGC ACGCTCCAGG CCCAGGGTGA CAACCGGCTG
CGTATCGGCG GCGCGCTGGA CAACCAGGGC GGCCGCCTGG ACAGCCGGGC CGGCAACCTC
GACCTGCAGA GCGGCAGCCT CGACAACGGC GCCGGCGGCG TGCTCAACAG CGCCAAGGGT
TGGCTGAAGC TGGTCACCGG GCTGTTCGAC AACAGCGCCG GCGTCACCCA GGCGCAGTCG
CTGGAGATTC GCGCCGGGCA AGGCGTGCGC AACCAGCAGG GCCATCTCTC GGCGCTGGGC
GGCGACAACC GCATCGTCAC CGCCGACTTC GACAACCAGG GTGGCGGCCT CTACGCCAGC
GGCCTGCTCA GCCTCGACGG CCAGCGCTTC CTCAACCAGG GCGCGGCGGC GGGCCAGGGC
GGCAAGGTCG GCGCCGGGCG CATCGACTTC AGCCTGGCCG GCGCGCTGGC CAACCGCTTC
GGCCAGTTGG AAAGCGAGAG CGAGCTGCAC CTGCGCGCCG CCGCGATCGA CAACAGCGGC
GGCAGCCTGC GCGCCCTTGG CCGCAGCGGC AGCACGCGGC TGGTCGCTGG CGACCTGAAC
AACGCCTACG GCGTGCTGGA AAGCGCCAAC CAGGACCTCG ACCTGCAACT GGGCAGCCTG
GCCAACGCCG GCGGGCGCAT CCTCCACACT GGCAACGGCA CCTTCGGCCT GGATTCCGGG
CAGGTGATCC GCGCCGGCGG CGAACTGACC ACCAATGGCC TGCTGGACAT CCGCGCCAGC
GAATGGACCA ACAGCAGCGT GCTGCAAGCC GGACGGCTGA ACCTGGACAT CGGCACCTTC
CGCCAGACGG CCGAGGGCAA GCTGCTGGCG GTGCAGTCCT TCACTGGCCG CGGCGGCGAC
TGGAGCAACG ACGGCCTGCT GGCCAGCGAC GGCAGCTTGC GCCTCGACCT GAGCGGCGGC
TACCGTGGCA ACGGCCGCGC CACCAGCCTC GGCGACTTCG CCCTGAACGC CGCCAGCCTC
GACCTCGGCA ACGCCGCCAG CCTCGCCGGC GGCGCCAATG TCACGCTCGG CGCCGGCAAC
CTGCTGGTCA ACCGTGGGCG GATCACCGCC GCCGGCGACC TCGTGGCCAG CGCCGCGAGC
CTGAACAACT ACGGCACCCT GGGCGGCGGC GGCAACCTGC GATTGAACGC GCCCGCCCTG
CTCAACGAGC GCGGGTTGCT GTTCAGTGGC GCCGACATGA CCTTGCGCGC CGGCGACATC
ACCAACCTCT ACGGGGATGT GTACAGCCTC GGCAGGCTGG ATATCGCCCG CGACGATGCC
GGCAACCGTG CCGCCAGCCT GCGCAACCTT TCCGGGGTGA TCGAGAGCGG CAAGGACTTC
AGCCTGCGTG CCAGCCTGAT CGAGAACCGT CGCGCCGTGC TGGAAAGCAA GTCGGGCCTG
TACACCGCGA AGATGGAGCA GACCGCCTGC ATCGAAGGCG TCAACGCAGG CGACTGCAGC
GGCAAGCGCA ACGCCATCTG GACCATCACC CAGCGCGACA AGACCGAGGT CACCGCCAGC
AGCGCCATGG GGCAACTGCT GGCCGGAGGC GACTTCGCCA TCGACGGCGG CACCCTGAAC
AACCTTTCCA GCCTGATCGG CAGCGGCGGC AACCTCACCG CCAACCTCGA AGTCCTCGAC
AACCAGGGCC TGGAAACCGG CGAGCTGGAA ACCATCCGCG TGCTGCGTAC CGCTCGCGGC
GGCGATATCG GCGGCATCGA CCAGAAGTCG CGTAACTTCA CCAACCTCTA CTGGTACCAG
AGCGCCAATT TCGACCCGGC GCGCGCGGGC GAGATTCCCG CCGCGCTCAA CGCGATCCTC
AGCGACTGGT CCTTCGAGTA CGAATTCCCG AGCAAGGGAC CGACCCCGAT CAGCAGTGGC
GACCAGTCCT ACGCAGCGGT GATCCAAGCC GCCGGCGACG TCACGGTCAA TGCCAGCACG
CGCATCGACA ACGGCGTCAC CCGCCCCGGC TACACCTTCG TCGGCAGCGG CCGCCAGGTG
GGCGACAGCG CGGTGGGCGG CAGCGGGGTT TCGGTGGTCG TGCCGCTGAC CTCGCAACTG
CCGCCCGACC TGGCGCGGCG CCAGGTCAAC CCGGTCACCC TGCCCGGCTT CAGCCTGCCC
CATGGCGACA ACGGCCTGTT CCGTCTCAGC TCGCGCTTCG CCGAGGACGG CAATGGCAGC
GCCGCGCTCG GTGCCGGCGC CGACCGCACC CAGGGCGGCA GCGGCGTCTC GGTCGGCCAG
CAGGGCGCCG GCAACGCCGC CGGTACCTGG CAGGGCCAGG GCGTGCGAGT CGACGGCCTG
GCTGGCGCGG CCAACGTCCA GGGTCAGGGC GGCAGCGCGC TCGGCGGTAG CCTGCCGGGC
GTCGCCCGGG TCCAGGGCGT GCCCGGCAAC GCCACGCCGA GCGCCAGCCA CAAGTACCTG
ATCGAGACCA ACCCGGCGCT CACCGAACTG AAGCAGTTCC TCAACTCGGA CTACTTGCTC
AGCGGCCTGG GCATGAACCC GGACGCTAGC AAGAAGCGTC TCGGCGACGG TCTCTACGAG
CAGCGGCTGA TCCGCGACGC GGTGGTGGCG CGCACCGGCC AGCGCTACAT CGACGGGCTG
AGCAGCGACG AGGCACTGTT CCGCTACCTG ATGGACAACG CCATCGCTTA CAAGGACCAA
CTGCACCTGC AACTGGGCGT GGGTCTGAGC GCGGAGCAGA TGGCGGCGCT GACCCACGAC
ATCGTCTGGC TGGAAGAGGT CGAGGTGAAC GGCGAGAAGG TCCTCGCGCC GGTGGTCTAC
CTGGCCCAGG CGGAGGGTCG GCTGGCACCC AACGGTGCGC TGATCCAGGG CCGCGACGTG
AAGCTGGTGA GCGGCGGCGA CCTGCATAAC GTCGGCACCC TGCGCGCGCG GAACGACCTC
TCGGCGACGG CCGACAACCT CGACAACAGC GGCCTGATCG AGGCCGGCAA GCGCCTCGAC
CTGCTCGCCG GCGACTCGAT CCGCAACCGC CAGGGCGGGG TCATCGCCGG GCGCGATGTG
AGCCTCACCG CGCTGACCGG CGACGTGATC AACGAACGCA GCGTGACCCG CTACGACAGC
GCGCTCGACG GCCGCACCTG GGAGCGCAGC TTCGCCGACA GCGCCGCGCG GGTGGAGGCG
GCGAACAGCC TGAACGTCCA GGCCGGACGC GACATCGCCA ACCTCGGCGG GGTGCTGCAG
AGCCGCGGCG ACCTCAGCCT CGACGCCGGA CGCGACGTCA CCGTCGCCGC CGTCGAGGAC
CGCCAGGGCC AGACCCGCTG GAGCACGTCG CGGCTACAGA GCGTGACCCA GCTCGGCGCC
GAAGTCAGTG CCGGGCGGGA CCTGAACGTC AGCGCCGGGC GCGACCTCAG CGCAGTGGCC
AGCGCCCTCG AAGCGCGCCG CGACATCGCC CTCTCCGCCG GGCGCGACGT GACCCTGGCG
GCGGCGGCGA ACGAGGAGCA TGCCTACAGC AAGACCAGGA AGGTCACCTA CCAGGAAGAC
AAGGTCGCCC AGCAAGGCAC CCGCGTGGAC GCCGGCGGCG ACCTGGCGAT CAATGCCGGA
CAGGACCTGC GCCTGATCGC GAGCCAGGCC AGCGCCGGCG ACGAGGCCTA CCTGGTGGCC
GGCGACAAGC TGGAACTGCT GGCCGCCAAC GACAGCAACT ACTACCTGTA CGACAAGAAG
AAGAAAGGCG ACTTCGGCCG CAAGGAAACC CGGCGCGACG AAGTCACCGA CGTCAAGGCG
GTGGGCAGCC AGATCAGCAG CGGCGGCGAC CTCACCCTGC TCAGCGGCGG CGACCAGACC
TACCAGGGCG CGAAGCTGGA ATCGGGCAAC GACCTGGCCA TCGTCAGCGG CGGCGCGGTG
ACCTTCGAGG CGGTGAAGGA CCTGCACCAG GAAAGCCACG AGAAGAGCAA GGGCGACCTG
GCGTGGAACA GCGCCAAGGG GAAAGGGCAG ACCGATGAAA CGCTTCGGCA GACCCAGATC
GTCGCTCAGG GAAATCTGGC GATCAAAGCA GCGGACGGGC TGAAGATCGA TGTGAAGCAT
ATCGACCAGA AGACCGTTTC CGAGACCATC GATGTCATGG TCAAGGCCGA TCCGAGCCTC
GCCTGGTTGC GGGAGGCCGA AAAACAGGGT GACGTCGATT GGCGCAAAGT CCGGGAAGTA
CATGACAGCT TCAAGTACAG CCATTCGGGG TTGGGCGCTG GCGCGGCATT GGCCATCGCC
ATTGTGGTTA CCTACCTGAC CTGGGGTGCG GGTAGTTCGA TGGCAGGCGT CGCGGCCAAA
AGCGCAACAG GCGTCGCTGC CAACTCCGTC GCCAGTGCCG TAGCCACCAA CGCGGCGATC
AGCACGGTGA ATAACCGCGG CAACCTTGGC GCGGTGGCGA AAGACGTGAC CTCCAGCGAC
AGCCTGAAGG GCTACGCGGT CGCCGGTATC AGCGGCGGCT TCATGCCCAG CAGTCTGGGT
GCCCAGCTTG CCGTTCGCTC CGCGCTGAAC ACCGTGGTGA ACGGTGGCAA GTTCAGGGAC
AACGTCGCGC AAGCTGCCAT CAGTATGGCG GCGGACGCGC TAAGCGGCGC GATCTTCGAC
AAGGTCGGAG ATGCGTTGGT CGGCAGCGGC CTTCCCAAGA AGGTAGCGGT CCATGCGATT
GTCGGCGGGT TGATCGGCGA GGCTGCCGGC GGCGATTTCC GTACCGCAGC CCTGGCTGCT
GGCGCCAACG AGGCATTGGT AAGCCTCGTC GGTGAAAAGA TCTTCCCTGG CGAGGCTCAT
GAGCGCGTGC TGGCGATGAC CTCGCAGTTG ATAGGGATGA CAGTGGCTGC GGCGGCCGGA
GGCGATACCA AGGCCCAAGA GAAAGCCGCT TGGGTGGCTC AGCAGGCCAC GGTGTACAAC
AATCTGAATC ACGCGGCGGC GGAGAGCCTG CTCAAGGAAA TCAAGGATTG TCGCGCGGCA
GGCGGCTGTG GCGAGGAGAA GTTGCAAGGC ATCCTCGGCA AGTACGAAAA GCTGTCGGCC
GAGCGTTCAA ACGCTATCGG CCAGTGTGCT TCGCGCCAGT GTGTGGACGA CATCGTCGAC
AGTTCGATCC GGATGGACGA TCCGGTTTCC AAAGAGCTGC TCAGCCTACT GCGGCAAACC
ACCTACGACA CACCCGGCTT GTTGCAGGGC AATCCTGATG CGATCGTGTC GCAGACGCCG
AATCCAAGTG GCTGGGGAGA TCTCTTTGCC CTGGACAAGC AACTGGCGTT CGCCAAGAAC
CTCAAGGAAG GCTGGCTGAC ACCGGAGGAA ACGGCTGATC TGGATCGCTG GAATGCGTCC
ACTTCCTGGC TGGATCGCAC TGCTGGCCGA CAACTGGATC CCAAGGAGAA GGCTTACCTG
CTCTCGGAGC TGGGGGGCGC GGCGGCTATG GCGCTTCTTG GGGGGAGGGG GAGTGTTGGA
TCGAACGCTA CTTTTGGTCA AATTAAAACA GTGCTTGATA CTGCGCAGGC TCCATATAAA
GGTAGTACTG TTATAGGACA TGCGCTATCT AAACATGCGG GTAGGCATCC CGAAATATGG
GGTAAAGTTA AAGGTTCTAT GTCTGGTTGG AACGAACAAG CTATGAAGCA TTTTAAGGAA
ATTGTCCGTG CTCCTGGGGA GTTTCGACCT ACTATGAATG AAAAGGGAAT AACTTTTTTA
GAGAAGCGTC TGATAGATGG TCGTGGGGTT AGGCTGAATC TAGATGGAAC TTTTAAAGGG
TTTATTGACT GA
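
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined into one string before a statistic such as GC content can be computed. A minimal sketch of that step follows; the helper names are hypothetical, and only the first line of the sequence is used as input for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one upper-case string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (with spaces).
first_line = "ATGGACATCC GCAGCCCGCT GAACCAGTGC ATCGCCCTGT CCCTGGCCGG CATCCTGTTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```

Applying the same two functions to the full sequence text would give a value to compare against the GC content field of the record.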
 
Protein sequence
MDIRSPLNQC IALSLAGILF LNPIVAAAAG LALDKAAGGN TGLGQAGNGV PIVNIATPNG 
AGLSNNHFRD YNVGANGLIL NNATGKTQGT QLGGIILGNP NLKGQAAQVI LNQVTGGNRS
TLAGYTEVAG QSARVIVANP HGITCQGCGF INTPRATLTT GKPIMDGQRL ERFQVDGGDI
VVEGAELNVG NLEQFDLITR SAKLNAKLYA KNLNIVTGRN DVQADSLQAT PRAADGSEKP
QLAIDSSALG GMYAGAIRLV GTEQGVGVKL AGDMAASGGD IRIDASGKLS LAQASSQGDL
KIAAQAVELN GKTYAGGSAE IRSAEELVNR QSLAARERIA LEAAHIDNAG VIEAGVEPDE
RRNARGDLEL RSGTLRNAGS LVASRALEAK ASQALDNQGG SLKGATVRVD AGHLDNRGGK
LLAEGELRVE ASSLDNRQDG LLQSRDRAVV KTRGDLDNRG GQVIGLNDLE VGAATLDNGQ
QGLLGSQQST RVSAQALVNR GDGEVSGKRV EARVGSLDNR GGKLIGDDLL VVASGAIDNR
LGLFSAANRL DLRARSLDNS GKGTLSSRGG LEVSLGGLLD NRDEGNLLSQ GAQRVTVGQL
DNRAGGLLSS RSELNVHGAS LDNRGGVLVA DAGLSATGGA FDNRDGGSAS GKAGVRVEVA
SLRNDQGGKL LSDGRLDLAA NAVGNAGGRI AAKGDLQATL GSLAQQGGEL VSEKTLKVAA
DTLDNSQSGL IAANGDIAIE ARQVDNRAGE ISSTSKVAVN AREQLDNRGG KVIGDSGLRL
TVQRLLNQAK GVLAGRDGLS LDGGELFNGD GGRLDSQNSL SVSLGGVLDN QGGALVSEGS
LTARAARLDN RGGTFSSAGA LALTSQAALD NQGGRLLSDA GVTLKGASLD NSRSGVISAK
GAVDIRTGVL DNSRNGGIGS NAGITLVAAR LDNGQQGRVS AKGLLDANLK GLDQRGGGVL
VSETGVTLDL NGGTLVNRDG GLIATPGALL LRQLGAVDNG VGGEISSDRA FTLAAASLDN
RGGRLIGADS LTLRIAQALD NSLAGVISGA AGLDIAAARL DNSAKGTLAS RAGIDLRVDG
ALDNHAEGTV SGARLTLASA SLDNSGKGLL SGNAGLSVAT GALDNAEGGQ LTSQGVLDVS
SADLDNRGGA LSGKQSLRLS AANLDNRGGL LTSDGELELT AGRVDSADGG EISARGDLRL
TVERLVQRQG RLVGERGVSL DLRGGDLDNQ GGLISARGPL SIERLNVLDN RQGGEISSQQ
GFELLARRID NGQQGRIISA GKLRLDADAL GNAGAGLLSG WQGLTVTGGS LDNSAGGTLS
SKDGELAISL GGALDNHGQG ALVSKGAQRI DAASLDNAQG IVSGESDVTL SIAGKLDNGQ
GGLVSAQRAL SFERDDTLLN NAGGRINGGS LLLKGASLDN SDGQLISQGR LDAILGGALV
NTGAARLASG GDLLLRSASV DNRGGKLVSQ GLLEISAGSL DNSASGTLAS QADMSLRLGG
GALRNQQDGL IFSQAGALEV QAGSLDNRQG TLQAQGDNRL RIGGALDNQG GRLDSRAGNL
DLQSGSLDNG AGGVLNSAKG WLKLVTGLFD NSAGVTQAQS LEIRAGQGVR NQQGHLSALG
GDNRIVTADF DNQGGGLYAS GLLSLDGQRF LNQGAAAGQG GKVGAGRIDF SLAGALANRF
GQLESESELH LRAAAIDNSG GSLRALGRSG STRLVAGDLN NAYGVLESAN QDLDLQLGSL
ANAGGRILHT GNGTFGLDSG QVIRAGGELT TNGLLDIRAS EWTNSSVLQA GRLNLDIGTF
RQTAEGKLLA VQSFTGRGGD WSNDGLLASD GSLRLDLSGG YRGNGRATSL GDFALNAASL
DLGNAASLAG GANVTLGAGN LLVNRGRITA AGDLVASAAS LNNYGTLGGG GNLRLNAPAL
LNERGLLFSG ADMTLRAGDI TNLYGDVYSL GRLDIARDDA GNRAASLRNL SGVIESGKDF
SLRASLIENR RAVLESKSGL YTAKMEQTAC IEGVNAGDCS GKRNAIWTIT QRDKTEVTAS
SAMGQLLAGG DFAIDGGTLN NLSSLIGSGG NLTANLEVLD NQGLETGELE TIRVLRTARG
GDIGGIDQKS RNFTNLYWYQ SANFDPARAG EIPAALNAIL SDWSFEYEFP SKGPTPISSG
DQSYAAVIQA AGDVTVNAST RIDNGVTRPG YTFVGSGRQV GDSAVGGSGV SVVVPLTSQL
PPDLARRQVN PVTLPGFSLP HGDNGLFRLS SRFAEDGNGS AALGAGADRT QGGSGVSVGQ
QGAGNAAGTW QGQGVRVDGL AGAANVQGQG GSALGGSLPG VARVQGVPGN ATPSASHKYL
IETNPALTEL KQFLNSDYLL SGLGMNPDAS KKRLGDGLYE QRLIRDAVVA RTGQRYIDGL
SSDEALFRYL MDNAIAYKDQ LHLQLGVGLS AEQMAALTHD IVWLEEVEVN GEKVLAPVVY
LAQAEGRLAP NGALIQGRDV KLVSGGDLHN VGTLRARNDL SATADNLDNS GLIEAGKRLD
LLAGDSIRNR QGGVIAGRDV SLTALTGDVI NERSVTRYDS ALDGRTWERS FADSAARVEA
ANSLNVQAGR DIANLGGVLQ SRGDLSLDAG RDVTVAAVED RQGQTRWSTS RLQSVTQLGA
EVSAGRDLNV SAGRDLSAVA SALEARRDIA LSAGRDVTLA AAANEEHAYS KTRKVTYQED
KVAQQGTRVD AGGDLAINAG QDLRLIASQA SAGDEAYLVA GDKLELLAAN DSNYYLYDKK
KKGDFGRKET RRDEVTDVKA VGSQISSGGD LTLLSGGDQT YQGAKLESGN DLAIVSGGAV
TFEAVKDLHQ ESHEKSKGDL AWNSAKGKGQ TDETLRQTQI VAQGNLAIKA ADGLKIDVKH
IDQKTVSETI DVMVKADPSL AWLREAEKQG DVDWRKVREV HDSFKYSHSG LGAGAALAIA
IVVTYLTWGA GSSMAGVAAK SATGVAANSV ASAVATNAAI STVNNRGNLG AVAKDVTSSD
SLKGYAVAGI SGGFMPSSLG AQLAVRSALN TVVNGGKFRD NVAQAAISMA ADALSGAIFD
KVGDALVGSG LPKKVAVHAI VGGLIGEAAG GDFRTAALAA GANEALVSLV GEKIFPGEAH
ERVLAMTSQL IGMTVAAAAG GDTKAQEKAA WVAQQATVYN NLNHAAAESL LKEIKDCRAA
GGCGEEKLQG ILGKYEKLSA ERSNAIGQCA SRQCVDDIVD SSIRMDDPVS KELLSLLRQT
TYDTPGLLQG NPDAIVSQTP NPSGWGDLFA LDKQLAFAKN LKEGWLTPEE TADLDRWNAS
TSWLDRTAGR QLDPKEKAYL LSELGGAAAM ALLGGRGSVG SNATFGQIKT VLDTAQAPYK
GSTVIGHALS KHAGRHPEIW GKVKGSMSGW NEQAMKHFKE IVRAPGEFRP TMNEKGITFL
EKRLIDGRGV RLNLDGTFKG FID
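
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. Table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons, so a standard table is enough for this gene, which begins with ATG. The sketch below is an illustration with hypothetical names, not the tool the database used, and it does not handle alternative start codons.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 12 bases of the gene sequence shown above.
print(translate("ATGGACATCC GCAGCCCGCT GAACCAGTGC"))  # MDIRSPLNQC
print(translate("TTTATTGACT GA"))                     # FID
```

The two outputs match the first ten and last three residues of the protein sequence listed in this record.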