Gene HS_0383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0383 
Symbol 
ID4239859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp394115 
End bp406831 
Gene Length12717 bp 
Protein Length4238 aa 
Translation table11 
GC content39% 
IMG OID638103926 
Productlarge adhesin 
Protein accessionYP_718593 
Protein GI113460529 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA TCTTCAAAAC AAAATACGAT GTAACAACCG GTCAAACTAA AGTAGTGTCT 
GAATTAGCGA ATAACCGTCA GGTAGCGAGC CGTGTTGAGG GGTCGTCGGT CGCCGAGAAG
TGCGGTGTGT TTTTGGGTAA TTTTTTAGGG GCGTTTAAGG TCCTGCCGTT GGCGTTGGTG
ATGAGTGGGG TGTTTTCTAG TATCGGGTAT GGGGCAACTG TATTTTTGGA TAATAATAAA
ACTTCATCTA TTGGTTCTGA TAATAAGTCT ACAGCAGTAT GGAGTGATGA AGGAGTAGGT
AAGTTTAACA ATACATTAAA AGATAAACTA AAGAAAAGCG AATCAATACT ATTATCACAC
ATTGATAATA CAGCAGGTGC AGATACGAGT TTAACAAATA AAGACTTTTT TAGAACTGTA
GTTATCGGAT CTCGTACCGT TGCTGGAGGT AATGATGCTA CTGCTATAGG ATACAGAGCT
ATAGTTGGTA AAAATAAAAA AGAAACAAAT ACAAAGGATA GCCATCAAGG AACAGCGGTA
GGTTATCGTG CATTTGCCAA TGGAACTGAG TCAGTATCTA TGGGTAATGA TACAATCGCT
TGGGGAGATT CATCTATTGC AATCGGTTCT GATAATGCTA ATGGGAAAGA AGCAAAAAAT
CTTTCAAAAG AAATATACAA GTTATTCCAC GCTTTAAGAG ATGATTTTAA CTATACTTCT
GAATATGCTG CGGTTAATCA CAAAACACTT AGACAAGGTA AAGAAAAATC GCATGATATG
TTAGATAATG ACGGAAATTT GATGGCTAAG GATGGTTTTT ATATTGGTTA TAATGGAGGA
TATTATTTTA GAGAACCTGA TGAATCAGGA CAATTAAAGA ATGAAAATCT TAAATATATG
TTAAAAGATG GTATTTTATA TGAAAAGATA GATCAACAAG GATTTTCTAA AATTGAAAAT
AAAAAAAGAC ATGACGAAAT AATGAACTCA TTAAAAAATA ATGAAGATTA TAAAAGATAT
GAACAATACA TGGCGATATT AGATGCTGAC TATCAAAATT ATAAAGTCAC ACCTAATAGT
AAAAAAACAC ATACATGGGC AAGAGGAAAT AACTCGATTG CTATTGGTGG ACGTTCTATT
GCTTTTGGTA ATAATTCAAC TGCTTTAGGT ACTGCCTCAG TAGCTGCTAG AGACTATGCA
ACTGCTATTG GTTCAGATAC TATTGCTTAT GGTAAACAAT CATTAGCTGT AGGTAATAGA
GCAATAGTTT ACTCTGATGG TTCTGTAGGT ATTGGTCATA AAGTTCAAGC ACTTAATGCA
GGGGCTATGG TATATGGTAG AGAATCATTT GCAGGTGGAA CAGGTTCATT GGCTATAGGT
GACCGTACAT TTGCAAACGT AACTATGGGA GATGATTTTA AAACTACAGT AGAAGATTTA
TATTTAAAAG GCGATACAGA CACTATACAA AATTCTCATA AAGAAAAATT CTACCCTAAA
ACAGTATTAC AAGACGGAAC AGAAGAAAAT AAAGCAGAAC AATCTAGAGA TAGTGAGGGT
AGATATAATC AAGGGGCTGT TGCACTAGGT TCTTATGCAA CTGCACTTGG GGATAATACA
CTTGCACTAG GAAGATTTGC CTATGCTAAA GAAGACTCAG CAGTAGCACT AGGTAGATTT
GCATTTGCAC GTAATAAAGA ATCGCTTGCT TTAGCACCTT TTTCAAGAGC GATGGGAGAG
AGATCTGTAG CCTTAGGGCA TCAATCATTA GTTGAGGCAA AAGATTCTAT GGCTATAGGA
GTGGGTGCAA GAGTTTCAGG AGAAATTCCT GAAGAGTTAA AAGCTGCTAC TAATAAAAAT
ATAAAAGTTC CAAACGAGGA TTCAATGGCG ATAGGAAATA AAGCTGAAGC ATATTTTCCT
AAATCAATCG CATTGGGGGT AGAATCAAAA ACTGATTATT CAAGAAAAGA AATGGCACAA
GATGCATGGG CACCAAAACA TGCCATATCT TTGCCATCAT CACAAAAAAT TGGTTACTTA
TCAGTTGGTG GAAAGAATGC TGAAAGACGT ATCGTCAATG TCGCACCAGG AGCAAGTGAT
ACAGATGCAG TAAACGTATC ACAATTAAGG GCATTAGAAG AAGCTATTTT ATATGGAAGT
TCATTGGAAG AGGAAGAAGC ATATAATGGA GTTAAATATT TATCTATAAA AGGTATAGAT
GAATATAAAA AAATTGTTAG AAAAGAACAA GATTATAAAA ACTATTTGAA AGTTAAAAAG
GAATATTTAA AACTAAAAGC AAGAAGAGAT GTTAATAATG AAGAAATTAA TTTAACTAAT
ATAGAAAGTA AAGTAACAGC CTACGAGTCT AAATATGGTG ATTTTGCAAA TACTTCAAAT
AAATTAACAG AAGAAAAAAA CAAGGAATAT AAAATACTTC TAACTGAAGA TGTAAAAAAT
AATCCAGAAA AAGAAAAGGA AAGAAGAAAA AATTATTTAA AAATAAAATA TGAAGAAATT
GAAGATGCTT ACGAAAAAGA TATAGCCCAA GATAAAATTG ATAATTTGCT GACAGAAGAA
CAAAAAACTA AATTTAAAAA TAGTAACTTC CTTAGTGATG GAGCAAAAGC CTCAGGTTCT
ATGGCAATGG GGGTAGGGGC TTTAACGAAT TCTGTAGAAT CTATTGTTAT TGGTAGAAAT
GCTATGATAG AAAATAAAGC CGCCAATAAT ACAGTACTGT TAGGGAATAA CACCTCATCT
GACACCGCTA ACGCTGTCGC ATTGGGTAAT TTTTCAGTTG CTGATAGAAA ACCTGAACCA
GCGAGCAAAT TAGATGCTAC CGGAAGATCT AACGCTTATA TTTCGGATGA CTTAGCAAAT
TTAATAGATG AACATCATGC GTATGCAGCG GTGTCTGTTG GTCGTTATGG CGGGGATTTG
AAAGATCTTA CGAATAATGT CCAAACAGAA AAAGACTACG ACAAATGGTT GAAAGATAAT
AAACATAGAA CAAAAGAGCC TGCCTATGAG CAAGAAAGAC TTGCAGAAAC AGCGAAGATT
AAAAAATTCG CCTTAAGAAA AATTACCAAC GTCGCACCGG GTACGAAAGA CACCGACGTT
GTTATCCTTG CTCAGTTAAA AGAAGCGATG AAAGGCGTTG GACCGGCTAC ACATTATGTG
TCCGTCAAGG GGACTGATCA AAGCGATGAT TCAAATTATA AGAATGACGG AGCGAAAGGC
ACAGATTCTG TTGCCATAGG GGTAAGTGCT GCTACTGATA AAAGTGCTAC CGCCGGCATT
GCGATTGGTA AAGGAGCAAA ATCTGAAGCA GAAAATGCGG TTGCTATAGG CAATGGTGCA
AGTATTGACG TACCGAATTC CTTTGTCATG GGGTCTAATA ATATTGTTAA CCAATCCAAT
AAAGAAACAA GAGGTGCGGT CGTTGTGATC GGTAGCGGAA TTAAATTAGA CGAATCTAAG
AGTTCGATTG CTATCGGTGC GGTTTATTTG GAAGGTAGAG ATGGAGAAAA AGATGGCACT
GTCATCGAAA ACGCTGCTTG GACAGCGTCT ATCGGGAATA AAAACAAAAT CAAAAATGGT
ACGGATATTA TTGCCGTAGG GAATAATATT AAAATACAGG GTGATCTCAC TGCAGAAGAG
GAAAGTAAGT ACGCAGAGAG AATCCGTCAA AGACCAAGAC AAAAAACCAA AATAGAAAGC
GAAAAAACCG ACACTATCAA AAACTTTAAT ACAGAAGTTG TAGCGATTGG TAATGGGGCT
GATGCGATAA GAGCCAAAAA CAGTGTACTG ATTGGGGCAA AAACTAAAGC GGAATCTAAT
GCTACACAAG CTGTTATTAT AGGTTATGAA GCAACAGCGA AAGAAAATGC CGTCGGTGCG
GTCGTGATTG GACAAGGGGC GTCGGTAGAA ACCCTCGCCG GCGATTCCAT TGCCTTGGGA
CAAGGGTCAA AAGCGACAAA AAAAGAAACT GCACCTAGCA AGGCAACCTC AAGCAAAAAT
GTGAATTTTA CATGGACCGC CGGCATTGGA AACAATACAT CCGTTGTCAG CCTAGGCGAT
ACCAGCAAGG AAAGACAAAT CAAACACGTC GCTGCAGGTG AGGTAACGAA AGCGTCGACC
GACGCTATCA ACGGAAGTCA GCTTTACGCT GTCGCTGATG AGTTTTCTAA ATTGGCTGTT
GATGTATTGG GTGCGGAAGC AGATACAACG AGCGGGTTTA AGAAATCCAC ATTTAACCAA
TTAATCGGTG CAAACGGCTC GCCTGCTGCT GCGACACCAG CGACAGCAAA AACCTTCAAA
CAAGCGATTG ATGATAATAT CGCTAAAATT AACGAGGGCT TTAAATTTGG GGACGGCAAT
ACCGACGGAA CGCATTATCT CGGTTCAACA CTCAATATTA AAGCCGGCAA TATTGATAGC
GGTGCGTTCA CAAGTGATAA TATCAAAACG CACTATGAGA CAAATAACAA AAATATCCTT
ATCGGTATCA AGAAAGCTCC ATCATTTGAA AAAGTGACAG TAACTCAAGA TGTGACCGAC
AACAGTGACA AGATGGTCTT AACCACTAAA AACTATGTGG ATACAAAATT CAATAGTGCC
GGTTCTACAC TGAAATTTAC GGCAGACAAT GGCAATACGC AGACCTTAGC CAAAAATGGC
ACATTGCAGG TAAAAGGTAC TAGCGGAGAA ATCACCACGA CCGCCTCAAA TAACGATACA
GTCACTATTG CGTTAGATCA GACGCTTAAA GGCAAAATTA ACACTGCGGC TACAACAGCG
GCAACTGCGG CTCAAAAAGC AGAGGCTGCG ACGACCAAAG CGGATGCCAA TGCACAAGCG
ATTGCAACCG TTACTGATAC GGCAAATCAA AATAAAACGA CGATTAGTGC GGTAAAAGCT
ACAGCAGATC AGAATAAAAC AGATATTGCA AGTGCCAAAT CTGACATAGC GGAAAATAAA
CAAAGCATTG CAACTGTTAC TGAGACGGCA AATCAAAACA AAGGTGATAT TAGTGCATTA
AAATCTCAAA CAGAGACAAA TACACAAGCG ATTGCTCAGA AAGAAGATAA ATTGACGAAA
GGGAACATCA CGGCGAAAGC AGACGGACTT GTCGAGGTAA CAGGCGGAAC AGATGCGGTA
ATTGGCTCAG GCGTAACGAT CGGATTAAAT GAAGCAACGA AGACCAAACT AAGCTCAATC
GGGTCAGGGG CTATTGGTGA TACTACTACA ACCAATAGCG ATAAAACGGT TACCGGTCAG
ACGGTTTATA ATTATTTACA GAACTTCTCG CAAACACTTA CGCTCTCTAC AGATGGAACA
GCGGACAGCG GAAGCGTTAA TTTAAAAAAC GGCAAGTTGC ACGTTACAGG ATCTGATGGG
GTCGGCGTGG ATATTAGCGG TAGCAAAATC ACCGTAAAAC TTGACGACAC GACGAAAGGC
AAAATTAATA ATGCCTTGTC AGGAAGCGAT GCTGACGGTA AATATGCAAA AATTGATGGT
TCAAATATCA CAAAGAGTTC TTGGAGAACG AAGTTAGACG TATATACCAA ATCTGAAACA
AATAGTGCGG TAGAAAAAGC CAAAGAAACG GTTACCAACG GAACAGGAAT TAAAGTTGAC
TCATCAGATT CAACAAACGG CGGTAAACAG TTTACCGTTT CGTTGGATAG TGATACCCAA
ACAAAATTAG GCAATATTGG CACTGGAGAG GTAACAGCAA ACAATGATAA AACTGTAACC
GGCGGAAAAG TACATACGGC TATTGAAACT GCGAAAACAA CGTTAAATGC GGCGATAAAT
GGCAAAGCGA CTAAGAGCTT GGATAATATT GATAGTTCAG GTAAAACCGT TATCAAAAAT
TTAACAAAAG TTAAACACGG TAAAAATGTA ACGGTAGATA GCTCTATAGA GGGAGATGCT
CAAGTATTTA CGATAAATGC AGTAGATACT TCTGCAAAAG TAACCGCAGA GAGCGGTTCG
AAAATCCACG TAACTACTAC TGAGGGTACT GAAGGCAATA CAACCGTAAA AGAATTTAAG
CTGGATTTAA CAGAAGAGAC AAAAACGAAA ATTGACAATG CTTTAAGTGC ATCAGATGCT
GCCACAAATT ATGCCAAAGT GGACGCTTCG AATATCACAG ACGGTAGTGC TTGGAGAACA
AAATTAGATG TATATTCTAA AGCTGAAACA ACTACTGAAA TAAATAAAGC CAAAGAAACT
GTTGCAAGTG GAGATGGAAT TACGGTTACA TCCACTAACG GTGATGCGAA AACATTTACG
GTAGCATTAA GTCAAGAGAC TCGAACAAAA TTAAATAATA TCGGAACCGG AAAAGTAGAA
GCTAGTAATG CTCATACTGT AACAGGCGGT ACGGTACATA CGGCTATTGA GAAGGCAAAA
AACGCCTTAA AAGGCGAGCT TGCAAATACC GCCACATTTG GGTTACAAGG TAATGATAGT
CAAGATGTTG TTACCAAAAC ATTAAATAAT ACTCTAAAGA TTACAGGAAG CGAATCTGCT
AAGGCAGATA AGACAAATAT CTATGTATCT AAAAATGCGA GTAATGATGG TCTAAACATC
GAACTTGGAG AAAATTTAGC CGGTATTAAA AAGATTTCAA ATGGTGATAC AGAAATTACT
ATCGATACCG ATGGAGTTAA GATAAAAAGC GGTAAAGATG GTGCAGAATC TAAAATCACC
CTAAGTGCTG ATCATATACT ATCAGATAAG GATATATATG TTGGACCAAA AGATGATAAA
GGTTCTAACA AACTAGTTAA GCAATCAGAC GTAGATGGAC TCTCTATTAC TTTTGAAGAT
GAGAATAGCT CAACAGAGTC TCAATCAAAC GGAACACAAA ATGTTAAAGT TGGTAAGAAA
GTCATTTATG GTAAATCAAG TGAAATTACA CCAACAGTTT CTGCTTCGGG TGATGATGCT
AAAGTAACAT TTAGTATTAA TGATAAATCT ATATCTGAAA GCAAGTTAGA GGATACGCTA
AAAGGCAAAA TAGATAAAAT AGCAACCAAT GAAAGCGGTA TTAGTAATTT ATCTGCTCAT
AAATTAACAT TTAAAGATGG CGGAACAAGC TCATTTGAAA GAGCAAATAA TACGAGCAAG
AATGTCGTAT TTAAAGGTAC AGGTAACTTA AATGTTAAAT TAGCTACTGA AACAAATAAT
GATACAGGAA CATTTACGAT AAGTGTTAAT GAAACAGATT CTATTAACGA AAAAGCGGGA
ACAGATAAGC CAGATACAGA ACCAAAAGGT AAACACGAAA ATAAATTAAC AACAGAAAAA
GCCGTTGTTG ATTATGTGCA AGGGAAAATA TCTGCTATAC AAACTAATTT AGACAACACA
ATAGGCACTG GTACTATTGA TGATGCGGAT ACCAATCAAA AAACCGTTAC AGGGCAGACT
GTTAAGACTT ATGTAGATGG TAAGACCTTT AAATTAGCCG GAAATGATAG TGATGCTGTA
TCAAGTCAAC TTGATGGAAC GATTCACATT AAGGGATCAA ACAAAACATA TCAAGCAAGT
GGAGATACAG AAAAATCAAA TATTTATGTT TCTAAAGTTA CTGAAACTGC TCAAGGTGCT
GATCCTAAGG AATATTTACA AATTCAATTA GCTGATGCTT TAACCGGTAT TAAAACTATA
GAATTAAAAG ATGACAAAGG AAGTCGTTCA AAAACTATAG CGATAAATAA TAAAGGGGAC
TTAGTTGTTA GAAGAGAAAT CAATGAAAAT AATCAACCGA AAAGCGTTGA AAACGAAATC
ATTACTTCGG AAAATATAGG TGATCAGACT ATTACTTATA AATCTAATGG TAGTAATAAA
AATGGAAAAA CAGACAAAGA ATTTACAGTT AAATTATCAG AAGGATTAGA TTTTACTAAA
ACTGATAATA TAGCCGTAGA AGTAAGTGAT AATGGAGTTG TTAAACATAC ATTAGTTAAT
AACCTTACTC AAATAGATAG TATTAGTAGT GGCAAAGATG CCAAAGGAGC AAAAATCTCT
TTTACATCTG ATACTGATAC GACCGCTGAT CCTGCTAATA ATGATGTTAA GAATAAAATA
GAATTTACAA TAGGTACTCA ATCAACTTCC ACACAAGGTC AAGATAGTGC AACAGTAACT
TCTACAACAT TTAGCTTTAG TGAGAAAGGG TTAGATTTAG GTAGCAAGCA AATCAAAAAT
GTGGCAAGTG GCATAAGTCA AAATGCGTCA AGCGATGTCG GAGATGCTAC AAGTAATGTT
AGTGATGTAT TAACTGGAAC TTCTATAGAG AATATAAAAA ACAATGCAGT AAATGTAAAA
GATTTATCCG ATGTCGCTAA GGCGATTATT GATAAAGGGC TGACATTTAA AGGAACTGAG
TTAAAAGATG ATAAAACTCA ACAACAAGCT ACTATAAGTT TAGGTAGTAC CATTGAAATA
GATAGCTCTG AAAGTAAAAA TAGCAAGGCA TCAGATGGTA AAGATGCTCC TAAAGAAAAA
GATATAAAAG TATCAGTTGG TGCTAACAAA ATAACATTAG CATTAAATAA ATCTGAAGCA
TTATCTTTAG AGGACGAGAG AGTCGTTACA TCAAAAGCAG TAAAAACAGC TCTTGATGAT
ATGCAAAGTG AAATTAATAA TAAAATTTCA AAAACAGATG CGAATAATCC ATTTGAAACG
ACTTACAAAG ATAAAGATGG TAAAGAGCTA GAAAAACACG GAGATAACTT CTATAAGAAA
GACGAAATAC CGGACGGTGC AAAATATAAT GCTGATAAAG GTAAATTTAC TGATAAAGAT
GGTAGAGAAT TACCGCAACA ACCTACAGCG ATAGCGAAAA ATGAGGTTAA AGAGGAAATT
AACCTAAAAG GTGATACACC TAAGAAAATC GCCAATGTTG ATTCCGGTTT AGGGCTGGAA
AAATATCAAG AGCCTAATAC CGATAATTTA GACGATGAAG CTAAAAATAA AGAGATAGCA
AAAGCTAAGG AAAATCATCA AAAGGCGAAA AAAGACGCTA TTGATAAGTT GTTAGGAAAT
AACGCTGATA AAGCTAATAA CATCAAAGAT AGTGATCCAA TGTTAAATAA TGTTGCTACT
ATAAGAGATC TGCAAGCATT AGGACAAGCA GGACTGGATT TTGCCGGCAA TGATGCTACT
TCAGTACATA GAAATCTTGG GCAAAAACTG GTAATTAAAG GTGATCAAGA TGCACCTGCT
GGATCTTTTG AATCTGCAAA AGATAATATT AATGTAGCTG TGGAAGGAGA AGCATTAGTA
GTACAACTAT CTAAAAACCT AAAAAATCTT ACCTCAGCAG AGTTTACCTC TGAAGAAACA
AAAGATGGAA CCAAACAAAC ATTTAAGACA ACTATTAACG GCAAATCTAT TGCCTTAGAA
GGCAAAGATG GCACTACCAC AAATATGAGC AGTGATAAGA TTACGTTGAA AGATAAAGAC
GGTAACACTG CTGACATGAC TGGTAACGAA ATTGCCCTAA AAGGCAAAGA TGGTAAACCA
ACTATTGCTA AATTAAACCA AGACGGCTTA ACCGTGGGCG ATAAAGACGC AACTAATGGC
GACAAAACCC ATGCCGTGTA TGGCAAAGAC GGCTTAACTG TCCATGGCAA AGACGGTAAA
AGTGCGGTGA GCTTGAAAAT GAAAATGTCG GAGAAAAACG GTAAAAGCGT TCCAACCCTT
GAATTTGCCA AAGGAGCTGA CGATAAAGGC ACAGGCGTGA TTACAGGCTT GGCGGATTTA
ACGAAAGATT CCGATGGCAC AAGTGTGGCG AATAAAAACT ATGTGGATAC ACAATTGACT
GCTGCGAAAG CCGAAGCGGA TAAAACTGCT ATGAAGTATG ATGATGAAAA CAAAACATCC
ATCACTTTAG GCGGTAAACC GCAAGCAGGC GGTACTGCAC CTTCACCTGT TATGATTGAT
AACCTCAAAT CCGGTCTGGG TATTGATGAT ATTCAAAGTG GTGGCGTGGC TTCAATCAAA
CAAGGCAAAC AAGGTGAGTT GGTAAAACAA CTCGTAGCAG GTGAGCTTGA TACCACGAAA
GACGCTAGCG GTAAAGCAAA AGACAATCTG CATAAAGCCG TAAATTTAGC GGACTTAAAA
GCCGTTGCAC AAGCCGGATT AAACTTTGCC GGCAACGATG GACAGGATAT CCACAAAAAC
CTCAGCGAAA AACTGGAGAT TGTGGGGCAA GGCTTGGATA AAGACAAAGT TACTGCATTC
AAAGGCACAA ACGGCAATAT TGCGGTGAAA ACGGATAATG GTAAATTATC CATCTCCCTC
AATGAAGCCT TGACCGGCTT GAAGTCTGCC GAGTTTATTT CTGAAGAAAC AAATTCTGAT
GGAACTACAC AAAAAACTAA AACAACTATT AATGGAAAAG GAACAACAAT AGTTGAATTA
GATAAGGAGG GTAATACCAA AGAAAATGGT AAATCAGCGT CTTACACATT GGATAAAGTC
GAGTTAAAAG ACGGCAATAA ATCCAATACC TCAACGGCGG AAGGCAATGC GATTGTCAAT
GGGGATAAAA TCCATACGTC AACGGCGGAA TCCGATTTGC TTTTAGATAA AGCAACCGGA
GACAGCAATA CCAGAACCGC AACAGCGAAT GTGATTGCGG ATAAGGCAGG CAATCAATCT
GTGCTTGATA AAGACGGCTT AACCGTGGGC GATAAAGACG CAACTAATGG CGACAAAACC
CATGCTGTTT ATGGTAAGGA CGGCTTTACC GTGAAAGGCA AAGACGGCTC AACTGAAGCA
ATCAGCCTGA AAGTTACCGA AAAAGACGGT AAAGAAACAG CGACCCTTGC TTTTGGTAAA
GGTACTGATG GCAAAGGCAC AGGCGTGATT ACAGGCTTGG CGGATTTAAC GAAAGATTCC
GATGGCACAA GTGTGGCGAA TAAAAACTAT GTGGACGAAA AAGTCGAGAG CATTAACGAC
AAGTTGAAAA ACAACTTAGG GTTAAAAGAA ATCGACAATC CGGACTATGT TAAAGCGGAA
GAAGACTTAG CGAAAGCGAA AGAGGCATTG GAAAAAGAGA ACAATCTTGC GAAAAAAGCC
GAATTGCAAA AAGCGGTGAC CGATGCCGAA GCGAAAGTTA ACGAGTTAAG CAAAAACAAA
AAACTGATAG TGACACCGGA CGGACGGGAC GGCAAATCCT ACTTGGAAGC AGGAGCAGCA
GCAACCCACG GACCGACAGA CAAAGACGGG CTTAACGGTA AAAATGCTAC TGAAAAAGTC
AACGCTTTGC GTAACGGCGA AGCGGGAGCG GTGGTGTTTA CCGATAAAGA CGGTAATCGT
CTGGTCAAAG CTAATGATGG TGAGTACTAT AAAGCAGATA ATGTTAATGC TGATGGAACG
GTTAAAATTG CAACTGATGG AAAAGATAAA CCAAAACCAG TTGAAAAACC ACAACTTTCG
TTAGTAAACC ACGAGGGCAA TACAACGACG CCAACTGTCT TAGGCAACGT GGCGAGCGGT
TTGGATATTA ATGCCGAAAA AGTGAAACAG GCAGAGAAAG AGGTGAAAGC CAAACGCTCT
GAAGCTGAAC GTAAAGCGAC CATGTTAAAA GCGAAAGCTG CTCTTGTAGA GCAGAAAGAG
GCAGAAATTA CCGCACTTAA ACAGGAAATT GAAAACTTGT CAGGTGATGA AAAAACGCAA
AAAGAAGCAG AGTTGAAGGC AATTGAAGCC GAGTTATCTC AATTCAATGA CGAATTGGCA
ACGGCAACAA AAGACTTGAA AACCGCCAAT GATGCGTTGA AAACCGCCAA TGATGAGCTG
ACAAATTTCA CAGAGAATAG AATTGGCAAC TTAGTAAAAG GTGAAAATAT TAACCCTACA
AACGGGGCTA ATATTGGTGA TTTACAGGCA GTAGCAAGAG CGGGCTTAAA TTTTGAGGGC
AATGACGGCG TGCCTGTCCA TAAAAATTTA GGCGAGAAAT TGACCATCAA AGGCGAGGGA
ACATTTAACA GCGACAGCAC CGCCGCCGGC AATATCAAAG TGGAAATGGC ACAAGATGGC
AAAGGCTTAG AAGTTAAATT GTCTGACCAG TTGAAAAACA TGACTTCGTT TGAAACTCGT
GAAGTGGACG GTAAGAAATC CCGCTTGGAC AGCAACGGTT TACAAGCGAG CAGTCCAGAC
AGCGAAACTT TTGTAAATGC ACAGGGTACC CGTATTACGG GCAAAGGAAA TCATGCCGGT
CAATCGGCAA GTTATACTCT CGATGGCATT AAGCTACAAG GCAAAGCCGG TGAACCGTCA
TTGATGGCAA CCCATGCAGG GTTGATGGTA TCAGGCAATA ACGGTAATAT CGTCATCAAT
GGTAACCGTG GGGAAATTTT AATTCCGGAT GTCAAACCTG ATGCAAGTGG ATTTGTTGCA
ATCAATAAAA ACTATGTCGA TGCGAGAAAT CACGAGTTAC GTACACAAAT ACATCATGCC
GATCGCCGAT TGCGTGCCGG TATTGCTGGA GCGAATGCGG CAGCAGCATT GGCATCAGTC
TCTATGCCTG GCAAATCCAT GGTTGCGATT GCGGCGGCAG GGCATGATGG CGAAAGTGCG
TTGGCGATTG GTTATTCTCG CATCAGCGAT AACGGTAAGG TTATGCTGAA ACTGCAAGGT
AATAGCAATT CCCAAGGCAA AGTATCCGGT GCCGTGTCGG TAGGTTATCA GTGGTAA
 
Protein sequence
MNKIFKTKYD VTTGQTKVVS ELANNRQVAS RVEGSSVAEK CGVFLGNFLG AFKVLPLALV 
MSGVFSSIGY GATVFLDNNK TSSIGSDNKS TAVWSDEGVG KFNNTLKDKL KKSESILLSH
IDNTAGADTS LTNKDFFRTV VIGSRTVAGG NDATAIGYRA IVGKNKKETN TKDSHQGTAV
GYRAFANGTE SVSMGNDTIA WGDSSIAIGS DNANGKEAKN LSKEIYKLFH ALRDDFNYTS
EYAAVNHKTL RQGKEKSHDM LDNDGNLMAK DGFYIGYNGG YYFREPDESG QLKNENLKYM
LKDGILYEKI DQQGFSKIEN KKRHDEIMNS LKNNEDYKRY EQYMAILDAD YQNYKVTPNS
KKTHTWARGN NSIAIGGRSI AFGNNSTALG TASVAARDYA TAIGSDTIAY GKQSLAVGNR
AIVYSDGSVG IGHKVQALNA GAMVYGRESF AGGTGSLAIG DRTFANVTMG DDFKTTVEDL
YLKGDTDTIQ NSHKEKFYPK TVLQDGTEEN KAEQSRDSEG RYNQGAVALG SYATALGDNT
LALGRFAYAK EDSAVALGRF AFARNKESLA LAPFSRAMGE RSVALGHQSL VEAKDSMAIG
VGARVSGEIP EELKAATNKN IKVPNEDSMA IGNKAEAYFP KSIALGVESK TDYSRKEMAQ
DAWAPKHAIS LPSSQKIGYL SVGGKNAERR IVNVAPGASD TDAVNVSQLR ALEEAILYGS
SLEEEEAYNG VKYLSIKGID EYKKIVRKEQ DYKNYLKVKK EYLKLKARRD VNNEEINLTN
IESKVTAYES KYGDFANTSN KLTEEKNKEY KILLTEDVKN NPEKEKERRK NYLKIKYEEI
EDAYEKDIAQ DKIDNLLTEE QKTKFKNSNF LSDGAKASGS MAMGVGALTN SVESIVIGRN
AMIENKAANN TVLLGNNTSS DTANAVALGN FSVADRKPEP ASKLDATGRS NAYISDDLAN
LIDEHHAYAA VSVGRYGGDL KDLTNNVQTE KDYDKWLKDN KHRTKEPAYE QERLAETAKI
KKFALRKITN VAPGTKDTDV VILAQLKEAM KGVGPATHYV SVKGTDQSDD SNYKNDGAKG
TDSVAIGVSA ATDKSATAGI AIGKGAKSEA ENAVAIGNGA SIDVPNSFVM GSNNIVNQSN
KETRGAVVVI GSGIKLDESK SSIAIGAVYL EGRDGEKDGT VIENAAWTAS IGNKNKIKNG
TDIIAVGNNI KIQGDLTAEE ESKYAERIRQ RPRQKTKIES EKTDTIKNFN TEVVAIGNGA
DAIRAKNSVL IGAKTKAESN ATQAVIIGYE ATAKENAVGA VVIGQGASVE TLAGDSIALG
QGSKATKKET APSKATSSKN VNFTWTAGIG NNTSVVSLGD TSKERQIKHV AAGEVTKAST
DAINGSQLYA VADEFSKLAV DVLGAEADTT SGFKKSTFNQ LIGANGSPAA ATPATAKTFK
QAIDDNIAKI NEGFKFGDGN TDGTHYLGST LNIKAGNIDS GAFTSDNIKT HYETNNKNIL
IGIKKAPSFE KVTVTQDVTD NSDKMVLTTK NYVDTKFNSA GSTLKFTADN GNTQTLAKNG
TLQVKGTSGE ITTTASNNDT VTIALDQTLK GKINTAATTA ATAAQKAEAA TTKADANAQA
IATVTDTANQ NKTTISAVKA TADQNKTDIA SAKSDIAENK QSIATVTETA NQNKGDISAL
KSQTETNTQA IAQKEDKLTK GNITAKADGL VEVTGGTDAV IGSGVTIGLN EATKTKLSSI
GSGAIGDTTT TNSDKTVTGQ TVYNYLQNFS QTLTLSTDGT ADSGSVNLKN GKLHVTGSDG
VGVDISGSKI TVKLDDTTKG KINNALSGSD ADGKYAKIDG SNITKSSWRT KLDVYTKSET
NSAVEKAKET VTNGTGIKVD SSDSTNGGKQ FTVSLDSDTQ TKLGNIGTGE VTANNDKTVT
GGKVHTAIET AKTTLNAAIN GKATKSLDNI DSSGKTVIKN LTKVKHGKNV TVDSSIEGDA
QVFTINAVDT SAKVTAESGS KIHVTTTEGT EGNTTVKEFK LDLTEETKTK IDNALSASDA
ATNYAKVDAS NITDGSAWRT KLDVYSKAET TTEINKAKET VASGDGITVT STNGDAKTFT
VALSQETRTK LNNIGTGKVE ASNAHTVTGG TVHTAIEKAK NALKGELANT ATFGLQGNDS
QDVVTKTLNN TLKITGSESA KADKTNIYVS KNASNDGLNI ELGENLAGIK KISNGDTEIT
IDTDGVKIKS GKDGAESKIT LSADHILSDK DIYVGPKDDK GSNKLVKQSD VDGLSITFED
ENSSTESQSN GTQNVKVGKK VIYGKSSEIT PTVSASGDDA KVTFSINDKS ISESKLEDTL
KGKIDKIATN ESGISNLSAH KLTFKDGGTS SFERANNTSK NVVFKGTGNL NVKLATETNN
DTGTFTISVN ETDSINEKAG TDKPDTEPKG KHENKLTTEK AVVDYVQGKI SAIQTNLDNT
IGTGTIDDAD TNQKTVTGQT VKTYVDGKTF KLAGNDSDAV SSQLDGTIHI KGSNKTYQAS
GDTEKSNIYV SKVTETAQGA DPKEYLQIQL ADALTGIKTI ELKDDKGSRS KTIAINNKGD
LVVRREINEN NQPKSVENEI ITSENIGDQT ITYKSNGSNK NGKTDKEFTV KLSEGLDFTK
TDNIAVEVSD NGVVKHTLVN NLTQIDSISS GKDAKGAKIS FTSDTDTTAD PANNDVKNKI
EFTIGTQSTS TQGQDSATVT STTFSFSEKG LDLGSKQIKN VASGISQNAS SDVGDATSNV
SDVLTGTSIE NIKNNAVNVK DLSDVAKAII DKGLTFKGTE LKDDKTQQQA TISLGSTIEI
DSSESKNSKA SDGKDAPKEK DIKVSVGANK ITLALNKSEA LSLEDERVVT SKAVKTALDD
MQSEINNKIS KTDANNPFET TYKDKDGKEL EKHGDNFYKK DEIPDGAKYN ADKGKFTDKD
GRELPQQPTA IAKNEVKEEI NLKGDTPKKI ANVDSGLGLE KYQEPNTDNL DDEAKNKEIA
KAKENHQKAK KDAIDKLLGN NADKANNIKD SDPMLNNVAT IRDLQALGQA GLDFAGNDAT
SVHRNLGQKL VIKGDQDAPA GSFESAKDNI NVAVEGEALV VQLSKNLKNL TSAEFTSEET
KDGTKQTFKT TINGKSIALE GKDGTTTNMS SDKITLKDKD GNTADMTGNE IALKGKDGKP
TIAKLNQDGL TVGDKDATNG DKTHAVYGKD GLTVHGKDGK SAVSLKMKMS EKNGKSVPTL
EFAKGADDKG TGVITGLADL TKDSDGTSVA NKNYVDTQLT AAKAEADKTA MKYDDENKTS
ITLGGKPQAG GTAPSPVMID NLKSGLGIDD IQSGGVASIK QGKQGELVKQ LVAGELDTTK
DASGKAKDNL HKAVNLADLK AVAQAGLNFA GNDGQDIHKN LSEKLEIVGQ GLDKDKVTAF
KGTNGNIAVK TDNGKLSISL NEALTGLKSA EFISEETNSD GTTQKTKTTI NGKGTTIVEL
DKEGNTKENG KSASYTLDKV ELKDGNKSNT STAEGNAIVN GDKIHTSTAE SDLLLDKATG
DSNTRTATAN VIADKAGNQS VLDKDGLTVG DKDATNGDKT HAVYGKDGFT VKGKDGSTEA
ISLKVTEKDG KETATLAFGK GTDGKGTGVI TGLADLTKDS DGTSVANKNY VDEKVESIND
KLKNNLGLKE IDNPDYVKAE EDLAKAKEAL EKENNLAKKA ELQKAVTDAE AKVNELSKNK
KLIVTPDGRD GKSYLEAGAA ATHGPTDKDG LNGKNATEKV NALRNGEAGA VVFTDKDGNR
LVKANDGEYY KADNVNADGT VKIATDGKDK PKPVEKPQLS LVNHEGNTTT PTVLGNVASG
LDINAEKVKQ AEKEVKAKRS EAERKATMLK AKAALVEQKE AEITALKQEI ENLSGDEKTQ
KEAELKAIEA ELSQFNDELA TATKDLKTAN DALKTANDEL TNFTENRIGN LVKGENINPT
NGANIGDLQA VARAGLNFEG NDGVPVHKNL GEKLTIKGEG TFNSDSTAAG NIKVEMAQDG
KGLEVKLSDQ LKNMTSFETR EVDGKKSRLD SNGLQASSPD SETFVNAQGT RITGKGNHAG
QSASYTLDGI KLQGKAGEPS LMATHAGLMV SGNNGNIVIN GNRGEILIPD VKPDASGFVA
INKNYVDARN HELRTQIHHA DRRLRAGIAG ANAAAALASV SMPGKSMVAI AAAGHDGESA
LAIGYSRISD NGKVMLKLQG NSNSQGKVSG AVSVGYQW