Gene HS_1085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1085 
Symbol 
ID4240585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1203500 
End bp1214713 
Gene Length11214 bp 
Protein Length3737 aa 
Translation table11 
GC content38% 
IMG OID638104647 
Productlarge adhesin 
Protein accessionYP_719297 
Protein GI113461228 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA TCTTCAAAAC CAAATACGAT GTAACAACAG GTCAAACGAA AGTTGTGTCT 
GAATTAGCGA ATAACCGTCA GGTGGCGAGC CGTGTTGATG GGTCGTCGGT CGCCGAGAAG
TGCGGTGGGT TTTTCGGCGG TATGTTAGGG GCGTTTAAGG TCCTGCCGTT GGCGTTGGTG
ATGAGTGGGC TGTTATCGAG TGCGGCGTAT GGGTATGATG TTTGGGTTCA TAGTAGCAAT
GTTACAGATG GACAAACGAC AAGCAACCTA AACCGTAAAA CGCAAATTAT GTATGGAGCA
AATAAGGCTT TTGGAAAGGT TGATCCATAT ACACCATCTA CTCTTCCCGA TGAAGCGGTT
ATCCTCTCGG ATTATGCTAA TAAAACAGGA AAAAATGCTT CGTATAGCAA TAAAGACTTT
AATCAAACTG TTGTGATAGG GTCTCGAGCA GTCGTGGGCG GTGATAAAGG AACCGCTATC
GGTTTTGGAG CCGCTACCGG AGAAAATACA GCGACTTCAG CTAACAATCC AACCGTTACG
TATGAAGAAG CTAAATCTAT TCTGCTCAGA GGCGAAGATG GTTCGTATAA AGACTTACGA
GAACAGTATT TTGAATTTGC CACTAATCCT AAGTGGAAAC CTACAACAGA AGAAAAAAAA
CGAAACATTA TTAACCAAGT GATAACAATG GAGAAAGAGA AACGCAAAAA AGATCCTAAA
ATAGCTTTAA ATAAAGAAGG TACGGCTGTC GGTTATCGAT CTTTTGCACG AGGCGAAGAA
GCGACCGCTT TGGGTAACGA TGTGGTGGCA TGGGGAGATT CCTCTATTTC TATCGGAAGC
GATAATGCCG CAGGGCATGA TGCCAAACCG CTTTCTAAAG AGATGTTTAG ACTTTTTTAT
AATGTAAGAG ATGATTTTAA CTATACCAAA GAATATGCTG CCGTTGATTT TGAGATTATG
AGAGATGATA GCGGTAAGCC TATGCTTAAT GAGCAAGGAC AGTTAGTCGC TGAAAATAAT
CCTAGCTACC CAATATTCTA CAACGGGAAA TACTATTTTA GAAAAAATAA TCAAAATAAC
GATGAAGGAT ATTACGTCAT GGTAGGTGAA GATTTTTATT TCTCCAGACC AAATGCACAA
TATCTAGAAA AAATAGGTTC TGACACCCCG GGGTATGCCG ACAAAATAAA AGAGATAAAG
TCCCTAAACG ATTATAAAAG ATATGAACTC TATCTAAAAA AAGAAGGAGA GGCTTATCAA
AAATATTTAG CAGATCATAG AAATACCAAA ACACATACTT GGGCAAGAGG AAATAACGCC
ATTGCTATTG GGGCACGTTC GATTGCCTAT GGAGATAATT CAACCGCTTT AGGGACATTT
GCTGTAGCGG CTAAGGATTA TTCAACCGCT ATCGGCTCAA ACACCGTCGC CTTTGGTAAA
GGTTCATTGG TGATGGGTAA TAACAGCTAT GTCTATGCAG ATGGATCTGT AGGTATCGGG
AATAAAGTAC AAGCCATCGG TGCCGGATCT ATGGTATATG GTAAAGACTC TTTTGCAGGG
GGCTTAGGAT CGTTAGCGAT TGGTGATCAC ACCTTTGCCA ATGTGAAGAT GGGTAAGGTA
TTTAATGGTG ACGGGCTTGA TATTAATGGA TATAACTTAT TAGCAAAAGA TAAAAAATAC
CCTGAATACG CTAATGACAC ACAAGGATTA AAAAATAGCA AAATCACTAT TGAAGATTTA
TATCGTATCG GTGATACAGC AGCGATACAA GATATAATGA AAGAGCATAT TAATGCCCAC
ACAACCGTCC AAGAAGGAAC GGGCGAAGAA AAAGCAGAGC AATCTAAAGA TAGTCAGGGA
AGACATAATC AAGGTGCTAT TGCCATAGGT TCTTATTCTG TTGCATTGGG GGATAATGCA
ATCTCATTAG GTAGATATGC CTATGCGAAA GAAGATAGTA CTGTTGCACT CGGTAGATTT
GCGTTTGCAC AAAAAGAATC TTCATTTTCA ATAGGAAATT TCACAAGAGC ATTAGGTAAA
CAATCAGTTG CTTTTGGTAC TCATTCATTA GTAGAAGCTG ATGATTCTAT GGCATTAGGT
ATAAAAGCAA AAGTTTTAGC TGAAATGCCA ACCGCACTTC GAAGTGAAAA AAATAAGAAA
AATAGCAAAC CAGATTTTAC CATTAAAAAT GCAATGGCAA TTGGTAATGG TGCGGAAGCC
TCTTTTTCAG ATTCCATCGC ATTGGGCGTC AATGCAAAAA CTGATTATAC TCAAACTGAA
ATGGCACAAG GCGGATGGGC ACCTAAAAAT GCCATATCAA TACCTTCATC AGAAAGAATC
GGCTATTTGT CAGTGGGTGG AAAGAATGCG GAAAGACGTA TCGTAAACGT TGCACCGGGT
GCGAGCGATA CTGATGCGGT CAACGTATCT CAATTAAGAG CCTTAGAAGA AGCTATCCTG
TATGGCAATA CGTTAGAAGA AGATAATATT AATTCCGGCG TGAAATATGT CTCTGTCAAA
GGCTTAGATG ACTTAAAGTT ATTAGTCACA AAAGAACAGG ATTACAAAAA CTATATTAAA
ATTAAAAGAG AATATTTAAA ACTTAAAGCG AGAAAAGAGA TTGGTGAGGA AACAATTAAT
ACAGATGAAT TAGATGCAAA ATTGAAAAAA TATGAAACTA GATATGCTGA TTTTTCTGCT
AAATCATCAA AGTTACAAGA GGAATATGAT AACCCTGCCT ATAGTTTATT ATTCCCTAGT
AGCGATACTG AAGAAGATAA AAAAAGGAAA AGAAAAGACC GTTTAAAATC CAAAATAGAT
GAAATAGAGA AAGCATATGC TGATGATGAA AAAGTTAGTG TATTAACTAT GGCAGAAAAA
GAAAAAATAA ACAATTCCAA TTTTAATAGC GATAGAGCAA AAAAATCAGG TTCGATGGCA
ATTGGTGTAG GAGCTTTAGC AAATTCTGAC AACTCTATTG TTATAGGGCA AAATGCTAAG
ATTGAAAGTG ATAAAGCCCA TAATACGGTA TTATTGGGTA ATAATACATC ATCAAATTTT
GCTAATGCTG TTGCACTTGG TAACTATTCT GTTGCGGATA GACCTGCTGA AAAATTTACA
CAAGCCGAAC CTGAAGCTTA TATAAATGAT GAATTAGCAA AAGCTATAGG TGGTGAAACT
TATGCTGCTG TTTCTGTAGG TAAAAACGGT GGAAATATTA GTGAAATTCA AGAAATTGTC
AAAGCAAAAA AAGAATTTGA TAAGTGGGTT CAAGCAAATA AAAGCAGCCC TGATCAAGAA
AAATATACAA AAGAAAAAGC TGAAAAAGAA GCAACTTTGA AAAAACTTGC CTTAAGAAAA
ATTACCAACG TCGCACCAGG TACAAAGGAC ACAGACGTTG TTATCCTTGC TCAATTAAAA
GAAGCGATGA AAGGCGTAAA AAATGTAGGA ACTTTGCATT ATTTGTCGGT GAATAATCCT
ACTTCAAAAG GTTCAAATTA TGACAATGAT GGTGCCAAAG GAAGCAATTC TGTAGCCATA
GGTGTAGATG CTGGTACTGA TAAAAATGCT AGTGCTGGTA TAGCGATAGG AAAAGGGGCA
AGTTCAAAAG CTCAAAACGC TATCGTCATA GGGACAAATG TCAGTGTTGA TGTACCTAAT
TCTTTTGTGT TGGGATCTGA TAATAAGGTT GAAACTACTG CTGATAAAAG GAGAACAAAT
GATGCAGTTG TAGTAATGGG TAGTGGTACA ACAATTAAAA ACTCATGGGC AACTACAGCT
ATAGGTGCAG TAAAAGCTTC TAATGGTAAA CATATTAAAA ATGGAACCCC TATAACTGGA
GCATATATAG AAAATGCACC ATGGGCAACT GTTATTGGTA ATAAAAGTAA GGTGTATAAC
GGAACTGATA TCATTGCATT AGCGAATAAT ATTGAAGTAA ATGTAAATAA AGCAGATTTT
GATACCGATA AAAATGCTAA TGATAATTTA GTGATAATAG GAAATAAAGC AAAAGCAGCC
CGTGCTAAAG ATAGTGTCGT TATAGGTTCT GGAGCAAAAG CATTAACTGG AGATGAAAAA
AATCATGCTT ATGAAAAAGT TGAAAAGGCA GTAAGTATAG GCCAAGCAGC GGTTGTAAAA
GCAACTGGAG CGGTTGCACT TGGACAAGGT GCAACAGTAG AAACCGCCGC CGGCAACTCC
ATTGCGTTGG GACAAGGGTC AGAAGCGACT AAAAAAGAAA CCGCACTTAA TGAAGCTAAA
ATAGAGGCTA TTAATGTTAA ATTTAATTTC AAAAGTGGAG TGAGTGCTAA TGGTCAAGAA
AATAATAAAA AATCCGTGTT AAGCATAGGT AAGAAAAATA ATGAGCGTAT CATTAAACAC
GTCGCTGCTG GTGCGGTAAC GTCAGATTCA ACCGACGCTA TCAACGGAAG CCAACTTTAC
GCCGTCGCCG ATGAATTTTC AAAATTAGCG GTGAATGTAT TGGGTGCGGA AGTTGAAGAC
GCTAATAAAA CGGGATTTAA AAAATCGGAA TTTACAGCAT TATTAAGCTC ACCAAGTACA
AATCCTCCTC AACCACAAGG ACAAACACAA AAAACAGCAA TGACATTTAA AGATGCTATC
GAGGCTAACA TCGCTAAGCT TAATAGTGGC TTTGTTTTTG GCAGTGGAGA TGAAAGTGGT
GAACAAGGAA CACATTATCT AGGCGATAAA TTAATCATTA AGGCTGGGAA TATAACAGTT
AAAGATGATA CTTTAAACAA AGAAGATAAA TTTTTAAGTG ATAATATTAG AACTCAGTAT
GAAAAAAATA ATAAAAATAT CTTAATAGGT ATAAAAGAAA CACCATCTTT CAAAAATGTA
TTAATTACAG GAGAAATTCC TGAGGATGCG TCAGATAGTG CTAAGAAAAA TACTTATGAT
AACTATGCTG TCAACAAAAA ATATTTAGAC AAAAGACTAG AAAATGTGGG TTCTAACTTT
ACGGTAAAAG GCGACACGCC TAAAGATGGA AAAAACACAA GTCTTGAAAT TAATAAGGAC
AATAATGTCT TAACTATTAA TGGGGATAGT AATATTACAA CCGCCGTTGA TAAGGACAGT
AAAAAATTAA CCGTATCGCT TAATAAGGCT TTAACAGGGA TTACCTCTAT TGGAAAAGAT
GATAAGGCTA AAATAGAATT TAGTAGTGCT GATTCTACTC ATCATATTAC ATTTACAGTA
GGCGAAGGTA AAAATAAGGC AACTGTTAAA TTAGATAAAG ATAGTCTTGA TTTAGGTAGC
AAGCAAATTA AAAATGTTGC AAGTGGAATA GGTTCAACTA CACCAAGCGA TGGCGGAGCT
ACCACAAGTA ATGTTAATAA TGTATTAACT GGAACTTCTA TAGAGAATAT AGAAAACAAT
GCAGTAAATG TGAAAGATTT ATCCGATATC GCTAAGGCAA TTAGTGATAA AGGAATACAA
CTTAAAGGAA ATGGAACTTC TGCTGATACA AAAACATTGA AGTTAGGAAG TGTGTTGACT
GTTGATAGTT CACAAAGTAA AAAAACAAAT GCTAATGAAA AAGATATAAC TACAAAATTA
TCTTCTGACA GTAATGGAGC GTTAACTTTA ACATTAGAGT TAAATAAAGC GACTTCAATT
GATGGAAACG ATGAAAGAAT AGTTACATCA AAAGCGGTAG CAGATAAATT AAAAGATTTC
GCTACTACAG AAACATTAGA AGAAGACTTT TTAAGAATCA CTGGAGAAAA TATTGGCGAT
AAACAAAAAG AATTTGGATC TAATGTGGGA ATTAGTGAAA TAAAACTTGA TGATGAAGAA
AAAACAGAAT TAGTACAGGC TCAAGCACTA ATAGATTATC TAAAAGGCAG TGGAGATAAA
TCTGTAAAAG TCTCTGATTC TTCAAAAACA GTTGCAGAGG GAGAAGGCTC TATCTCTATA
GGATACGATG CAAAATCACA AAATGAAGGC TCAATAGCTT TAGGATATGG AGCCAATGCA
TTAAATTCAG GGTCTATATC AATAGGACAA GGCTCAAATG TATTAGGAGT AAATTCAATT
GGTTTAGGTA AAGATAATAA AGTTAATGGT AATTTCTCAT TTGTAGTAGG AGATGAAAAT
GAGATAGGTT CAAAATCAAC ATCAACTTAT GTAATGGGAT CAAATAATAA TATAAGTGGT
GAAAGAAATA TATCTATTGG TTCTAGTAAT AAAATAGAAA AAGATGAAAA TATATTATTA
GGATCTAATA ATGAAGTATC TGGAACTGGA AATATATTAT TAGGCTCACG TATAAATGTT
GGTGAAGATA TTAAAGATGC AATAGTATTA GGAAATCAAT CTGTTGCGGT ATCTAATGCG
TTATCAGTAG GAACACCTAG AAATAGAAGA CGCATAGTAT TTGTAGGCGA TCCAGAGCAA
GATTATGACG CCGTCAATAA AAAATATGTG GATAATTTGT TAGCAAACAA AGCCTCTGCA
TTTATGGTAA CTTCAGATGA GGGTAATACT GATAATATCA GTGGGACTTT ATCTATAAAA
GGTGCAGAAG CTGAAGGTAC TGCAAATAAC AAAAAACATC AAAATATCAC TACAAAAGCT
AGTAAAAGTG AGTTAATAAT TGCACTTAAT AAGGACTTAA AAAGTATTGA GTCGATTGGT
AAAGATGACA GCAACGCTTT AGTGTTTAAA AATGATACGA GTAACACAGA TGGATCAAGT
AGCACGAACA TAGCAGAGTT AAAAGTCGGT GGACATGCGT TAACTTTTAC CCCAGAGAAT
GGAACAACTG ACACGAATAA AAAAGTTAAA ATCTCAAATG TCGCTGGTGG ACAGATTGCA
GAGAATTCAA CTGATGCGAT TAATGGTAAA CAACTTCACG ATTTAGTCGG TAATTTAGGG
CTTTCTGTTG AAAGCGATGG TAACTTTACC GCACCATCAT TTACTAAAGT TAAGGGAGAT
GATACAACTT CTACCGTTTC AAAAGATAAA TACACGACCT ATAAAGATGC GATTAATGGG
CTAATTACAG CGGTCAATAA AGGTGTTACC TTTAAAGGCA ATGATGGCAC TAACAGTTCG
ACTAAATTAC AATTAGGCGG AACATTGACG ATTGATAGCT CGCCTGTTAG CAGTAGCGAT
ACGACTGGAG CAAATAGTGC GACTGTTGAA AAAGACATCA CGGTAACATT AGCACCATCT
AATGGTAGCG ATCCTCAAAG TGCGGGTACA CTTACGCTGA AATTAAATAA AGCTGATAAA
GTGGAGGAAA ACAACGGAAA AGTCGTTACC TCAAAAGCAG TCGCGGACGC GTTGAAAAAT
TATACCAAGA CTACTGACTT AGTTGATAAG TTAGGCACGG CGTATTTAAA AGTAGATGGC
TCGAATATTG ATGATAAACA AGCTGACTTT GGAAAAAATG TCGGTATTGG CAAAGTAAAC
CTTGAAAATG GTAAAAAAAG CACAACGGAA TTAGTTCAAG CACAAGCATT AATAGATTAT
CTAAAAGGTA CAGGAGATAA ATCCGTGAAA GTGTCGGATT CACCAACAAC GGAAGCGTTG
GGGGATGGTT CGGTATCTGT AGGACATGGT GCGATATCGA GAAATGAAGG ATCTATTGCC
ATGGGGTACG GAGCAAACGC ATTGAATACC GGTGCAGTAT CAATAGGTCA AGATTCCAAT
GTGCTTGGAA TGAGTTCGAT CGGTATCGGT AGAGAAAACA CGGTACGAGG CAACTTCTCA
TTTGCCGTGG GGGATAATAA CGTTATTGAG AAAGAACAAA ATTACGTAAT GGGATCGGAC
AACAAAATCA CCGGTAAACA GAATATTTCA ATCGGTTCAA GAAATGAAGT TGAAGGAAAC
GAAAACATCA TCTTAGGCTC GAATATCAAG GCAGATGATG AGATACATAA TGCCATTGTA
TTGGGGACAG GTTCGCTGGC GAAGTCCAAT GCATTGTCAG TCGGGTCAAT GCGACATAAG
CGGAAAATTG TGTTTGTTGC CGATCCGACG GACAACTATG ACGCCGCCAA TAAAAAATAT
GTGGACGAAA GAGGGCTTAA ATTCAAAGGA AATGACGCAG GTCAAACAGA ACAACTGGTC
AACTTAGACA AGTTGTTGAC GATCGACAGT TCGGAAGCCA AGAAAGATGA TAGAGGCATG
AAAGAAAAGG ATATTAAAAC GAAAATCGAA AAAGAAGGAG ACAATGCTAA ATTAATCTTA
ACTTTGAATA AAGCGGATAA TGTTTCCGAA CACGATGAAA GAGCGGTTAC CTCCAAAGCG
GTGGCAGCGA AATTAATGAA CTATGCAACA AAATTTGCAG TTGAGGGGAA TAAAGGCTCT
AAATTTACCG TTAATAACAG CTTGAAAATT AAAGGTAAGG TGGGTTCAGG TTCGGCTAAA
ACACATCAAA ATATCACTAC CACAACGAAG AAAGGTGCTT CTACGCTAGA AATTGCTCTC
AATTCTGATT TGAAAGGCAT TAAGTCTATT GGAGGTCGAG AAATCGGAGC TGGTAAGGGT
AAAAAAATCG CTTCTAGCAT TGAATTTAAC AAACAAGGAG CGAATGGTAC AAAACCAAAA
ACTAATGTTG TCATTGAATC CAATGGTGGA ACCTTTGTCT TTGACAGAAC CGGCTTGACG
TTAAATACCA AACAAATTAA AGGACTTGCG AGTGGGCTGG GCTTGCAAGC CGTAGCCGAT
GGACAAGGTA CCAATAATGA CGAGGCGAAG AAAAAAGCCG AAGAAGCCAA CCAAGCTATC
CTTGACAAGG TACTTGCCGG AAACCCTGAT ATGAATAAGA CAAACGCTGT CAATGTTCAA
GACTTGTCTG CGGTTGCCAA GGCTATTGTA GGGAAAGTCA CCGCACAACA CTCCGAAGCA
GAGAAAGTGG CGGTGAAATA TGATGATGAC ACAAAAACAT CCATCACTCT AGGCGGTAAA
GGCACCAATG GCACTAAGTC ATCCCCTGTT GCGATTGATA ACCTTAAATC CGGTTTGGGT
ATTGATGATA TTAAAGACAG TGGCATTGCT TCAGCTGCAC AAGGTAAACA ACGTGAGTTG
GTGAAAAAAC TCGTGTCAGG CGAGCTTGAT AAAGATTCAT CGCATAAAGC CGTGAATGTT
GCCGACTTAA AAGCTGTCGC ACAAGCGGGA TTGGATTTTG CCGGTAATGA CAGCGACAGC
ATTGTGCATA AAAACCTCGG CGAAAAACTG GAGATTGTGG GTCAAGGCTT AGATAAAGCT
AAAGCCGCTG ATTTCAAAGG CACCGACGGC AATATTGCGG TGAAAGCGGA TAGTGCTAAG
AGCAAGCTAG AAATTTCACT CAATGAGGAG TTAAAAGGCT TGAAGTCTGC CGAGTTTAAA
GACGACAAAG GTAATACTGT TAATATCAAC GGCAATGCAA TTACATTGAA AGGTAAAGAT
GGTAATCCGA CTATTGCGAC CTTGAACGAT AAAGGTTTAA CCGTGGGTGA TAAAGACGCA
ACTAATGGCG ACAAAACCCA TGCCGTGTAT GGCAAAGACG GCTTAACCGT GAAAAAAGGC
AAAGACGGCT CAACTGAAGC AATCAGCCTG AAAGTTACCA AAGATAAAGA CGGTAAAGAG
ACAGCGACCC TTGCTTTTGG TAAAGATGCT GATGGTAAAT ATTATATAGG TGCGATTACC
GGACTTGCTG ATTTAGATGA TAAGGCTGAC GGTTCAAGTG TCGCCAATAA AAACTATGTG
GATACGGAAG TTAAAAAACT TGATCAAAAA CTCTCATCAG CCAATTCAAA TAGACCATTT
GATTATTATC TAGACAATGA AAAAGTGGTT AAAGGTCAAG ATGGTAAATT CAAAAAATTG
AAAGATGGCA AGCCTGATCA AGCCCTTTCT GATGAAGAAG CGAAAAAAGT CGTTATTAAA
GCTGAACCGA CGACAGCTCC AATGGGTATC AGCAATGTGG CGAGTGGGTT GGGGTTAGAA
GCACAAACCG AAGAAGCAAA ACAAAAAGCG ACTGAGCTAA CAAAAGCCAT TGATAAGAAA
GTGAAAGCAA TAGACCAAAA AGCTACCGCA CTTTCAAATC AAGCACAAAC TGTAACAGAT
TTAACCTTAT CGGTCAGTGC ATTAGAAATG GCGATGAATG CAATGTCTGA GGGCGAAGAG
AGAAAACAAG CGGAAGAAAA GCTAAAAGAA AGCAAAGAAA AACTTGAAGC TGCTAAAAAA
GAGCTTGAAA CAGCGAAAAG CGAGCTAACA AAAGCCAAAG AAGGCTTAAC AGAAGCCAAT
AAAGAGTATG AGAAACATTA CGGAGGCTAT GAGAAAGTTG CAGAGTTAGT AAATCCGAAA
CAAGATAGCA AAATAGATTT AACCAACACT GCAACAATCG GTGATTTGCA AGCGGTGGCT
AAATCCGGCT TGAAGTTTAA AGGCAATGAT GACATGGAAA TTCGCACGCC GTTGAGCGGT
ACTTTGGCGA TTAAAGGTGA AGAAGATACG AATGGCAATA AATTTAACAG CGACCGCACT
GCCGAAGGCA ATATCAAAGT GGAAATGTCT CAAGATGGCA CAGGGTTAGA AGTGAAATTG
TCTGACCAGT TGAAAAACAT GACGTCCTTT GAAACCCATG AAATCAACGG TAGAAAATCT
ACGTTAAACA GCAACGGCTT GCAAATAGTC AGCCCAGCCA GTGACAGTAC TTTATCTGCT
CAAGGCACGC ATATTGTGGG CAAAGGGGCA AATGCTGGTA AGTCGGCGAG CTATACGCTG
GATGGCGTGA CCTTGCAGCA AGCGAATAAC AGAGCCACAC TATCACCTAA CGGCTTAACG
GTGGTCACAG GGTCAGGTGA TCAGATTCAA ATTGACGGCA AGGAAGGGGA AATCCGTGTG
CCTGATTTAA CACCGAACTC ATCGCCGAAT GCCGTGGTCA ACAAGGGATA TGTCGAGAGA
TTGCAGACAC ATACTGACCA AAAATTCAAC CATCTCGACA ATAAAATTGA GATATTCAAT
AAAGACTTGA GAGCTGGGGT TGCTGGAGCA CATGCTGCTG CCGCATTGCC TACAGTAACG
ATGCCGGGCA AATCGAGCCT TGCTTTATCT GCAGGTACCT ATAAAGGAAA CAATGCGGTG
GCATTAGGTT ATTCGCGTTT GAGCGATAAT GGTAAAATAA TGCTGAAATT ACATGGTAGC
AGAAACTCTG CCGGTGATTT CGGCGGTGGT GTAGGGGTTG GCTGGACCTG GTAG
 
Protein sequence
MNKIFKTKYD VTTGQTKVVS ELANNRQVAS RVDGSSVAEK CGGFFGGMLG AFKVLPLALV 
MSGLLSSAAY GYDVWVHSSN VTDGQTTSNL NRKTQIMYGA NKAFGKVDPY TPSTLPDEAV
ILSDYANKTG KNASYSNKDF NQTVVIGSRA VVGGDKGTAI GFGAATGENT ATSANNPTVT
YEEAKSILLR GEDGSYKDLR EQYFEFATNP KWKPTTEEKK RNIINQVITM EKEKRKKDPK
IALNKEGTAV GYRSFARGEE ATALGNDVVA WGDSSISIGS DNAAGHDAKP LSKEMFRLFY
NVRDDFNYTK EYAAVDFEIM RDDSGKPMLN EQGQLVAENN PSYPIFYNGK YYFRKNNQNN
DEGYYVMVGE DFYFSRPNAQ YLEKIGSDTP GYADKIKEIK SLNDYKRYEL YLKKEGEAYQ
KYLADHRNTK THTWARGNNA IAIGARSIAY GDNSTALGTF AVAAKDYSTA IGSNTVAFGK
GSLVMGNNSY VYADGSVGIG NKVQAIGAGS MVYGKDSFAG GLGSLAIGDH TFANVKMGKV
FNGDGLDING YNLLAKDKKY PEYANDTQGL KNSKITIEDL YRIGDTAAIQ DIMKEHINAH
TTVQEGTGEE KAEQSKDSQG RHNQGAIAIG SYSVALGDNA ISLGRYAYAK EDSTVALGRF
AFAQKESSFS IGNFTRALGK QSVAFGTHSL VEADDSMALG IKAKVLAEMP TALRSEKNKK
NSKPDFTIKN AMAIGNGAEA SFSDSIALGV NAKTDYTQTE MAQGGWAPKN AISIPSSERI
GYLSVGGKNA ERRIVNVAPG ASDTDAVNVS QLRALEEAIL YGNTLEEDNI NSGVKYVSVK
GLDDLKLLVT KEQDYKNYIK IKREYLKLKA RKEIGEETIN TDELDAKLKK YETRYADFSA
KSSKLQEEYD NPAYSLLFPS SDTEEDKKRK RKDRLKSKID EIEKAYADDE KVSVLTMAEK
EKINNSNFNS DRAKKSGSMA IGVGALANSD NSIVIGQNAK IESDKAHNTV LLGNNTSSNF
ANAVALGNYS VADRPAEKFT QAEPEAYIND ELAKAIGGET YAAVSVGKNG GNISEIQEIV
KAKKEFDKWV QANKSSPDQE KYTKEKAEKE ATLKKLALRK ITNVAPGTKD TDVVILAQLK
EAMKGVKNVG TLHYLSVNNP TSKGSNYDND GAKGSNSVAI GVDAGTDKNA SAGIAIGKGA
SSKAQNAIVI GTNVSVDVPN SFVLGSDNKV ETTADKRRTN DAVVVMGSGT TIKNSWATTA
IGAVKASNGK HIKNGTPITG AYIENAPWAT VIGNKSKVYN GTDIIALANN IEVNVNKADF
DTDKNANDNL VIIGNKAKAA RAKDSVVIGS GAKALTGDEK NHAYEKVEKA VSIGQAAVVK
ATGAVALGQG ATVETAAGNS IALGQGSEAT KKETALNEAK IEAINVKFNF KSGVSANGQE
NNKKSVLSIG KKNNERIIKH VAAGAVTSDS TDAINGSQLY AVADEFSKLA VNVLGAEVED
ANKTGFKKSE FTALLSSPST NPPQPQGQTQ KTAMTFKDAI EANIAKLNSG FVFGSGDESG
EQGTHYLGDK LIIKAGNITV KDDTLNKEDK FLSDNIRTQY EKNNKNILIG IKETPSFKNV
LITGEIPEDA SDSAKKNTYD NYAVNKKYLD KRLENVGSNF TVKGDTPKDG KNTSLEINKD
NNVLTINGDS NITTAVDKDS KKLTVSLNKA LTGITSIGKD DKAKIEFSSA DSTHHITFTV
GEGKNKATVK LDKDSLDLGS KQIKNVASGI GSTTPSDGGA TTSNVNNVLT GTSIENIENN
AVNVKDLSDI AKAISDKGIQ LKGNGTSADT KTLKLGSVLT VDSSQSKKTN ANEKDITTKL
SSDSNGALTL TLELNKATSI DGNDERIVTS KAVADKLKDF ATTETLEEDF LRITGENIGD
KQKEFGSNVG ISEIKLDDEE KTELVQAQAL IDYLKGSGDK SVKVSDSSKT VAEGEGSISI
GYDAKSQNEG SIALGYGANA LNSGSISIGQ GSNVLGVNSI GLGKDNKVNG NFSFVVGDEN
EIGSKSTSTY VMGSNNNISG ERNISIGSSN KIEKDENILL GSNNEVSGTG NILLGSRINV
GEDIKDAIVL GNQSVAVSNA LSVGTPRNRR RIVFVGDPEQ DYDAVNKKYV DNLLANKASA
FMVTSDEGNT DNISGTLSIK GAEAEGTANN KKHQNITTKA SKSELIIALN KDLKSIESIG
KDDSNALVFK NDTSNTDGSS STNIAELKVG GHALTFTPEN GTTDTNKKVK ISNVAGGQIA
ENSTDAINGK QLHDLVGNLG LSVESDGNFT APSFTKVKGD DTTSTVSKDK YTTYKDAING
LITAVNKGVT FKGNDGTNSS TKLQLGGTLT IDSSPVSSSD TTGANSATVE KDITVTLAPS
NGSDPQSAGT LTLKLNKADK VEENNGKVVT SKAVADALKN YTKTTDLVDK LGTAYLKVDG
SNIDDKQADF GKNVGIGKVN LENGKKSTTE LVQAQALIDY LKGTGDKSVK VSDSPTTEAL
GDGSVSVGHG AISRNEGSIA MGYGANALNT GAVSIGQDSN VLGMSSIGIG RENTVRGNFS
FAVGDNNVIE KEQNYVMGSD NKITGKQNIS IGSRNEVEGN ENIILGSNIK ADDEIHNAIV
LGTGSLAKSN ALSVGSMRHK RKIVFVADPT DNYDAANKKY VDERGLKFKG NDAGQTEQLV
NLDKLLTIDS SEAKKDDRGM KEKDIKTKIE KEGDNAKLIL TLNKADNVSE HDERAVTSKA
VAAKLMNYAT KFAVEGNKGS KFTVNNSLKI KGKVGSGSAK THQNITTTTK KGASTLEIAL
NSDLKGIKSI GGREIGAGKG KKIASSIEFN KQGANGTKPK TNVVIESNGG TFVFDRTGLT
LNTKQIKGLA SGLGLQAVAD GQGTNNDEAK KKAEEANQAI LDKVLAGNPD MNKTNAVNVQ
DLSAVAKAIV GKVTAQHSEA EKVAVKYDDD TKTSITLGGK GTNGTKSSPV AIDNLKSGLG
IDDIKDSGIA SAAQGKQREL VKKLVSGELD KDSSHKAVNV ADLKAVAQAG LDFAGNDSDS
IVHKNLGEKL EIVGQGLDKA KAADFKGTDG NIAVKADSAK SKLEISLNEE LKGLKSAEFK
DDKGNTVNIN GNAITLKGKD GNPTIATLND KGLTVGDKDA TNGDKTHAVY GKDGLTVKKG
KDGSTEAISL KVTKDKDGKE TATLAFGKDA DGKYYIGAIT GLADLDDKAD GSSVANKNYV
DTEVKKLDQK LSSANSNRPF DYYLDNEKVV KGQDGKFKKL KDGKPDQALS DEEAKKVVIK
AEPTTAPMGI SNVASGLGLE AQTEEAKQKA TELTKAIDKK VKAIDQKATA LSNQAQTVTD
LTLSVSALEM AMNAMSEGEE RKQAEEKLKE SKEKLEAAKK ELETAKSELT KAKEGLTEAN
KEYEKHYGGY EKVAELVNPK QDSKIDLTNT ATIGDLQAVA KSGLKFKGND DMEIRTPLSG
TLAIKGEEDT NGNKFNSDRT AEGNIKVEMS QDGTGLEVKL SDQLKNMTSF ETHEINGRKS
TLNSNGLQIV SPASDSTLSA QGTHIVGKGA NAGKSASYTL DGVTLQQANN RATLSPNGLT
VVTGSGDQIQ IDGKEGEIRV PDLTPNSSPN AVVNKGYVER LQTHTDQKFN HLDNKIEIFN
KDLRAGVAGA HAAAALPTVT MPGKSSLALS AGTYKGNNAV ALGYSRLSDN GKIMLKLHGS
RNSAGDFGGG VGVGWTW