Gene Cthe_0056 details

Gene Information

Locus tag: Cthe_0056
Symbol:
ID: 4808751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 75394
End bp: 89286
Gene Length: 13893 bp
Protein Length: 4630 aa
Translation table: 11
GC content: 43%
IMG OID: 640105465
Product: Ig-like, group 2
Protein accession: YP_001036490
Protein GI: 125972580
COG category: [N] Cell motility
COG ID: [COG5492] Bacterial surface proteins containing Ig-like domains
TIGRFAM ID:
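
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The numbers are copied from the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(75394, 89286)
print(length_bp, protein_length(length_bp))  # prints: 13893 4630
```

Under these assumptions the computed values match the listed Gene Length (13893 bp) and Protein Length (4630 aa).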


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGGGTAA ATATGAGGAA AAATTTTAAA GTACTTATTT CCATTCTGCT GTGTTTTATG 
ATGCTTTTTG GGGAGATAGT ACCTGCCGGA TTGCCGTCGG CCAAGGCTCT GGCGGAAACC
GATAATGTTC TTGAAGTTCA GGATTCATCT TTTAATCAGG AAAATAATAA TTTCTCACCC
GAAAAGACTG TTTCAGAAGA TGTCTACGAG GAAGAAGAAT TGCTTCAAAA CCCTGATGAA
GAAATTCCGG GGTTAAAAGC TGCTTCTTCG ACTATGAGCG TATCCAGTGT ATTTAGCATA
TCCGGTAAAG TTGTCCTGCC GGAGGGAAAA GTAGCTCCAT CCGGAGGTTT GACTGTAAGA
GTGTATGCTA AAAGCAGCTC AAAAACTAAC AATATAACTG TTATTATCCC CGAAGGAAAA
AGCAGTGCTG ATTACACTGT GATTGTTCCT GAGGCAACTA CGTATACTTT GTATTATCAG
ACATCGAATT CCGATTATGT GGACACAGGT TATTACAGTG TAATTGGAAC TGTCAGAAGC
TTTTCTGCGG CGGATCAGAT AAGACTGACA GCAGATGTGC CTTCAGAGAC GGATATAGAT
CTTACTCTCA TAGCAAAAAG GATTATTTCG GGAACTGTAA CCCTTCCTGA CGGCGAAATC
GCCCCAGCCG GCGGAGTGTC AGTAACCGTA ACGGCGGTAA GCGGATCTGA CAAGGCAACG
GCTAATGTAG TCATTCCGGA AGGATTGGAT TCTGCAAATT ATACATTAAA AGTTCCCCCA
AGCACATCGG GAAAAGGATA TCTTGTCAGC TGTCAAACTT CGAACAAAAT TTATTTGCAG
ACTGCATATT ACAGTATGCG CGGGTCTGTA AGATATGACA CTCTTGCAAC TTTGGTTGAT
GTTATTGACG AAGATAGGAA AGATATCAAC TTAAACCTTA TTGCAAAGAA AAAAATAACC
GGTACTGTAT CTTTGCCTGA AGGTACGGCA CCTAAAGACG GTGTTACAGT TACCGTATGG
GCAACCTATG GTTCGGATAA AGATTCTCAA ACTGTAACAA TACCTGAAGG AGCTTCATCG
GTTGAATATG TATTGTATGT TCCCGATGGT TCAGGCTATA CATTGTATTA TGAAACAACG
AATGTGGCAT ACCTTAGTAA AGGTTATTAC AACGAAAACG GAACGGTTAA GGACAGTAAA
GCGGCAACGC CTGTGGATAC CGTTTCGAAA GATGCGGCAG ATAAAAATAT AACACTTATT
GCAAAACGTA TTGTTTCGGG AAAAGTGTCT TTGCCAAAAG GAGTTGCTCC TGAAGGCGGT
ATAAAAGTAG AAGTAATTGT GGCATTGAAT AATACCAAAA TTGAAAGTAC AACAATTACA
ATACCTGAAG GCGAATCGGA AGCCACTTAC TCCATGTATG TTCCGACAGG TTCGGGCTAT
AATGTTTCTT ACAAGACATC CAACAGCTTG TATGTCGAAC AGGGTTATTA TAAAAAGGGT
TCAACGGTCA GATATTTAAG TTCGGCAACT TTGATTACCG TTGAAAATGA GGACCTGAAT
GACATAGATC TTGAACTTAT TGAAAAGAGA ACCATAAGTG GAATTTTGTC CATGCCGTCC
GGTACGGCAC CCAAGGGAGG AATAAGCTTT GAAATTACGG CTTACAACAG CTCTACTGAA
GCAAAGGTTA CTGTTACAAT TCCGGAAGGC GAAAGCTCGG TACCGTATTC AATAAATGTT
CCGGCGGATG AAGGGTACAT AATCAAATGC AAATTGCTGA CGCTTCAGGC AATATATATG
AATGAACAGT ATTACAGCAG CGGCGGTACC GTATACAATG TCGCTTCTGC TTCGAAAATC
AGCACTCTCA GCGGTAACCA GCCGAATATA AACATAATGC TTCTTGAAAA AAGAACCGTA
AGCGGTACAT TATCTTTTCC CGAAGGATAT TATGCTCCTC CGGGCGGGAT AATTTTCACC
CATGAGACCG ACGGAAGAAT AACCTTCATT TATATTCCAG AAGGAGAAAG GTCCGCACCT
TTTACAATTT ACTACAATCC TGGAGTATAC AAACTTTATT ACGAATGTGA TGAAGATGAT
ATATTTGTTT CACCGGGATA CTACAGCAAG GGCGGTACTG TTATGGACAA AAACAGTGCC
GATGATATTG ACGTCAGAGA AGGAAACCAG GTGGGTATAA ATCTGGTTCT GCTCCTTAAG
AAAACCATAT CGGGTAAAGT GGTTCTTTCC AATGGAGTTG CGCCGGAAGG TGGCTATACT
GTAAAAGTAA AGGCTTCCGG AAGCAAGGGA AGTGCGGAAC AGACCGTTGT GATTCCAAAA
GGAGAAAAGT CTGCGGATTA TATTCTGTTT GTCAACCCCG GAGACAAATA CAGGGTATGG
TATGAGACTT CAAAAGAATA TAATTTCGTA AGTCCTATGT ATTACAATTC CGACGGAATG
GTAAGAGACA GTAGTAGCGC ATCTCTCTTG GATTTAACCA AGGAAAACAA GACAGATATA
GATTTGACTC TTACCGAAAT GCGTTCTGTC AGCGGTAATA TTGTTCTTCC ATCCGGTGTT
GCACCGTCGG GAGGTATTTC GGCAGATATT GTTGTTTCCA ACGGAAAAGA CAGTGGAAAA
GTAACAGTAA AAATTCCGGC GGGAGAAAGA TTTGCAAGCT ACACCGCATA TGTTCCTGCA
GGAAAGGACT ATAAGGTTAA ATATACGGTA GATGCAAAAA GCGATTATGC TACCGACGGT
TACTATGGGG TAAGCGGAAC CACGCTCAAC GCAAGTTCTG CTGCATCTCT TGACCTTACT
TCGGAAAACA AAACTGAAAT AAATATGACT TTGATACCTA AAAGAACAAT AAGCGGTACG
GTATCTCTGC CTTCGGGAAT GACTCTTGGC AGTGATACAA AAGTGACTGT TTATGCAGGG
GACAGCTACA GCACAACTGT TACAATACCT AAAAACGGTT CTTCTGTGGA ATATGCTATC
AAGGTTCCGC CAAACAGTGC GGGATCAGGC TATAAGGTAT ATTACAAACT TAGCTCCACT
ACTATGCTTA TTTCACCGGG ATACTACAGC AGCAGCGGAA TGACTGCTTC GGAGATTGGA
GCGGAGCTTG TGGATGTAAG CAGCACAGAC GCGACGGTAG ACCTTGTCCT GATACCGAAG
AGCAGTATTA ACGGAAGCGT AATACTTCCG GAAGGTGTTG CTCCAAAAGG TGGCTTGAAA
GTAACTGTTA CAGCTTCAAA TAATAAAAAC AAAGGGTCTG TTACCGTTAC AATACCGGAA
GGAAAAAGCT CGGCAGAGTA TTCTGTTTAT GTTCCGTCAG GAACCGGATA TCTTGTAGAA
TACTCAGTTA CTGATGAAGA ATACGTTAAA AAAGGTTATT ACAGCACATC GGGAACTGTC
AGGGATTCAA GTGCTGCAAC TTTGATTAGC CTTGACGGAG AGAGCAAGCA AAATGTAAAC
CTTACACTGC TTAGAAACAA TAAAATAACA GGTAATGTCA CTCTTCCAAA GGGAGTTGCT
CCGTCAGGAG GATTAAAAGT TACGGTTTGT GCCTCCAACT CAACGGGCAG CGTTAAAACT
ACGGTAACAA TACCTCAAGG CTACAATACA ATAAGCTATA CTCTTTTTGT ACCGCAAGGA
AAGAATTATA CCGTATGGTA TGAAGCTTCG GACAGGAGAT TTATGCCGAC AGGTTATTAC
AGCGACGGAG GTACAACGGT TGATAAATCC AAAGCTAAGC TTATTGATGC AACTATCAAT
GTTTCATCGA TTAATATTGA TTTGATTCCG AAGATGGAAG TAAGCGGAAG TGTAAAACTT
TTATCGGCAC CTGCACCTGC AGGGGGAATC AGTGTAAAAC TTACAATAGA CAACAAGATT
TCCAGTGATT CGGTGGATGT GGTGATTCCG GAAGGGTTTA TGTCTGCTCC TTATGCCCTT
TATGTTCCGG CAGGCGAAAA CTACATCTTA AAATACACTA CATCAAGTTC AGGTTTTGTA
AGTCCTGGAT TCTATTCCGA ATCCGGTTCG AAAGAGACTG AAAAGGAAGC TTCCTATCTG
AATATAACAG GTGACAGGAC AGGTATCGAT ATACCGCTTA TTACCAAAAG AACCATTCGC
GGTACGATAT TGCTTCCTGA AGGATATGCT CCGGCAGGCG GAGTCAGCGT AAAAGTAACT
GCGCAAAGCT CGTCTGATAA AATAACGGCA AGCTTTGTCA TTCCTGAGGG AGAAAACGCA
GTGCCCTACA CATTGTATGT TTCCTCCGGC AAGGAATATA TTGTAAAATA CGAAACTACT
GATGAAAAGT ATGTAAGTAC AGGTTTCTAC AGTATGTTAA AAACCACCCG TCTGGAATCG
GAAGCGGATA AATTGGACAC GACCGAAGCA GACCAGGTGG GTATCAACCT TAAACTTATA
AGCAACAAAT ATATAAGCGG AACTGTCTCA ATTCCGTCAG GAATAGCACC TTTCGGGGGC
ATAAAAGTAA CGCTCAAAGC TACGAACGGC AAGGACACAA AAACTGTTAA CGTTGTCATA
CCTGAAGTAA GCAGTTCGAC AACCTACAAA ATTTATGTTC CTGAAGGCAA AGATTATGAA
CTCAGTTATT CCATAAGCAA CGGAGACGGA AAATATTTCC CTACCGGATA CTATAATGGA
CTTACAGCAA CCAGGGAAAA AGGAGAATGT GTGCTTCTTG ACCTTTCCGG CGAAAGCAAG
GAAGGAATAA ATATTACATT GATACCCAAC AGGCTGGTCA CCGGAAGCCT TATATTGCCT
TCGGGAGTGG CTCCTGCAGG AGGCTTGAAG ATTACTGTCA GGGCTTCAAA CAAAAGGGAT
TCGGTACCAA CTACGGTAAA CATGCCTCAG GGCAGCAGCT CCGTAGTGTA TAGAATGTAT
TTGCCGGAAG GCAGTGATTA TACTATCGGA TATACCATAA GCCATGAAGA TTATTTGAAC
GGTTACTACA ATATTAATGA AACGGTTCTT ACTCAATCTG AGGCAACAAC CTTTAATGTT
GAGAAGAGTG ACATCTTGGG ATTAAACATT GTTCTTATCG CAAAAAGGAC AATAACCGGT
ATTGTAAGTC TGCCTGACGG CAAGACGGCC CCGGCAGGAG GAATTGATGT TACGATATCT
GCCGGAAGCT ACAGTACCAA GGTGACAATT CCTCAGAGCA AAAACTCTGT GTCTTACACA
TTGAAGGTGT CACCAAATGC AGTCGGTTCG GGATACGCAA TAAAATACGC AATTACCTCT
GCTACGGATT TTGTCCAGAC CGGTTTCTAT GGTGAGGATA AAACTGTGGC GAGGGTACAG
GAAGCGGATT TGGTTGATGT CAGCCTGGAA AACAAAGCCG GTATCAACCT TACCATACTG
GAAAAACGTG TTATCGCCGG TGTGTTCAAG CTTGATTACG GTTATGCGCC TTCAGGAGGT
ATGAATGTCA CAATATCTGC GACAGGTAAG AACGCCGAAG GCAAAAGCAT GACCTATCAG
CAGGTAATTT ACTTGCCGTT TGGCCACAGC CAGGCAGAAT TTAAGCTGAA TGTGGATCCA
AGCTATTATG CTATGGGTTA CAAGCTTAAA TATACTATGG ATTCTGAGTA TGGTTATGCT
GAAATCGGAT ATTACAATGA TATAGAAGGA ACAGTGCAAA ATGAAAAAGA GGCAACATTG
ATTTATGTGG ATTACGAGGA TCAGTTAGGC AAAGAAATAA CTGCCATCAG CAAGAACCAT
ATAAGCGGTA GTGTTTCTCT TCCTGACGGA GCAATAGCTC CAAAAGGAGG AATAAAAGTA
AGTGTTCATG CTGAAGGGCC GGGCGGCAGC GGTTCTGCAA ATGTTACTAT TCCCGAAGGC
CAGTCCGGTG CAGGCTATGT ACTGATTGTT CCTCCGGGAA CACAATACAA GCTGTATTAC
AGCATGGCAC CAAATAACAT GTACGTTTCA AGCGGATACT TAAGTTCTGA AGGAACTGTT
TTAGACTCTA AAAAAGCAGA GCTGTTTGAT GTAAAAGGAG ATGTTGACAA CAAGAATATT
GTACTTATAC CCAAGAGAAA AGTATCCGGA GAAGTGACAC TGCCGGTAGG GGTATTTGCG
CCCAAAGGCG GCCTTAAAGT GGAAGTGACC GTTCAAAGCA GTCAAAGCAG TGATTCATTG
ACTGTTACCA TACCGGAGAA CGGTCAGTCG CAAAGCTTTG AGCTGTATCT GCCGCCACAG
AACGGATATA AATTGTTCTA CAGTTTAGCT TCCGGTACCA CCTATGTAAG CAAGGGATAT
TATGCTCAAA GCGGTACTGT CATAGATGAT AAGCTGGCAT CGGAGATAGA TTTGAGAGAC
AAGGATTTGA CCGGTGTAAA GCTTTCGCTT GTAGAAAACA ACATTATTAA AGGAAGCGTA
ATTCTGCCTT ACGGTGTCGC ACCTGAGGGA GGAATTGAGG TAAGAATTAC TGCCGACAAT
GGTAAATTCA AAAATACAAC GAATGTAACA ATCCCTGAAA ACGGCAATAA GATTGATTAT
GTGCTGCCTG TGCCTCCGGC ATCGGGATAT AGGGTATCTT ATCAGGTTTC AACCGGACTT
GATTTTATAC CTACCGGCTA TTATGGCAGT GAAGGAACGG TTACGTCCTC AAGCGGAGCA
TTAAAACTTG ATTTGACTTC GGGAGGAAAA GAAGGAATTG ATCTTTCTCT TGTTTACTAT
AACTCGATAA GCGGAACCGT ATCCTTGCCT GAAGGAGTTG CACCGAAGGA AGGAGTAACG
CTTACAGTAT TTGCAGCCAA CTCCAGAAAC AAGAGAGAGA CAACAGTTAC AATCCCGAGC
GGTAAAAGCT CTGCAAATTA TAATATTTAC ATTCCTGACG GCTATGGATA CAAGGTATAT
TATGTCATGA CCTCGGATGT AAAATATGTT GACAAAGGAT TCTATGCAGG CATCGATACG
GTTACCGATG AAAAAGAGGC TGCCACCGTG GATGTCAGCG GGGGTTCGGT AACAGACATA
AATCTTACTG TCATTGCAAA AAGAACAATA AGCGGAACCA TATCTTTAAA AGACGGCGAA
AAAGCACCTC AAGAGGGTAT TGCTGTAAGG ATAACGGCTA TTGACGGAGA TGAACAGACA
GTCGTAATTC CTTATGGTAA AAGCTCGGTT GCATACTCAT TGAATGTGGT TCCAAATGCA
GCAGGAAAGG GCTACAAAGT AAGATTTGAA ACCATTAAGA ACTACGGATA TGTGCGTTAC
GGTTATTATA CAAAAGACGG CGCGGTAAGA AGCGAAGCTA ATGCTGAGTT TGTGGATGTA
AGCAGAGGGG ACAAAGACAA TGTGAACTTT GAATTGGTAC GTCCGCGTAC AATAAAAGGT
ACGGTCGGAC TTCCGGAAGG TGCTGCGGCA AGCAGGGATA TAACTGTAAC CGTTATAGCA
TCCAACAGTA TTGACAGCGC GGACACTGTT GTCTATATAC CGAAAGGCTC AAAAGAGGCC
GCTTATACCC TCTTAGTTCC GCCTAATGAC AGCAATGATG AATACAAGGT AAGGTATGAG
AACTGGCATG ACAACAGTTT TGCCGATATT GGATACTATG GTTCGTCGGA AACTGTCAGA
AGTGCTGATT TGGCAAAGGG AGTCAATGTA AGAAAAGAAA ATGCCGAAGG CATCAATCTT
ACCCTTATTG CCAAAAAGAC TGTTTCAGGA AAGATTTCGC TTCCTTATGG AACAGCACCA
AGAGGTGGAC TTACTGTAAC GGTTTATGCT GAGAATAATA CCGATAAGGT TATTTCCTAT
GTAACCATAC CCGAAGGCAA AAACAGCATG GACTATTCAT TAAGCGTTCC TGTGGGCAAA
GGATACAGGA TAGGATATGA AATGTCCATA AAAAATGACT TTGTTCCGTG GGGATATTAC
GGAAACAAGG GCATGGTCTT TATGCCCGGC GAGGCCTATC TTATGAATAT AAGCAGTGAT
ATTGGCGGCA TTGATTTGGA ACTTATAGAA AAGAAGTCCA TCAGCGGTAA AGTAATTCTT
CCTGAAGGAA CAGCTCCAAA AGGAGGAATT AAAATAGAGG TCTATGCTGA AGATGCCGGA
GACACTTGGG TTACAATTCC TGAAGGACAA AGCTATGCTG AGTATACGAT GAAGGTTCTG
CCGAGCCTTC AGGGCAGCGG TTACAAAGTT AAATACGTAG TTTCGTCGGA TTATGGCCTT
GTAGGATACG GTTATTACAA CAAAAACGGT ACGGTCAGAA ACAGCAAGCT GGCTGAGCCG
GTTGACGCAA ACTACAAAGA TGTTGCCAAT ATAGACATTG AATTAATGAA ACCGAGAGTA
ATAAGCGGAA AGGTTTCTTT GCCGGACGGT GTTGCACCTT CAGGAGGTAT TTCATTAAAT
ATTGCGATAT TTAATGAAAC AGACGGAAAT TCGCAGATTA TTACAATACC GGCTGGAAAA
TCTTCGGCAA CTTATTCCAT AAGCGTGCCT CCGAATGACC CGGGCTATGA GTATACAGTC
AGGTATGAAA ACTGGTCGAA CAAGATTTAT ACAACCTATG GATACTATAA CAGCAAGGGA
ACGACCAATA ATATGTCCTA TGCACAGTTG GTGGATGTAA ATGAGAAGGA CGCTTCAAAC
ATTGACATGG TTCTTCTTAA GAAGGCCACG GTCAGTGGAG TTATTGAAGT ACCCGAAAAT
GCAGTGCTTC CGGAGGAAGG ACTCCATGTA AGGATATACG TATCCAACGA TGTCGAAACC
TACTCAGCGA ATGTGACTAT ACCTTACGGT ACATCATCAG TGCCATATTC AGTGCATGTG
GAAGAAGGAA CCGGCTACAG GCTGTTCTAT GTACTGGATA GCAATGAAAG CTTTATGGAA
TACGGATATT ATGCTGACAG CGGAGTATAT ACCGATAAAA AGATGGCTAA GGTATTCAAC
GTATATAATC AAAACATCAG CGGCTACAAG TTGAAGCTTT TAGAGAAGAG AAGAATCTCG
GGTAAACTGA TTGTGCCGGA CGGTGCCTTT GAAAACGAAG GCTACTTTGA GGTTGCCATT
AGTGCAACAA ACGGATTTGA CACCGGTACA GCCAAAGTTC TCGTTCCTTA TGGTAAGGCA
GAAGCGAATT ATACATTGAC TTTACCGGCA GGCAGCGGAT ATATACTTCA GTACGAGATT
TCCAAGATAA AAGGATATAC ATCCGTTGGA TATTATGGTG CGGAAGGTAC CGTAAGAAAC
AGAAACGAAG CATTAGCCCT GGATTTGAGG GAAACTGATT TAACAGACAT TGATATTTCC
CTGATTCAGG ACATGACGAT AAGCGGTACA ATCAGAATTC CACAAGGAAC GGCTCCGGCA
GGAGGCATCG AAGTTGTTGT AACAGCAACC GATGAATCCG GAAACATGGC ATCAGAGAAA
GTAACCATTC CGGAAGGCGA AAGTTCGGCG GACTATTATC TGAATGTTCC GCCAAACGCT
CCTGATTCAG GATACAGGGT AAGGTACAGT GTGGCAGCTG ATAAATATGC GGCTATTGGT
TATTACAGTG AGAGCGGTAC AAAGGCATTG CCGAATGAGG CAACACCGGT GGATGTAAGC
AGTAAAAATG CGGAAGAGAT AAATCTGGTG CTTATTGAAA AGAAGATAGT GAGAGGTGTT
GTATCCATTC CGGAAGGGGC TGCCGGAAAA GGCGGGCTGC CCGTAACCAT AAAGGCGACA
AGCAGATTGT TCAGCGTGGA GGATGCGGTA ACGGTGACCA TACCGGAAGG AGAGAGCATG
GCTCCGTACA CCTTGTGGGT ATCACCGAAT GTAGAAGGAG CAGATTATAT AATTTCCTAT
GAGGTAAGTA ATACGGCTTA TATAAACAGC GGATATTACA ATCAGGCAGG TACGGTAGTG
GATATTAATA TGGCAACTCC TGTTGACGTA AGCAACGGGG ATTATACCAA TGCCAATATT
TCACTGATAA GAAGCAGAAA GATAACAGGT AAATTGATAC TTCCTGAAGG AGAAACCGCT
CCTAAGGGTG GTTTGCCGGT AACAGTTTTT GCAGAAAAAA CAGGTTATAC CGGTTACAGA
GTGACAAAGA CCGTAACAAT ACCTGAAAAA CAGAGCAGTG TGGACTACAT CCTGTACGTA
CCTGAAAGTA CGTCAAAGGC AGTAAAACTT AAGCTCCAGG CTTCAACCGG AAACGGAACG
GAGGACAAAG CCGACGATTA TGTTTTGAAT ACGGAAGTCT TTGTACCGGT AATTGACGGC
AGCGGAAGTT CCGAATACAA AGTGGGTTAT TCGTATACTC AGGATGAACA GTACTTTAGA
AGCGGATTCT ATACTAAGGA TGGAACAGTG CCTGCAATAA GCATGGCCGG CACAGTGAAT
GTAGCCAAAA AGGATGTTCA GAATGTAGAT TTGACTCTGC TTAAAAAGAA CAGAACAATA
AAAGGAGTTG TCAAACTTCC GGACGGCAAG ACGGCTTCCG GAAATATTGA GGTGGAAATA
ACAGCTGAAA ACGATGCGTT GGACTTTGCT CCGCAGAAAA CAGTAACAAT AGGCAAAGGA
AAATCATCTG TTGAATTTGA GATTGCAGTT CCGTCACTTG ACAGCTATCG CATAAAATAT
ACGATTAAAT CAACAACCGA CGGATATGTA ACCAGCGGTT ATTACGCAAC AACCGGAACC
GTAGGAAAAC CTGAACTTTC AACACTTGTA AGCACCTCAT CCGGAAATGT GGACGGAATT
AACTTAAATC TGATACCCGG CATGGAAATA AGAGGAGGAG TTTCTCTTCC GACAGGACAG
CAGGTAAACC GCAATGACTT CTGGCTGTGG GTTTCGGCCT CTAATGAGAA TTATGAATCT
TCGGTATATG TCACAATAGC TAAGGGAAGT TCTTCAGCAA ATTACTCTTT ATATGTACCC
GAAGGTTCAG GATATATTGT GAGTTACTCA ATTCTGCCTC TGTTTGGAGA GTATGTGCAA
AAGGGTTATA ACAATGCAAG TGTAACCACA GCAAATAAAG ACAGTGCGAC AAAGTATAAC
GTGACAAAGA ACCTGAGTGG TATCAATCTC ACACTTTTGC CTCTGGACAG AGCTATATCG
GGTACTGTAT CGCTTCCTGA CGGTACGGCT TCGGTGGGAT ACACAATCCA TGTTCCCGCC
AACAAGAGAG GCAGTGGCTA TCAATTGGAG TATTCGGTTG TTTCGGGCAA TGAAGGCGGT
GCATACAAGG AGAAAGGATA CTTTAGCCTT GCCGGAACTT CGGCTGACAA GGCAAAGACA
TCAATTATCG ATGTAAGCTC AAAAAACAGC ACGGGCAATG ACATGACACT GCTTGCAGAT
ACCATGGTAC CTGTAGATGC AATTATGCTG GATAAGTATC AGGTAACCAT ACAGAGCGGC
AAAACGGTAA ATCTCAAAGT TAAATATTTA CCGGAGAATG CTACAAACAA GACAATTAAA
TGGACTTCCG GCAACACAAA TGTTGCAGAG GTTTCGTCCG AGGGTGTTGT CAAGGCAAAA
GCAACGGGAA CGTCAGTTAT AACTGCAAGA ACTCACAACG CAGTAATAAC AGTATTTACT
GTCAGAGTTG TTCCGGCCGA AAGTTCTGCC TTGAGTATTG ATAAACTGTC TGTGAGCATA
AACCCCGGAG AAAAGGAACA GCTCGAGGCA ATATTCACTC CTCAGAGTGA GGATGACAAA
GTAATATGGA AGTCTGACAA CACCGATGTG GCAACAGTAT CGGAGAACGG TCTGGTAACG
GCGAAAAAAT CCGGTACGGC GGTAATTACA GCTATGAGTT CGAAAGATCC TTCAGTATAT
GCAACTTGTG AAGTTGTTGT CATAACTCCT GTTACCGGTG TGGAAATTGA CAAGACCAGG
CTTGAAATAA AAGTAGGATA TAATGAAAAA CTTACAGCCG GGGTACTGCC CGAAACTGCA
AGTTATAAAG GAATTACGTG GATATCAAGT GATGAAAGTA TTGTCAGGGT GTCACAATCC
GGAGAAGTGA CTGCCGTGAG TATCGGAACA GCGGTTGTAA CCGCCAGCAG CATATATAAT
CCTTCTTTGA AGGCGGTATG CACTGTTAAA GTAATACCGG TTCCTGTAGA GGAAATAAAA
CTTGACAAGC AAACAGTCAC ATTGTATCCG GATGAGTATA TCCTGATAAA TGCGGAGGTC
CTTCCTTCGA ATGCTTCAGA CAAGAGAGTT GCATGGAAGT CAGAGAATAC AAATATTGCA
ACTGTGACGG CAGAAGGTCT CGTAAAGGCG GTAAATATCG GTGAAACAAA GATAATTGCA
ACCAGCCTTT ACGATTCATC CAAAACGGCT GTGTGTGTAG TTAAAGTGGT TGCAAGACCG
GTTACAAAAG TGACATTTAA GAATGTACCT GAGTCCATAC TGGTTGGCCA AAGCAAAGCG
TTGGAGGTAA CCGTATCTCC GTCCAATGCT ACCGATAAAA CACTGGTTTG GACATCAAGC
GATGAAAAAA CTGCAACGGT AACCCAGGAT GGCGTTGTGA CAGGAAAGGG AGTCGGAACG
GTTACAATTA CCGCAGCATG GAAGAACGAC CCGAGTGTAG AGGCAGTATG TACATTGAAA
GTTGAGCCGG TAAAAGTAAC AGCAATCAAA TTGAAACAAA CCAGCATATC TTTGGGTATT
GGAGACGAAG TAAAACTTGT TGCCGAAATA ATACCGGAGA ATGCCACTAA CAAAGAAATT
ATATGGTCAA CAAGCGACAG GGATATTGTA ACGGTATCAT CCGACGGAGT GGTTAAAGCT
GTTTCCATGG GAAGAGCTAC AATAACTGCT ACCAGTAAAG AATCCTCAAG CATTAAAGCT
ACATGCAGCG TAACAGTGAC AGGAATAGAA GTAAGTCAGG TAAGAATTAA CAACAAACCT
TCAACGCTGG CTGTGGGAAG CACCCACAAA CTGACAGTTA CTATAAATCC GACAAATGCT
TCGGATAAAA CACTGGAATG GAGTTCCAGC GATACATCCA TAGCAACGGT AAGTTCTTCC
GGAGTTGTAA CGGCGAAAAA GGTCGGTACT GTAACAATTA CGGTAAGCAG CAAGTCGAAA
CCGTCCTGCA AGGATTCGTG CACAATAAAA ATAGTTGAAG CAACACCATC ACCGACACCT
GCGGAAATTG TCAATGAACC CGTTGTAATT CCGGGCGGAC CTGGCGGAGG CGGCGGAGGA
GTTCCGGTTG CTTCCATCGG TGGCTCACCT ACACCTTCGG CGACACCGAC AGTGACTCCG
ACAGCAACGC CGACATCAAC ACCGTCAGCA GGAAAGACTC CGGCAGCGAG TCCGACACCG
GCACCGGTTG TTACGGATAC AAACAGTCTG GACTTCTTTA CAGATATAAA GGGTCATTGG
ACGGCAGAAT TCTTTGCTGA TTTGCTAAGA AGAGAAATCG TCAATGGATA TCCGGACAGA
ACTTTAAGGC CAAATGCTCC GATAACCAGA GCCGAGGCAA CGGTAATGGT AGTGAAGGCG
GCAGGCTTTG AAGTGTCCGA TAGTATACCA CTTACATGCA CTGATAAAGA TTCGGTACCG
GCATGGGCAA AACCCTATGT TGCAACGGCA ATGCATAAAG GTGTGGTTAA GGGTTATGAA
GACGGATCCT TCAGGCCTTC GAACCGACTT ACCCGTCAGG AGACAGTGGT TTTGGTACTG
AGGGCATTCG GAATAGAAGA AGCGCAGGAT AAGACCCTTG CCTTTGCTGA TTCGGATAAG
ATACCGGCCT GGTCGAGAGG CTATATAAAG AAAGCAGTGG AATTGGGAAT AATCAAAGGT
TACAGCGACA ACACCTTTGG GCCGCAAAGG GAGATTACCC GTGCCGAAGT TGTTACCATA
ATATCAAAGT GTATTCAGCT TAAGGGAAGG TAA
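
The gene sequence above is printed in space-separated groups of ten bases, so it has to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how such a block could be cleaned and its G+C fraction computed. It is not necessarily how the GC content value in the record was derived, and it is shown only on the first line of the sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a small example.
first_line = "TTGGGGGTAA ATATGAGGAA AAATTTTAAA GTACTTATTT CCATTCTGCT GTGTTTTATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the whole sequence block through `clean_sequence` in the same way would give the full-length value for comparison with the GC content field.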
 
Protein sequence
MGVNMRKNFK VLISILLCFM MLFGEIVPAG LPSAKALAET DNVLEVQDSS FNQENNNFSP 
EKTVSEDVYE EEELLQNPDE EIPGLKAASS TMSVSSVFSI SGKVVLPEGK VAPSGGLTVR
VYAKSSSKTN NITVIIPEGK SSADYTVIVP EATTYTLYYQ TSNSDYVDTG YYSVIGTVRS
FSAADQIRLT ADVPSETDID LTLIAKRIIS GTVTLPDGEI APAGGVSVTV TAVSGSDKAT
ANVVIPEGLD SANYTLKVPP STSGKGYLVS CQTSNKIYLQ TAYYSMRGSV RYDTLATLVD
VIDEDRKDIN LNLIAKKKIT GTVSLPEGTA PKDGVTVTVW ATYGSDKDSQ TVTIPEGASS
VEYVLYVPDG SGYTLYYETT NVAYLSKGYY NENGTVKDSK AATPVDTVSK DAADKNITLI
AKRIVSGKVS LPKGVAPEGG IKVEVIVALN NTKIESTTIT IPEGESEATY SMYVPTGSGY
NVSYKTSNSL YVEQGYYKKG STVRYLSSAT LITVENEDLN DIDLELIEKR TISGILSMPS
GTAPKGGISF EITAYNSSTE AKVTVTIPEG ESSVPYSINV PADEGYIIKC KLLTLQAIYM
NEQYYSSGGT VYNVASASKI STLSGNQPNI NIMLLEKRTV SGTLSFPEGY YAPPGGIIFT
HETDGRITFI YIPEGERSAP FTIYYNPGVY KLYYECDEDD IFVSPGYYSK GGTVMDKNSA
DDIDVREGNQ VGINLVLLLK KTISGKVVLS NGVAPEGGYT VKVKASGSKG SAEQTVVIPK
GEKSADYILF VNPGDKYRVW YETSKEYNFV SPMYYNSDGM VRDSSSASLL DLTKENKTDI
DLTLTEMRSV SGNIVLPSGV APSGGISADI VVSNGKDSGK VTVKIPAGER FASYTAYVPA
GKDYKVKYTV DAKSDYATDG YYGVSGTTLN ASSAASLDLT SENKTEINMT LIPKRTISGT
VSLPSGMTLG SDTKVTVYAG DSYSTTVTIP KNGSSVEYAI KVPPNSAGSG YKVYYKLSST
TMLISPGYYS SSGMTASEIG AELVDVSSTD ATVDLVLIPK SSINGSVILP EGVAPKGGLK
VTVTASNNKN KGSVTVTIPE GKSSAEYSVY VPSGTGYLVE YSVTDEEYVK KGYYSTSGTV
RDSSAATLIS LDGESKQNVN LTLLRNNKIT GNVTLPKGVA PSGGLKVTVC ASNSTGSVKT
TVTIPQGYNT ISYTLFVPQG KNYTVWYEAS DRRFMPTGYY SDGGTTVDKS KAKLIDATIN
VSSINIDLIP KMEVSGSVKL LSAPAPAGGI SVKLTIDNKI SSDSVDVVIP EGFMSAPYAL
YVPAGENYIL KYTTSSSGFV SPGFYSESGS KETEKEASYL NITGDRTGID IPLITKRTIR
GTILLPEGYA PAGGVSVKVT AQSSSDKITA SFVIPEGENA VPYTLYVSSG KEYIVKYETT
DEKYVSTGFY SMLKTTRLES EADKLDTTEA DQVGINLKLI SNKYISGTVS IPSGIAPFGG
IKVTLKATNG KDTKTVNVVI PEVSSSTTYK IYVPEGKDYE LSYSISNGDG KYFPTGYYNG
LTATREKGEC VLLDLSGESK EGINITLIPN RLVTGSLILP SGVAPAGGLK ITVRASNKRD
SVPTTVNMPQ GSSSVVYRMY LPEGSDYTIG YTISHEDYLN GYYNINETVL TQSEATTFNV
EKSDILGLNI VLIAKRTITG IVSLPDGKTA PAGGIDVTIS AGSYSTKVTI PQSKNSVSYT
LKVSPNAVGS GYAIKYAITS ATDFVQTGFY GEDKTVARVQ EADLVDVSLE NKAGINLTIL
EKRVIAGVFK LDYGYAPSGG MNVTISATGK NAEGKSMTYQ QVIYLPFGHS QAEFKLNVDP
SYYAMGYKLK YTMDSEYGYA EIGYYNDIEG TVQNEKEATL IYVDYEDQLG KEITAISKNH
ISGSVSLPDG AIAPKGGIKV SVHAEGPGGS GSANVTIPEG QSGAGYVLIV PPGTQYKLYY
SMAPNNMYVS SGYLSSEGTV LDSKKAELFD VKGDVDNKNI VLIPKRKVSG EVTLPVGVFA
PKGGLKVEVT VQSSQSSDSL TVTIPENGQS QSFELYLPPQ NGYKLFYSLA SGTTYVSKGY
YAQSGTVIDD KLASEIDLRD KDLTGVKLSL VENNIIKGSV ILPYGVAPEG GIEVRITADN
GKFKNTTNVT IPENGNKIDY VLPVPPASGY RVSYQVSTGL DFIPTGYYGS EGTVTSSSGA
LKLDLTSGGK EGIDLSLVYY NSISGTVSLP EGVAPKEGVT LTVFAANSRN KRETTVTIPS
GKSSANYNIY IPDGYGYKVY YVMTSDVKYV DKGFYAGIDT VTDEKEAATV DVSGGSVTDI
NLTVIAKRTI SGTISLKDGE KAPQEGIAVR ITAIDGDEQT VVIPYGKSSV AYSLNVVPNA
AGKGYKVRFE TIKNYGYVRY GYYTKDGAVR SEANAEFVDV SRGDKDNVNF ELVRPRTIKG
TVGLPEGAAA SRDITVTVIA SNSIDSADTV VYIPKGSKEA AYTLLVPPND SNDEYKVRYE
NWHDNSFADI GYYGSSETVR SADLAKGVNV RKENAEGINL TLIAKKTVSG KISLPYGTAP
RGGLTVTVYA ENNTDKVISY VTIPEGKNSM DYSLSVPVGK GYRIGYEMSI KNDFVPWGYY
GNKGMVFMPG EAYLMNISSD IGGIDLELIE KKSISGKVIL PEGTAPKGGI KIEVYAEDAG
DTWVTIPEGQ SYAEYTMKVL PSLQGSGYKV KYVVSSDYGL VGYGYYNKNG TVRNSKLAEP
VDANYKDVAN IDIELMKPRV ISGKVSLPDG VAPSGGISLN IAIFNETDGN SQIITIPAGK
SSATYSISVP PNDPGYEYTV RYENWSNKIY TTYGYYNSKG TTNNMSYAQL VDVNEKDASN
IDMVLLKKAT VSGVIEVPEN AVLPEEGLHV RIYVSNDVET YSANVTIPYG TSSVPYSVHV
EEGTGYRLFY VLDSNESFME YGYYADSGVY TDKKMAKVFN VYNQNISGYK LKLLEKRRIS
GKLIVPDGAF ENEGYFEVAI SATNGFDTGT AKVLVPYGKA EANYTLTLPA GSGYILQYEI
SKIKGYTSVG YYGAEGTVRN RNEALALDLR ETDLTDIDIS LIQDMTISGT IRIPQGTAPA
GGIEVVVTAT DESGNMASEK VTIPEGESSA DYYLNVPPNA PDSGYRVRYS VAADKYAAIG
YYSESGTKAL PNEATPVDVS SKNAEEINLV LIEKKIVRGV VSIPEGAAGK GGLPVTIKAT
SRLFSVEDAV TVTIPEGESM APYTLWVSPN VEGADYIISY EVSNTAYINS GYYNQAGTVV
DINMATPVDV SNGDYTNANI SLIRSRKITG KLILPEGETA PKGGLPVTVF AEKTGYTGYR
VTKTVTIPEK QSSVDYILYV PESTSKAVKL KLQASTGNGT EDKADDYVLN TEVFVPVIDG
SGSSEYKVGY SYTQDEQYFR SGFYTKDGTV PAISMAGTVN VAKKDVQNVD LTLLKKNRTI
KGVVKLPDGK TASGNIEVEI TAENDALDFA PQKTVTIGKG KSSVEFEIAV PSLDSYRIKY
TIKSTTDGYV TSGYYATTGT VGKPELSTLV STSSGNVDGI NLNLIPGMEI RGGVSLPTGQ
QVNRNDFWLW VSASNENYES SVYVTIAKGS SSANYSLYVP EGSGYIVSYS ILPLFGEYVQ
KGYNNASVTT ANKDSATKYN VTKNLSGINL TLLPLDRAIS GTVSLPDGTA SVGYTIHVPA
NKRGSGYQLE YSVVSGNEGG AYKEKGYFSL AGTSADKAKT SIIDVSSKNS TGNDMTLLAD
TMVPVDAIML DKYQVTIQSG KTVNLKVKYL PENATNKTIK WTSGNTNVAE VSSEGVVKAK
ATGTSVITAR THNAVITVFT VRVVPAESSA LSIDKLSVSI NPGEKEQLEA IFTPQSEDDK
VIWKSDNTDV ATVSENGLVT AKKSGTAVIT AMSSKDPSVY ATCEVVVITP VTGVEIDKTR
LEIKVGYNEK LTAGVLPETA SYKGITWISS DESIVRVSQS GEVTAVSIGT AVVTASSIYN
PSLKAVCTVK VIPVPVEEIK LDKQTVTLYP DEYILINAEV LPSNASDKRV AWKSENTNIA
TVTAEGLVKA VNIGETKIIA TSLYDSSKTA VCVVKVVARP VTKVTFKNVP ESILVGQSKA
LEVTVSPSNA TDKTLVWTSS DEKTATVTQD GVVTGKGVGT VTITAAWKND PSVEAVCTLK
VEPVKVTAIK LKQTSISLGI GDEVKLVAEI IPENATNKEI IWSTSDRDIV TVSSDGVVKA
VSMGRATITA TSKESSSIKA TCSVTVTGIE VSQVRINNKP STLAVGSTHK LTVTINPTNA
SDKTLEWSSS DTSIATVSSS GVVTAKKVGT VTITVSSKSK PSCKDSCTIK IVEATPSPTP
AEIVNEPVVI PGGPGGGGGG VPVASIGGSP TPSATPTVTP TATPTSTPSA GKTPAASPTP
APVVTDTNSL DFFTDIKGHW TAEFFADLLR REIVNGYPDR TLRPNAPITR AEATVMVVKA
AGFEVSDSIP LTCTDKDSVP AWAKPYVATA MHKGVVKGYE DGSFRPSNRL TRQETVVLVL
RAFGIEEAQD KTLAFADSDK IPAWSRGYIK KAVELGIIKG YSDNTFGPQR EITRAEVVTI
ISKCIQLKGR