Gene Ccel_2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2378 
Symbol 
ID7311047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2809055 
End bp2820142 
Gene Length11088 bp 
Protein Length3695 aa 
Translation table11 
GC content40% 
IMG OID643609303 
Productamino acid adenylation domain protein 
Protein accessionYP_002506691 
Protein GI220929782 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAATA AATCAATGAA GCTGAAAGAT ATAAAAATAG ATAAATTGCC GGATCTTAAG 
AATATCAGCT TTGAGGAAGT ATCAACACAA GATGTGGCTA TAATAGGAAT TTCCGGTAAA
TTTGCTTCCT GTGGCAATAT GGATGAATTC TATGTCAGCC TGCGGAACGG TGATAATCTG
GTCGGTAAGT TTCCGCAAAG AAGAAAGCAG GATATCGACA CCATACTTCG TTACATGGGA
ATATCTGAGA AAGAAACGGT GTATTATGAA GGGGCTTTTT TAAATGAGAT AGACAAGTTT
GACTGTAGTC TGTTCAATAT ATCACCTAAA GAAGCCGGGC TTATGGACCC AAACCAGAGA
CTCTTTTTGC AGACTGCCTG GAATGCAATA GAGGATTCTG GATACGGAGG CGACAGGTTA
AGGAGTTCAA GAACGGGGGT TTATGTTGGC TACTGCGGGG ATTTTAATGA GGATTACAGA
AAGTATGTAC AGCTAATGGA CCCTTCAATG GCTGCCCTTT CTGTAACAGG TAATATAAAA
TCCATAGTTG CCAGCAGGAT TTCATATCTA CTGGATTTAA AAGGCCCAAG TATGCTTGTG
GATACAGCAT GTTCATCAAC CCTTGTAGCT ATTCATTTAG CATGTCAGGC ATTAAGAAAG
GGAGAGTGTG AAACAGCAAT AGCGGGTGGG GTTAAACTTA TCTTATTGCC GGTGGATAAT
AATCCTCAGA ACGGTATAGG AATAGAGTCA TCATCCGGTG TAACAAGATC CTTTGATGAA
TACTCAGACG GAACAGGCTT CGGAGAGGGA GTAGCAGCAA TAGTTCTGAA GCCTCTGAAA
AAAGCAATTG AAGACAGGGA TAACATTTAT TCAATAATAA AAGGCAGTGC GGTCAATCAG
GATGGAACAT CAATGGGAAT TACGGTGCCT AATGCTCAAG CTCAGTCGGA TGTAATAATA
AGAGCATGGA AGGATGCCAA AATAGACCCT GAGACAATAT CCTATATAGA AGCCCACGGA
ACAGGGACAA ATCTAGGAGA CCCAGTTGAA ATAGACGGTA TCGAAATGGC ATTTGCCCAA
TATACCAACA AAAAACAATT TTGTGCTGTG AGTTCCGTAA AAACAAATAT TGGACATCTT
GACAGTGCGG CAGGCATAGC AGGACTTTTG AAAGCCGTGT TGTCTCTGAA ACACAAGGAA
CTGTTTCCAA GTCTGCATTT TGACAGACCC AACCAAAAAA TAAATTTTCA AAATTCTCCT
GTATATATAA ATGATTCACT GGAGCAATGG GAAAATCACG GTGGACCAAG ACGTTGTGGG
ATCAGTTCTT TCGGATTAAG CGGGACAAAC TGTCATTTAG TATTGGAAGA ACCACCCCAG
ATAAAATCCA TTCCGGAGGA ATCACCGGGT TGTAATTACC ACATTTTTAC ACTATCCTCA
CAAAACATCA ATGGTTTGAT GGAGCAGCTT AAAAATTATG AAAAATACCT TAACAGAGAG
AGGGAAGTCA GTCTGGGGAG TCTTTGTTCC ACCACAAACA CCGGACGGGG ACATTACAAT
TTCAGACTGG CCTTTGTAAT TACTAGCAAA AATGAACTCA AGGAAAAGAT TCAAAAGCTT
TCTACGGGTG GATTGGAAAA TTTCGAAAAG TACGGTGTTT ACTTCGGTAA ACACAAGATT
GTTGATAAAG AAAAGGAGTC ACGAGGAAAT AACGAGTATT CCCAAGAAGA AATAGCTGAT
TTGAGCCTTC AAACTGAAAC ATATCTGAAT GAGGCTATAA ACAATGAAAG TCAGCAAAAA
AAGGTTTCCC TTGAAAATAT CTGTGCTCTT TATGTAAATG GAGCGAATGT GCAGTGGAAT
AGGCTTTATA AAAGCGGATC ATATGCAAAA GTCAGCCTCC CGGGGTACTC CTTTATGAAA
AAAAGGTGCT GGATAGAGCA CGAAAATTCA AACAGCCACA ATTCAAAGCA TCCATTAATA
GACAAGCATA TTCTTGAATC TGTGGATTTG GATGTATTCT CTACTACATT CAAAGTGGAC
AGGCAATGGG TTCTCAATGA ACATCAAATA GGCAGCAATT ATGTTGCTCC CGGTACTACT
TATCTGGAAA TGATCAGGGA GGCGAGCCGA AAGCATTTCG GAAACCATCC TTTGAAAATG
CAGCAGGTAA CATTTATACA TCCACTTATA TTAAAAGCTG ATGAAGAAAA AGAAGTACAT
ACAATAATAA AAAGGGATGA AGATACCCTT GAATATACCA TCGCAAGTAA ACAGAAGGAA
AATGGACAGT GGCTGAAGCA TTCTACAGGT GCAGTAGCGG GAATTCCCAA AGAGGATGCA
AGCTCCCTTA GTTTTGACTT TGAAAAGATA AAAGAAGCCT GCCCTAACGT AATAGATATA
GGGAATATGG GAAGCAGGGA GGGCATACTC AGGTTAGGAC CCAGATGGGA TACCCTAAAG
GCAATGTACA CTGGGGACAG AATGGCCTTG GCGTGTTTTG AACTGGGAGA AGAATATTCG
GAGGATCTGA GCAAGTACCA CATGTATCCT TCACTTATGG ATTGTGCAGT TAATATCGCT
AACATGAGTG TGGGAGAAGG GCTGTATCTG CCTCTGTCCT ATGGAAGCCT GAAGATATAT
GAAACATTAC CCCGAAGTTT TTATAGCTAT TTAGTAAGAA AAGACAATAT GTCCTTAAAT
AATGAAATAG CTATCTTTGA TATAAAGCTT ATGGATGAAT ATGGAAAAGC ACTGGCGGAA
ATAGAAGATT ATTCTGTTAA AAAGGTTCAT AATACGGGGC TTGAAGCGGA AGACCCTGAC
AACCTTTTTT ATAAAACCGG ATGGGAGAAA CAGGAGCAGG AATTTTTGAC AGGTGAGAAT
GCTGATTTTT CAGGTGTTAC GATTATATTC AATGATGAAT ATGGTATGGG TGAGGAATTG
GCACAAAAAC TAAGAGCAGC CGGGACGGAA ACTGTGGAGG TTGAATTCGG CATAGATTTC
CGGCAGATAA ACCATAGCAA ATATATGACA GGCACAGACG AAGGTGACTT TAGAAAGTTA
TTTTCAGAGT TAAATGGCAG AGCTGTTAAC CGGATTATAT ATTTGTCTGC CTTGAATTTT
GATACCGATA TGGATATAGA CGGCTTGGAT GAGGCTATGG AAAGAGGTGT TTACAATCTG
CACCGTTTGA CAAAAGCCAT ATCCGAGAGT ATGCAAAATG ACCTTGATTT GGTAATTATA
GCTGATAATG CTTCGGAAGT AGTTGAATAT CAGGAAAAAA TAAATGCACT CAGTGGGTCC
TTATATGCAA TGGGGAGGGT AATCGGTCAG GAATATGACA ACATAAGGTG CAGGTGTATT
GATATAGACA AAGCTGTGGA TACAGACTTA ATCATTTCTG AGTTGGCCCA AAAGAAGTTT
TTATTTAATG TTTCATATCG TAACGGACAA AGGTATATTC AGAACCTGCA AAGACAATCC
CCGTATAGTA ACGGCGGAGA TAAGATTCAA ATCAGGGAGG AGGGTGTGTA TATTGTTGCC
GGAGGTTTAG GGGGAATTGG GCTGGAGATA TGCAGATATC TCGCAGCAGA GAAAAGAGTG
AACCTGGTGA TGATTAACCG CTCCCCTTTG CCTGAACGTA GTTTATGGGA CGATTTACTT
AAAAACACCT ATGAGGGAAT AAATGATACA GACTATAACA TCATTGACAG AATCCAGTCA
ATAAAAGAGT TGAAAGACAT GGGCAGCAAA GTATTGTGTT TCAGTGCTGA TTTAGCAGAT
ATGGCGTCAG TAAGACTGGT TCTTGATGAA GTACGCAAGC ATCATGCAAA CATTAATGGG
GTCATAAATC TGGCAGGTGT GTTTGAGGAA GGAATACTCA TCAAAAAGGA TACTGAAGCA
CTAAAAAGGG TCATTGCCCC AAAAGTAAAC GGGACACTGA TTTTAGACAG ACTGACGGAA
AACGACAACC TGGACTTTTT TGTTATGTTT TCTTCAATAG CTTCATTCAT AGGAGGTCAT
GGACAGGGAG AATATTCAGC AGCAAATGCT TTTATGGATT CCTTTGCAGA ATATAAAAAC
AGAACAGGGA GAAAGGTAAT TTCGATTAAT TGGACCGGAT GGAATCAGGT TGGTATGGCT
TCCCAATACA ATCTGTCTGA TGGAAGAAGA ATATTCAAGC TCATTTCAAA AGAAAAGGCC
ATAAAGGCTT TTGACAGGGC TTTGACTGCC AGTCTGCCTA GGGTGGTTAT AGGAGAGTTG
GACTATGAAT TTATGGCTGA ATACGCCATG GACAATGCAT ATATACAGTT ATCCCAGGAA
ATAAAAAACA GTTTGAAGCC TTATATAAAG ATGCCGCAGC AGGTAAAACA ACCCGGAAAG
AGGAAAATCG TACTTCCTAA AGGCAGGCAA AATCAGGAAT ACACACCCAT GGAGAAGGGC
GTTGCAGCGG TTTGGGGAGA AGTGCTGGGT CTTGAAGAGA TTGATATATA CGAAAATCTG
TATAATCTGG GCGGGGATTC CATAATTGCA TTGAGGATTG CAAATGAGAT TAATAAGAAG
CTCAATGCAA GCATCAGGAT AAGTGACCTG TTTGAATACT TGACAGTACA AAGGCTTGCA
GCATTCCTTG AGGATAAGAC ACCTGAGAAG GAAGTGTCTG TCAAAGACCT CAGGGAATCC
GAAAAGGAGC ATACTTACTA TGAACTGTCC AATATCCAAA AAAGAATTTG GTTTTTACAG
AATTATGATC CTCAAATGAC AGTATACAAT CTACCGCTGG TATCATCTAT AAATGCGGAG
TTGAACGTTC AGGTACTGAA AGAGGCCGTT AACCTGCTTA TTCAGAGGCA TGAGGTCTTA
AGAACTGTAT TCGGAGAAGA AAATGGAGAG CCTTACCAGA AAATTCTGAC CGTTTATGAA
TATGAGCCTG AAGTGGTGGA TTTAACAGGG GAGCCTCATA AGGAGAGCGT GCTCAATGAG
CTTATAGCTC AAGAAAATAA AAAACCCTTC AATCTGTCCA ATCCACCCAT GAGAGTTGTT
GTTTACAAGT TGAGTAGTTC CTCCTACTGC CTCTATCTCA ATATACATCA TATTGTTACC
GATGGATGGA GCATGGGAAT ATTCAGTTAC GAACTAATGA AATTGTATGA GGGAATTGTC
TTGGAAAAGG TTGTGGAACT GGAGCCGCTG AATTTCAGGT ATACGGACTG GGTAAAACAG
CAGCTTGAAT GGCAGGATAG TTCGGAGTTT GTGGAGATGG AAAATTACTG GCTGCAGGAA
TTGTACAAAC CGCTTCCGGT ATTAAATCTT CCCGTGGATT ATAAAAGACC GCAGATGCAA
ACCTACAACG GCAGTTTCAT TAAATTTAGC ATAGACGGGA ATACAACAGC CAAATTAAAA
GAATTTGCAA GGCAACATAA TATAACCCTG AATATGGCCC TATTGTCTGC TTATTTTGTA
CTGCTTAAAA AGATTACTGC AGACAAAGAC ATAGTTGTAG GCTTACCGGT GACAGGAAGG
GAAAACAAAG AGCTGGAGAA TATAATGGGT GTATTTATAA ACACCTTATC AATCCGTATA
AATTTTGAAA ATATTTCTTC ACTCAATGAC CTTATAACCT GTGTACGTGA AAAGTGTCTG
AAGGCATATC GGAATATAAA ATACCCCTTT GACCTGATAA TATCGAAATT AAATCCTGAG
CGTGACTTGA GCAGAAGTCC TATTTTTTCA ACTGTTTTTC AATTATACGA TAAAATTCCG
CCTGAGACTG AGGGCAGCAG TATGTTTGAA CTTAGTATGC TTTGTAGGGA AGAGGATAAC
CGTATAGAAA TCAGAGCCGA ATATAACACC GATCTTTTTG AAAAACAAAC TATTGAAAGA
TTTGCAGTAT ATTATACAAA TATTATTAAT GCCTTCTTGA TAGAAAGTGA TATGAGTATT
GACGGTATAG AAATCTTATC CCTGGAGGAG ACAAATAGAA TCATAACACT CTTTAATGCT
ACCGAAAGGG AATATGACAG AACTGCTAGC ATAGATACTC TTTTTGAACT TCAGGCAAAG
GCTTTACCAC ATTCTCAATG TATAATACAG GGGAATACAA TTTATACATA TGCCCAGATA
AATTCATTGG CAAACTGTAT TGCCAGGACT CTTTTAGAAA AGGGTGTAAT GAAGGGTGAT
ATAGTTGGTA TCATGGTGGA ACGTTCATGT AATATGCTGG TGGGGATACT TGGTATTCTC
AAGGCCGGAG CGGCATATCT GCCTATAGAC CCTGAATACC CGGGGGAACG CATAAATTAT
ATGCTGAATG ACAGCTCGGT AAAGGTTTTA CTGACCAGCG GAAAGTTAAA AGGAACAGTT
GCTTTTTACG GTATTTCAGT TGATATGGAT GATGACGGAC TGTATACAGG TAATTGTGAA
AATTTGTCAA TAAATAACAG GCCGGATTCT CTGGCTTATG TTATATATAC CTCCGGATCT
ACAGGAAAGC CTAAAGGGGT AATGATTGAG CACCAAGCAG TATGTAACTT TATTGAAGGT
ATGGTTGAAA AAATTGAGTT TGGCAGCGGT AAATCAATTC TGGCCCTAAC CTCCATGTCC
TTTGATATAT TTATACTTGA AACAATTCTG CCCCTCTGTA TAGGTATGAA GGTAGTAATA
GCAAGTGAAG AACAGCAAAA GGATCCAAAG CTGTTAAGCG AAATAATTAA ACAAAACAGT
ATTGAAATGC TACAAATGAC ACCGTCACGT CTGCAGTTGC TTTTGAGTGA CAGCAGAGGA
CGGTCTAGTC TGTCAGTGCC CCAGGTGCTG ATGGTAGGAG GAGAAGCCTT TCCACAAGCC
TTGTTGGACG AGGTCAAAAG GTGTACAAAT GCAAGAATAT ACAATATGTA TGGGCCCACC
GAAACCACAA TATGGTCTAC CATACGGGAA CTGACAGACA GAAGTACAAT TGACATAGGA
AAACCCATAG CAAATACACA GGTCTATATT GTCAGTGAAA GCGGTAACCT TCAGCCAATT
GGCATCCCGG GGGAATTATG CATTTCCGGG GACGGATTAT CAAGAGGATA CATAAACAGA
CCAGAACTGA CTTTGGAGAA GTTTTTAGAA AATCCATATA TGCCAGGCAA AAAAATGTAC
AGAACCGGAG ATTTTGTAAA GTGGCTTCCT GACGGTAATA TAGAATATAT ACGCAGAATC
GACCATCAGG TCAAGCTGAG AGGTTACAGG ATTGAGCTAG GCGAAATAGA GGAACTGCTG
CTTAAATATT CAGGAGTGAG GGAAGCAGTA GTAGATGTGA AGGGTGAAGA TAGTGAAAGC
AGAAAACTTG GTGCATATGT GACAGCTGAC AGGGACTTAA CGGAAGTAGA GCTTAAAAAA
TATCTGGAAA ATGAACTTCC GCAATATATG ATACCCACAT ATATTATGGT ACTTGAAGAA
CTTCCTTTGA CTCCAAACGG CAAAACAGAC AGAAAAGCTC TGCCCTGTCC TGTCCTGACT
GGCTTAAATA CCAACGGCTT TGTTGATGCT GTAAATGAAA CTGAAAAGGC ATTGCAGAGG
ATTTACCAGG ATGTATTGAA TATTCAAAGG GCGGGAGTAA ATGATAATTT CTTTGCACTT
GGAGGACATT CTTTAAAGGC TACAATCCTA GTGTCCAGAA TATACAAGGA GTTGGGCACG
GAAATACCTC TTAGTGCGGT TTTTAAAACA CCAACAATAA AAGAACTGGC ATGTTCTATT
AATGGAACTG ATATCAGTGA ATACCAACCC ATTATACCGG TTGAAAAGAG CAGTTCTTAC
AGTTTGTCAC CTGCTCAGAG AAGAATTTAT CTGATGGAAG CTGTGACAGG GGAAAGTACT
GCCTATAATA TACCTGTTTT AATGGATTTG GAAGGGGAGT TTGACAGGGA ACGTTTTGAA
CAGGCCATTA AAAATCTGGT AGCAAGACAT GAGATACTAA GGACCTACTT TGACGCCGTG
GATGGGGTTC CGGTACAGAT AGTACAGGAG TGTATGGAGC CGGAAATTGC ATATATTGAG
ATTAAGGAAG ACGATTCAGA CGGAAACATT CAGAATTTAA TAAAACCGTT CAATTTAAAA
AGCGGGCCGC TTTTTAGAGT AAAGGTGCTG CTGACTGACA ATGGCAGAAA GACCATTATG
TTTGATATTC ATCACATTAT TTGCGATGGT ATATCTTTAG GAATTCTTAC ACGTGAGTTT
ATAGAGCTAT ACAGAGGTAA GGAACTTAAG GAATTAACAG TACAGTACAG GGATTATGTT
TCTTGGAAGA GCAGTCTGTA CGGCAGTGAA AGATATAAAC AGCAGGAAAA CTACTGGATG
GGCATGTTTA GCGGAGATAT CCCTGTTCTT AATATGCCAG CTGATTATAC AAGACCAACT
GTGCAAAGTT ATGAAGGAGA CACAGTGTAC TTTGAATTAA ATAAAGAAAT GTCAAAAAAG
CTTCATGAAG TTGAAACCAG TGAAAAGACT ACTGCCTCCA TGACACTACT GGCTGTTTTT
AATGTTTTGC TTTATAAGTA TACGGGACAG GAGGACATTA TTGTTGGAAT GCCTGTAGCC
GGGCGTGGAA ATGCTGATCT GGAAAATATG CTTGGTGTAT TCATCAATAC TCTGGCTATG
AGAAACAGAC CTGAGGGAAC AAAAACATTT CACAACTTCC TGCAGGAGGT ACGCCATAAT
GCTTTAAATG CTTATGAAAA TCAGGACTAC CAGTTTGAGG AATTAGTAGA TAAATTGAAT
GTCCCAAGAG ATTTAAGCAG AAGTCCTTTA TTTGACGTTA TGTTTATAAT GCAAAACACA
AGCTTTCCTG AGATAGAAGA AGAGGGACTG AGATTTAAGT CCCGGGAGTT TGACAGTAAA
TCCTCAAAAT TTGATTTGAC ATTAGAGTCT GTAGAAAAAG AAAACACCCT GCATTTCAGG
CTGGAATACT GCACAAAAAT ATTCATGAGG GAGACTGTGG AAAGAATGTC TTTGCATTTC
CTGAATATTC TGGAGCAGGT TCTGAATAAC CCTGATATTA CTCTATCAGA TATTGACATA
ATTACAAAAG AAGAAAAGGA AAAAATAGAA GGTGTTTTCA ATGCTACAGA TGTCGATTTC
GGTGATAGAG GAAAACTTAC CGTCCATGAG CTTTTCGAAA GGCAGGCTGA ATTCAGGCCT
GATTCTATAG CTGTAATGTG TGAGGGTACA GGTATAACCT ACAACGAATT GAATGAAAAG
GCGAACAAGC TTGCTCGGTT ATTGCAAAAT GAAGGTATAA AGAGGGAAGA ATCGGTAGGA
ATCATGGTTC ACAAGAGCAT TGAGATGATA ATCGGAATGC TGGGAATACT AAAGGCAGGA
GGGGCATATG TACCTGTTGA CCCTGATTAT CCGGCAGACA GAATACATTA TATGCTGAAG
CACAGCCAGA CAAGATTTTT GATAATCGAC CAAAGCTCCT TTGAAAAAAC AGAAATGATA
AATACTGAAG AAAATAGTCT GGAAGTAGTT ATAAATCTGT CCGAAGGAGC AGGCAAAACA
GCAGGGCTTA TAAAGTATAC GGCTGAGGAT ATAAAAAACC TGTCGCCGTA TAATCTCAAA
AATAAAGCCA ACCCTAAAAA TCTCATGTAT ATAATTTATA CGTCAGGCTC CACAGGGCTT
CCAAAGGGAG TTGGAGTATC TCATGCAAAT GCCGTCAACT ATCTGAATTG GAGCATTGAA
AATATGAGTT TGAGCCATAA AGATGTAATG GCATTAGTGA CCTCTATGAG TTTTGATATT
TCCGTATTTG AAATATTCGG CTCTCTTTTA AGCGGTACGT GTCTTTGCAT AGTACCTGAC
AGCAGGATGA AGGATGGCTC TCTTTTCATG GAATACATAG ATGCCGGTAA GGTCACCATA
TGGCATTCTG TTCCGGCATT GATGATACAA CTGCTTACTG CTGTCAAGAG CAGGAAAACC
CTTGGTAATC AGGAGCTTTT CTCACGTATA AGGTGCATTA TGATAGGCGG TGAAGCGTGG
ACCTACGAGC TGGCAAAAGA TATTCGGGAA TATTTTCATC ATGCAAGGAT AGTAAACATG
TACGGCCCTA CAGAAGCGAC TATTTGGGTA ACAAGCCATG ATGTCCGGGA TAATCCCGGT
AGTTCAACAG TAATACCCAT AGGTAAACCT ATCTCCAACA ACAAAGTTTT AATACTGGAT
TCTTGTAAAA AAATGTGTCC CATAGGCATT CCGGGTGATA TCTATATAAG TGGTTTAAAT
GTAACAAGAG GCTATTACAA GGACGAGGAA AAAACAAGGG AGGTCTTCAC CCTTTATGGC
GAGAAAGGGA GTATTATTTA TCGGACCGGG GATGTGGGTA GATATCTCAG TGACGGCACC
ATAGAGTATC TGGGAAGGAA AGACGGAATG ATTAAGGTAC GGGGATATAG AATTGAAATA
GGCGAAATTG AAAATGTTCT GCTGCAAAAT GAAGAAATTA TACAGGCAGC GGTAGTGGCA
AAGAAGTCAG GAGAAACAAG TAAACTTATC TGCTACTATA CAGCCCCAAG AGAACACACC
TATGAAGAAC TGAGGGGCTG CCTGGAGAAG AAGCTACCTG ACTATATGAT TCCTGCACAA
TTTATATGGT TGGAAAAGAT GATACTGACA CCAAACGGGA AGATAGACCG AAAGTCACTT
GCCGCACTTG ATATTGGCGA ACCTTTCCAA AGCAATGAGA ATTATGCAGT GCCGGAATCT
GAGGTGGAAA GATTTCTGGC AGGTATATGG AGCCAGCTTC TTGACATGAA GAAGGTGGGC
ACCAAAGATA ATTTCTTTTC CTTGGGCGGG AATTCTCTTC TGGTAAACCA GATGCATTCA
ATGATTGATG AAAAATATCC CGGAAAAATC AAGGTTATTG ATATTTTTAA ATATCCAACA
ATATCAAAGC TGGCAGACTT TATGGAAGGC TCAGAACAAA AGAAGTATGA AAATGTTACT
GTTCCAACCT CAGACGATGG TGATGATGAC ATTATCAAAT TACTGGACGG CTTTGAAAGC
GGAGATATAT TAATTGATGA AGTGTTGTCA AAATTAGATG ATATTTAG
 
Protein sequence
MINKSMKLKD IKIDKLPDLK NISFEEVSTQ DVAIIGISGK FASCGNMDEF YVSLRNGDNL 
VGKFPQRRKQ DIDTILRYMG ISEKETVYYE GAFLNEIDKF DCSLFNISPK EAGLMDPNQR
LFLQTAWNAI EDSGYGGDRL RSSRTGVYVG YCGDFNEDYR KYVQLMDPSM AALSVTGNIK
SIVASRISYL LDLKGPSMLV DTACSSTLVA IHLACQALRK GECETAIAGG VKLILLPVDN
NPQNGIGIES SSGVTRSFDE YSDGTGFGEG VAAIVLKPLK KAIEDRDNIY SIIKGSAVNQ
DGTSMGITVP NAQAQSDVII RAWKDAKIDP ETISYIEAHG TGTNLGDPVE IDGIEMAFAQ
YTNKKQFCAV SSVKTNIGHL DSAAGIAGLL KAVLSLKHKE LFPSLHFDRP NQKINFQNSP
VYINDSLEQW ENHGGPRRCG ISSFGLSGTN CHLVLEEPPQ IKSIPEESPG CNYHIFTLSS
QNINGLMEQL KNYEKYLNRE REVSLGSLCS TTNTGRGHYN FRLAFVITSK NELKEKIQKL
STGGLENFEK YGVYFGKHKI VDKEKESRGN NEYSQEEIAD LSLQTETYLN EAINNESQQK
KVSLENICAL YVNGANVQWN RLYKSGSYAK VSLPGYSFMK KRCWIEHENS NSHNSKHPLI
DKHILESVDL DVFSTTFKVD RQWVLNEHQI GSNYVAPGTT YLEMIREASR KHFGNHPLKM
QQVTFIHPLI LKADEEKEVH TIIKRDEDTL EYTIASKQKE NGQWLKHSTG AVAGIPKEDA
SSLSFDFEKI KEACPNVIDI GNMGSREGIL RLGPRWDTLK AMYTGDRMAL ACFELGEEYS
EDLSKYHMYP SLMDCAVNIA NMSVGEGLYL PLSYGSLKIY ETLPRSFYSY LVRKDNMSLN
NEIAIFDIKL MDEYGKALAE IEDYSVKKVH NTGLEAEDPD NLFYKTGWEK QEQEFLTGEN
ADFSGVTIIF NDEYGMGEEL AQKLRAAGTE TVEVEFGIDF RQINHSKYMT GTDEGDFRKL
FSELNGRAVN RIIYLSALNF DTDMDIDGLD EAMERGVYNL HRLTKAISES MQNDLDLVII
ADNASEVVEY QEKINALSGS LYAMGRVIGQ EYDNIRCRCI DIDKAVDTDL IISELAQKKF
LFNVSYRNGQ RYIQNLQRQS PYSNGGDKIQ IREEGVYIVA GGLGGIGLEI CRYLAAEKRV
NLVMINRSPL PERSLWDDLL KNTYEGINDT DYNIIDRIQS IKELKDMGSK VLCFSADLAD
MASVRLVLDE VRKHHANING VINLAGVFEE GILIKKDTEA LKRVIAPKVN GTLILDRLTE
NDNLDFFVMF SSIASFIGGH GQGEYSAANA FMDSFAEYKN RTGRKVISIN WTGWNQVGMA
SQYNLSDGRR IFKLISKEKA IKAFDRALTA SLPRVVIGEL DYEFMAEYAM DNAYIQLSQE
IKNSLKPYIK MPQQVKQPGK RKIVLPKGRQ NQEYTPMEKG VAAVWGEVLG LEEIDIYENL
YNLGGDSIIA LRIANEINKK LNASIRISDL FEYLTVQRLA AFLEDKTPEK EVSVKDLRES
EKEHTYYELS NIQKRIWFLQ NYDPQMTVYN LPLVSSINAE LNVQVLKEAV NLLIQRHEVL
RTVFGEENGE PYQKILTVYE YEPEVVDLTG EPHKESVLNE LIAQENKKPF NLSNPPMRVV
VYKLSSSSYC LYLNIHHIVT DGWSMGIFSY ELMKLYEGIV LEKVVELEPL NFRYTDWVKQ
QLEWQDSSEF VEMENYWLQE LYKPLPVLNL PVDYKRPQMQ TYNGSFIKFS IDGNTTAKLK
EFARQHNITL NMALLSAYFV LLKKITADKD IVVGLPVTGR ENKELENIMG VFINTLSIRI
NFENISSLND LITCVREKCL KAYRNIKYPF DLIISKLNPE RDLSRSPIFS TVFQLYDKIP
PETEGSSMFE LSMLCREEDN RIEIRAEYNT DLFEKQTIER FAVYYTNIIN AFLIESDMSI
DGIEILSLEE TNRIITLFNA TEREYDRTAS IDTLFELQAK ALPHSQCIIQ GNTIYTYAQI
NSLANCIART LLEKGVMKGD IVGIMVERSC NMLVGILGIL KAGAAYLPID PEYPGERINY
MLNDSSVKVL LTSGKLKGTV AFYGISVDMD DDGLYTGNCE NLSINNRPDS LAYVIYTSGS
TGKPKGVMIE HQAVCNFIEG MVEKIEFGSG KSILALTSMS FDIFILETIL PLCIGMKVVI
ASEEQQKDPK LLSEIIKQNS IEMLQMTPSR LQLLLSDSRG RSSLSVPQVL MVGGEAFPQA
LLDEVKRCTN ARIYNMYGPT ETTIWSTIRE LTDRSTIDIG KPIANTQVYI VSESGNLQPI
GIPGELCISG DGLSRGYINR PELTLEKFLE NPYMPGKKMY RTGDFVKWLP DGNIEYIRRI
DHQVKLRGYR IELGEIEELL LKYSGVREAV VDVKGEDSES RKLGAYVTAD RDLTEVELKK
YLENELPQYM IPTYIMVLEE LPLTPNGKTD RKALPCPVLT GLNTNGFVDA VNETEKALQR
IYQDVLNIQR AGVNDNFFAL GGHSLKATIL VSRIYKELGT EIPLSAVFKT PTIKELACSI
NGTDISEYQP IIPVEKSSSY SLSPAQRRIY LMEAVTGEST AYNIPVLMDL EGEFDRERFE
QAIKNLVARH EILRTYFDAV DGVPVQIVQE CMEPEIAYIE IKEDDSDGNI QNLIKPFNLK
SGPLFRVKVL LTDNGRKTIM FDIHHIICDG ISLGILTREF IELYRGKELK ELTVQYRDYV
SWKSSLYGSE RYKQQENYWM GMFSGDIPVL NMPADYTRPT VQSYEGDTVY FELNKEMSKK
LHEVETSEKT TASMTLLAVF NVLLYKYTGQ EDIIVGMPVA GRGNADLENM LGVFINTLAM
RNRPEGTKTF HNFLQEVRHN ALNAYENQDY QFEELVDKLN VPRDLSRSPL FDVMFIMQNT
SFPEIEEEGL RFKSREFDSK SSKFDLTLES VEKENTLHFR LEYCTKIFMR ETVERMSLHF
LNILEQVLNN PDITLSDIDI ITKEEKEKIE GVFNATDVDF GDRGKLTVHE LFERQAEFRP
DSIAVMCEGT GITYNELNEK ANKLARLLQN EGIKREESVG IMVHKSIEMI IGMLGILKAG
GAYVPVDPDY PADRIHYMLK HSQTRFLIID QSSFEKTEMI NTEENSLEVV INLSEGAGKT
AGLIKYTAED IKNLSPYNLK NKANPKNLMY IIYTSGSTGL PKGVGVSHAN AVNYLNWSIE
NMSLSHKDVM ALVTSMSFDI SVFEIFGSLL SGTCLCIVPD SRMKDGSLFM EYIDAGKVTI
WHSVPALMIQ LLTAVKSRKT LGNQELFSRI RCIMIGGEAW TYELAKDIRE YFHHARIVNM
YGPTEATIWV TSHDVRDNPG SSTVIPIGKP ISNNKVLILD SCKKMCPIGI PGDIYISGLN
VTRGYYKDEE KTREVFTLYG EKGSIIYRTG DVGRYLSDGT IEYLGRKDGM IKVRGYRIEI
GEIENVLLQN EEIIQAAVVA KKSGETSKLI CYYTAPREHT YEELRGCLEK KLPDYMIPAQ
FIWLEKMILT PNGKIDRKSL AALDIGEPFQ SNENYAVPES EVERFLAGIW SQLLDMKKVG
TKDNFFSLGG NSLLVNQMHS MIDEKYPGKI KVIDIFKYPT ISKLADFMEG SEQKKYENVT
VPTSDDGDDD IIKLLDGFES GDILIDEVLS KLDDI