Gene Information |
Locus tag | Cthe_2471 |
Symbol | |
ID | 4809851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2942646 |
End bp | 2943926 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640107886 |
Product | hypothetical protein |
Protein accession | YP_001038866 |
Protein GI | 125974956 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
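The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Information table for Cthe_2471.
length = gene_length_bp(2942646, 2943926)
protein = protein_length_aa(length)
print(length, protein)  # 1281 426
```

Both results match the Gene Length (1281 bp) and Protein Length (426 aa) fields reported in the record.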
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000072504 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGA CAAAGAATGA TATTGCATGG GAACGAATTT TTAAGAAATA TAGAATATTA GAGAAAATAA AGAAAAATGG GGCTTTTGAA ATAACGTCAG GGCAAATAAA TGAGTTTAGA GAAGCAAGGT TAATGACAAA ATTTGATCAC CGAAAAAATT TACCGAAGAT TTTTGAAGAA AATAATTTTT CTATTCTTCC TATTACTAGA GGTAGTTATT TAATTGCGCA GTTTAAGGCT TATCATAGGC TTGAGGAAAA AGAAACAGAA ATAATCAAGA TTCCATTTCC TACTTATATT GAAAGTATTG ATTATGAAAA CATAACAAGC GAGGCTGCGG CTTTAAACTG TGCGTATGTT TCAGGTATAT TGGCTGATTT TATTGAGGAT GAAGAAATGG TTCCAACAGT TACAGGTCGA ATGAGTTCTG ATGCGTTTTG TTTTTATATT AATACTTATT CGGGGTCTAA GTTTAAAGTT AATGTTACTA ATGCTCAAAT TGAGATAGAT GGTGGATATG AAGGGCTGGG AACCTTTTCT TTAATTGAAG CGAAAAACTC GTTATCAGAT GATTTTATAA TACGACAAAT ATATTACCCT TATAGGTTAT GGCATGATAA AATTAACAAA AAAGTTAAGC CAATATTTAT GACTTACTCT AACGGTATTT TTACTTTTTA TGAGTATGAG TTTCAAGACC CTGAAGATTA TAATTCTCTT ACTTTAGTAA AACAAAAAAA ATATAGCATA GAGGAAACAG AGATTGGGCT TGATGACATA ATAGAGATCT ACAAAAGGAC AAAAATTATA AATGAACCAG AAGTTCCATT TCCACAAGCA GATTCATTTG AAAGGATAAT TAATCTTTGC GAGCTTTTAA ATGAATCAGA GTTGACTAGA GATGAAATAA CAACAAACTA TGATTTTGAC TCTAGGCAAA CGAATTATTA TACAGATGCA GCTAGATACT TGGGATTAGT ACATAAGCGT AAAGAAGGTA GAGAGGTAAT ATTTTCGTTG ACAGAAGAGG GGGAAAAATT ATTTAAACTG AAATATAAGC CAAGACAATT AAAATTTGTT GAATTAATTT TGTCCCACAA AGTTTTTAGA GAAGTTTTTG AATTGTGTCT GAAAAATGGA AAAATGCCAG ATAAACATGA AGTAGTGAAG ATTATGAGAT ACAGCAATTT GTATAAAATA GAATCCGAGA AAACATTTTA TAGGCGTGCT CAAACTATAA TGAGTTGGAT TAAATGGATA TTAGAATTAA CTAGATTGTA G
|
Protein sequence | MSETKNDIAW ERIFKKYRIL EKIKKNGAFE ITSGQINEFR EARLMTKFDH RKNLPKIFEE NNFSILPITR GSYLIAQFKA YHRLEEKETE IIKIPFPTYI ESIDYENITS EAAALNCAYV SGILADFIED EEMVPTVTGR MSSDAFCFYI NTYSGSKFKV NVTNAQIEID GGYEGLGTFS LIEAKNSLSD DFIIRQIYYP YRLWHDKINK KVKPIFMTYS NGIFTFYEYE FQDPEDYNSL TLVKQKKYSI EETEIGLDDI IEIYKRTKII NEPEVPFPQA DSFERIINLC ELLNESELTR DEITTNYDFD SRQTNYYTDA ARYLGLVHKR KEGREVIFSL TEEGEKLFKL KYKPRQLKFV ELILSHKVFR EVFELCLKNG KMPDKHEVVK IMRYSNLYKI ESEKTFYRRA QTIMSWIKWI LELTRL
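The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any computation. The sketch below shows one way to strip that formatting and compute a GC percentage, such as the GC content field in the record. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing (blocks of 10) and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
example = "ATGAGTGAGA CAAAGAATGA"
print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 35.0
```

Passing the full gene sequence to `gc_percent` in the same way gives a value that can be compared with the GC content field.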