Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1301 |
Symbol | |
ID | 4809553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1578033 |
End bp | 1579280 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106724 |
Product | hypothetical protein |
Protein accession | YP_001037726 |
Protein GI | 125973816 |
COG category | [R] General function prediction only |
COG ID | [COG1323] Predicted nucleotidyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTTC TGGGGTTGAT TGTGGAATAT AATCCTTTTC ACAACGGTCA TCTTTACCAT CTTGAGGAGT CAAAAAAAAT AAGCGGGGCT GACTTTGTCG TATGCGTCAT GAGCGGCAAT TTCATTCAGC GCGGAGAGCC GGCAATTGTA AACAAGTGGG CAAGAACAAA AATGGCCTTG TCAGCAGGAG CAGACCTTGT AATTGAGCTT CCCCTTTCCT GTGCCATGGC CAGTGCCGAA TACTTTGCCT CCGGCGCCGT AAGAATTTTA AATGATATAG GAATAGTTGA CTATATTTGT TTTGGAAGTG AACACGGCGA TGTCAAGACT CTCGATTATA TAGCCCAAAT TCTTGTTGAA GAGCCTGAAA GTTACAAATC TTTCCTGAAA GAAGAACTGG ACAATGGCCT GTCATATCCT GCCGCCCGCG AATCAGCCCT GAAGAAATAC ACCGCACATA GCATTAATAT CCCGCAAATA ATCTCCTCAT CAAACAACAT ACTGGGTATA GAATATTTAA AGGCGTTAAG ACGCATAAAA AGCAGCATAA TACCTCTTAC AATAAAGCGC ATTAACAATG ATTACAACAC GGAAAATATC ACCGGAAGCA TTTCCAGCGC ATCATCCATA AGAAAATATA TTTCAACCTC AAATTCAACC TCTTTTGATG ACGTTCTTGC CATGACAATG CCCAAAACAA GCGTCGATAT ACTTTTTGAA GAATTCAGTG CCGGAAGGGG GCCGGTTTTT AAAGAGGATT TTTATCCTGT TGTAACTTCC CTCATACGAA AAATGACGCC GGAACAAATC AGAAATTTTG CTTATGTTTC GGAAGGCCTT GAAAACAGGA TAAAAAGTGC CGCCGATACC GCAGGTACAT ATGAAGAGCT GGTGGAAAGC ATATGCACCC GAAGATACAC CAAAACCAGA GTGCAAAGAA TCCTGATGGG CATACTTATG GGAGTAACCT CGAAGGATTT GGACATGCTA AGCCGTTTTG ACAGTCCTCA ATATGCAAGG ATTCTAGGCT TTAATTCAAA AGGAAAACAG CTTCTTTCCC AAATAAAGAA AAAATCATCA ATACCTCTGG TGTTAAAGTT GTCTGATTTC ATAAAATCCT GTGATCCGGT GCTGAAAAGA AAGCTTGAAT TGGAGATACT TGCCACCGAC CTTTATGTGA TGTGCTATAA AAATCCTGCC TTTAGAAAAG CCGGCCAGGA GTTTACTCAA AATATCATCA TTATGTAA
|
Protein sequence | MKVLGLIVEY NPFHNGHLYH LEESKKISGA DFVVCVMSGN FIQRGEPAIV NKWARTKMAL SAGADLVIEL PLSCAMASAE YFASGAVRIL NDIGIVDYIC FGSEHGDVKT LDYIAQILVE EPESYKSFLK EELDNGLSYP AARESALKKY TAHSINIPQI ISSSNNILGI EYLKALRRIK SSIIPLTIKR INNDYNTENI TGSISSASSI RKYISTSNST SFDDVLAMTM PKTSVDILFE EFSAGRGPVF KEDFYPVVTS LIRKMTPEQI RNFAYVSEGL ENRIKSAADT AGTYEELVES ICTRRYTKTR VQRILMGILM GVTSKDLDML SRFDSPQYAR ILGFNSKGKQ LLSQIKKKSS IPLVLKLSDF IKSCDPVLKR KLELEILATD LYVMCYKNPA FRKAGQEFTQ NIIIM
|
| |