Gene Information |
Locus tag | Cthe_0250 |
Symbol | |
ID | 4808598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 305803 |
End bp | 306993 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105662 |
Product | hypothetical protein |
Protein accession | YP_001036682 |
Protein GI | 125972772 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
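The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive on both ends.
START_BP = 305803
END_BP = 306993


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1191 396
```

Under these assumptions the result agrees with the Gene Length (1191 bp) and Protein Length (396 aa) fields.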
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00470202 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGAAA TTAAAGGAAA ATATAATACG GCAAAAGTTT TTACCAACAA CGTTGAACCG GAGGCAATGG CTCAGATATT GGAACTTTGC AACCAGGAGT TTGTAAAGGA CAGTGTAATC AGAATTATGC CTGATACCCA TGCCGGTGCA GGTTGTACCA TTGGAACCAC CATGACCATT GACGACAAAA TTGTGCCAAA TCTGGTGGGT GTTGATATAG GCTGCGGAAT GGAGACCATA AAGCTGAAAA ATAAAAATAT TGAATTGAGC AAGCTGGACA AAGTAATTCA TGAATTTATC CCGTCGGGTT TTGACATTCG CAAAAAAGAG CATCCGTATG TCAGGAATAT TAATTTGGAT GAGCTATTGT GCAAAAAACA CGTGGATATA AACCGGGCAA AGCTGAGTAT TGGCACCTTG GGCGGTGGCA ACCATTTTAT TGAGGTGAAC AAAGACAGTG AGGGAAACCT GTATCTTGTG GTGCATTCGG GAAGCAGACA TCTGGGAAAG CAGGTTGCGG AGTATTATCA GGAGTTGGGT TACAAGGAGC TTGTAAAAAA CACTGAAGTG ATTAAAGAAA TTATAGCAAA GCTGAAAGCG GAAGGCCGTG AAAAGGAAAT CGAGAAGGAA ATAAAAAAAA TCAAGCCGCC GAATATAAGC AAGCAGCTTG CCTATGTCCA GGGCAAAAGT TATGAGGACT ATCTTGCCGA CATGAAATTG GTGCAGAAAT TTGCGGTATT GAACCGCAAA GCAATTGTGG ATGAGATTGT GAGGCGCATG AATTTTAAAA TTGAAGAGCA GTTTACCACT ATTCATAATT ATATTGACTT AGACAGCATG ATACTCAGGA AAGGTGCCAT TTCAGCACGC AAAGGGGAAA GAGTGCTGAT TCCGATTAAC ATGAGGGACG GAAGCCTTAT CTGTATCGGC AAAGGCAACA AGGATTGGAA CTATTCCGCG CCCCATGGGG CGGGAAGACT GATGAGCAGG GCTAAGGCAA AAGAAGTTAT AACGCTTAAG GAATTCAAAG AATCAATGAA GGGAATTTAT TCAACAACGG TCAACAAATC CACCATTGAT GAATGTCCCA TGGCTTACAA GCCTATGGAG GAAATCATTG AAAATATTCA GGACACCGTT GAAATTGTAG ATATTATTAA GCCAATATAC AACTTTAAAG CGGCTGAATA G
|
Protein sequence | MIEIKGKYNT AKVFTNNVEP EAMAQILELC NQEFVKDSVI RIMPDTHAGA GCTIGTTMTI DDKIVPNLVG VDIGCGMETI KLKNKNIELS KLDKVIHEFI PSGFDIRKKE HPYVRNINLD ELLCKKHVDI NRAKLSIGTL GGGNHFIEVN KDSEGNLYLV VHSGSRHLGK QVAEYYQELG YKELVKNTEV IKEIIAKLKA EGREKEIEKE IKKIKPPNIS KQLAYVQGKS YEDYLADMKL VQKFAVLNRK AIVDEIVRRM NFKIEEQFTT IHNYIDLDSM ILRKGAISAR KGERVLIPIN MRDGSLICIG KGNKDWNYSA PHGAGRLMSR AKAKEVITLK EFKESMKGIY STTVNKSTID ECPMAYKPME EIIENIQDTV EIVDIIKPIY NFKAAE
|
| |
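The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal Python sketch for stripping that layout, computing a GC fraction, and translating a coding sequence. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names are hypothetical, and only the first 30 nucleotides of the gene sequence are used as the example input.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Note: alternative start codons (e.g. GTG, TTG) are not given
    special treatment; this gene starts with ATG.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGATAGAAA TTAAAGGAAA ATATAATACG"
print(translate(prefix))  # prints: MIEIKGKYNT
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.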