Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2844 |
Symbol | |
ID | 4809124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3360466 |
End bp | 3361671 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108264 |
Product | hypothetical protein |
Protein accession | YP_001039236 |
Protein GI | 125975326 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000924463 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTGCCG CAGTAGTTAT TACCGGGTTG GGGATAGCGG CGGCATTGAC AGGCGGTATA TTGGGAGTCA TACTGGCAGG AGCATTCTGG GGAGCATTGG CCGGAGGATT GATAGGGGGA GCGGTTGGAG GAATAGCCGC TGCGATAAAT GGAGGATCGT TTCTGGAAGG ATTTGCGGAC GGCGCTTTAA GCGGAGCAAT TTCCGGAGCG GTGACAGGAG CGGCATGTGC CGGGCTTGGT GCTTTAGGAG CTCTAGCAGG GAAAAGCATC CAATGTATGA GCACAGTGGG AAAAGCGATA AATGTTACGT CAAAGGTTAC GGCAGCACTT TCTTTTGGTA TGGATGGATT TGACATGCTG GCAATGGGAA TATCATTGTT TGATCCATCC AATGCATTGG TTGAATTCAA CCGGAAGCTG CATTCCAATG CACTTTATAA CGGATTCCAG ATTGCTGTAA ACGCGCTGGC TGTTTTCAGT GCCGGGGCGG CATCGACAAT GAAGTGCTTT GTTGCAGGTA CAATGATATT GACTGTGGCA GGCTTGGTTG CGATAGAGAA TATCAAGGCA GGGGACAAGG TAATTGCGAC GAATCCGGAG ACTTTTGAAG TAGCCGAGAA GACGGTGCTT GAGACATATG TGAGAGAAAC AACGGAGCTT TTGCATTTGA CAATCAATGG AGAGGTAATC AAGACAACCT TTGAGCATCC GTTTTATGTT AAAGATGTGG GTTTTGTTGA AGCGGGAAAA CTGCAAGTAG GAGATAAGTT GGTTGATTCA AGAGGCAATC TTTTGGTGGT GGAAGAGAAA AAGCTTGAAA TAACAGATAA GCCTGTAAAG GTTTACAATT TTAAGGTCGA TAATTTTCAT ACGTATCATG TTGGCGAAAA TAGGGTATTG GTTCATAATG CGAATAAGTA TGTTAAGGGA ACGAGTAGTA CTCTAAAAAG TTTGGGAAAC AAGACTGAAC AATATGTTAC AAAACGAGGC TGGACATGGG ATTCTATGGA CGATGTTGTT AAAAAAACAT ATACTACTCG TGAAGCTATT AACAAAGCAA CTGGTAATCC AGCAACTGCT TACTACAATA AAGCTGGCGA TTATGTAGTT GTGGATAATG TTACCGGTGA ATTAGTACAA GTTAGTAAAT TTGGTGATAC TGGATGGATT CCTGACGCGA CAATTAAAAA TCCATACAAA CCATGA
|
Protein sequence | MAAAVVITGL GIAAALTGGI LGVILAGAFW GALAGGLIGG AVGGIAAAIN GGSFLEGFAD GALSGAISGA VTGAACAGLG ALGALAGKSI QCMSTVGKAI NVTSKVTAAL SFGMDGFDML AMGISLFDPS NALVEFNRKL HSNALYNGFQ IAVNALAVFS AGAASTMKCF VAGTMILTVA GLVAIENIKA GDKVIATNPE TFEVAEKTVL ETYVRETTEL LHLTINGEVI KTTFEHPFYV KDVGFVEAGK LQVGDKLVDS RGNLLVVEEK KLEITDKPVK VYNFKVDNFH TYHVGENRVL VHNANKYVKG TSSTLKSLGN KTEQYVTKRG WTWDSMDDVV KKTYTTREAI NKATGNPATA YYNKAGDYVV VDNVTGELVQ VSKFGDTGWI PDATIKNPYK P
|
| |