Gene Cthe_2844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2844 
Symbol 
ID4809124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3360466 
End bp3361671 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content44% 
IMG OID640108264 
Producthypothetical protein 
Protein accessionYP_001039236 
Protein GI125975326 
COG category 
COG ID 
TIGRFAM ID[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000924463 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGCCG CAGTAGTTAT TACCGGGTTG GGGATAGCGG CGGCATTGAC AGGCGGTATA 
TTGGGAGTCA TACTGGCAGG AGCATTCTGG GGAGCATTGG CCGGAGGATT GATAGGGGGA
GCGGTTGGAG GAATAGCCGC TGCGATAAAT GGAGGATCGT TTCTGGAAGG ATTTGCGGAC
GGCGCTTTAA GCGGAGCAAT TTCCGGAGCG GTGACAGGAG CGGCATGTGC CGGGCTTGGT
GCTTTAGGAG CTCTAGCAGG GAAAAGCATC CAATGTATGA GCACAGTGGG AAAAGCGATA
AATGTTACGT CAAAGGTTAC GGCAGCACTT TCTTTTGGTA TGGATGGATT TGACATGCTG
GCAATGGGAA TATCATTGTT TGATCCATCC AATGCATTGG TTGAATTCAA CCGGAAGCTG
CATTCCAATG CACTTTATAA CGGATTCCAG ATTGCTGTAA ACGCGCTGGC TGTTTTCAGT
GCCGGGGCGG CATCGACAAT GAAGTGCTTT GTTGCAGGTA CAATGATATT GACTGTGGCA
GGCTTGGTTG CGATAGAGAA TATCAAGGCA GGGGACAAGG TAATTGCGAC GAATCCGGAG
ACTTTTGAAG TAGCCGAGAA GACGGTGCTT GAGACATATG TGAGAGAAAC AACGGAGCTT
TTGCATTTGA CAATCAATGG AGAGGTAATC AAGACAACCT TTGAGCATCC GTTTTATGTT
AAAGATGTGG GTTTTGTTGA AGCGGGAAAA CTGCAAGTAG GAGATAAGTT GGTTGATTCA
AGAGGCAATC TTTTGGTGGT GGAAGAGAAA AAGCTTGAAA TAACAGATAA GCCTGTAAAG
GTTTACAATT TTAAGGTCGA TAATTTTCAT ACGTATCATG TTGGCGAAAA TAGGGTATTG
GTTCATAATG CGAATAAGTA TGTTAAGGGA ACGAGTAGTA CTCTAAAAAG TTTGGGAAAC
AAGACTGAAC AATATGTTAC AAAACGAGGC TGGACATGGG ATTCTATGGA CGATGTTGTT
AAAAAAACAT ATACTACTCG TGAAGCTATT AACAAAGCAA CTGGTAATCC AGCAACTGCT
TACTACAATA AAGCTGGCGA TTATGTAGTT GTGGATAATG TTACCGGTGA ATTAGTACAA
GTTAGTAAAT TTGGTGATAC TGGATGGATT CCTGACGCGA CAATTAAAAA TCCATACAAA
CCATGA
 
Protein sequence
MAAAVVITGL GIAAALTGGI LGVILAGAFW GALAGGLIGG AVGGIAAAIN GGSFLEGFAD 
GALSGAISGA VTGAACAGLG ALGALAGKSI QCMSTVGKAI NVTSKVTAAL SFGMDGFDML
AMGISLFDPS NALVEFNRKL HSNALYNGFQ IAVNALAVFS AGAASTMKCF VAGTMILTVA
GLVAIENIKA GDKVIATNPE TFEVAEKTVL ETYVRETTEL LHLTINGEVI KTTFEHPFYV
KDVGFVEAGK LQVGDKLVDS RGNLLVVEEK KLEITDKPVK VYNFKVDNFH TYHVGENRVL
VHNANKYVKG TSSTLKSLGN KTEQYVTKRG WTWDSMDDVV KKTYTTREAI NKATGNPATA
YYNKAGDYVV VDNVTGELVQ VSKFGDTGWI PDATIKNPYK P