Gene Cthe_1928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1928 
Symbol 
ID4810786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2300254 
End bp2301255 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content41% 
IMG OID640107344 
Producthypothetical protein 
Protein accessionYP_001038339 
Protein GI125974429 
COG category 
COG ID 
TIGRFAM ID[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000122505 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAGGAG CGGCATGTGC CGGGCTTGGT GCTTTGGGAG CTCTAGCAGG GAAAAGCATC 
CAATGTATGA GCACAGTGGG AAAAGCGATA AATGTTACAT CAAAGGTTAT GGCAGCACTT
TCTTTTGGTA TGGATGGATT TGACATGCTG GCAATGGGAG TATCATTGTT TGATCCATCC
AACGCATTGG TTGAATTTAA TCGGAAGCTG CATTCCAGTG CACTTTACAA CGGATTCCAG
ATTGCTGTAA ACGCGCTGGC TGTTTTCAGT GCCGGGGCGG CATCTACAAT GAAGTGCTTT
GTTGCAGGCA CGCTGATATT GACTGTGGCA GGCTTGGTTG CAATAGAGAA TATCAAGGCA
GGAGACAAGG TAATTGCGAC GAATCTGGAG ACTTTTGAAG TAGCCGAGAA GACAGTGCTT
GAGACATATG TGAGAGAGAC AACGGAGCTT TTGCATTTGA CAATCAATGG AGAGGTAATC
AAGACAACCT TTGAGCATCC GTTTTATGTT AAAGATGTGG GTTTTGTTGA AGCTAAAGAA
TTGCAAGTAG GAGATAAGCT GCTAGATTCA AAAGGCAATG TTTTGGTGGT GGAAGAGAAA
AAGCTTGAAA TTACAGATGA ACCTGCCAAG GTTTATAACT TCAAGGTTGA TGATTTTCAT
ACTTATCATG TCGGCAATAA TGGAATATTG GTACATAATG CAAATTATAG TAAGGGAATG
AGTAGTAATA TCCCCGACTA TATAAAAGAT AATCGTGTAC CTTTAGATAA GGAGACAGTA
TTGAACAGTA AGGAGTACCA AAAAACTAAT ATTAAAGTTA AAGGTGCTCA AGTTTACAAA
AAAGGGGATA AATATTATTA CCGTGATACT TTCCATACAG GAGAAGCGGC TCATTTAGAG
GTGTTTGATA AGAGAGGAAA CCATATTGGT GAAGCTAATC CACTAACTGG AGAATTGATA
CCGGGAACAG CAGATCCGAT GAAGAAAATT AAAATAAAGT AG
 
Protein sequence
MTGAACAGLG ALGALAGKSI QCMSTVGKAI NVTSKVMAAL SFGMDGFDML AMGVSLFDPS 
NALVEFNRKL HSSALYNGFQ IAVNALAVFS AGAASTMKCF VAGTLILTVA GLVAIENIKA
GDKVIATNLE TFEVAEKTVL ETYVRETTEL LHLTINGEVI KTTFEHPFYV KDVGFVEAKE
LQVGDKLLDS KGNVLVVEEK KLEITDEPAK VYNFKVDDFH TYHVGNNGIL VHNANYSKGM
SSNIPDYIKD NRVPLDKETV LNSKEYQKTN IKVKGAQVYK KGDKYYYRDT FHTGEAAHLE
VFDKRGNHIG EANPLTGELI PGTADPMKKI KIK