Gene Cthe_0877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0877 
Symbol 
ID4810495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1051833 
End bp1052927 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content45% 
IMG OID640106293 
ProductGTP-dependent nucleic acid-binding protein EngD 
Protein accessionYP_001037304 
Protein GI125973394 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0012] Predicted GTPase, probable translation factor 
TIGRFAM ID[TIGR00092] GTP-binding protein YchF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00007173 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATGG GAATAGTCGG ACTTCCTAAT GTTGGAAAAA GCACTCTTTT TAATGCGATA 
ACAAAAGCCG GTGCAGAATC GGCAAACTAT CCTTTTTGTA CGATTGAGCC GAATGTGGGA
ATTGTGGCCG TCCCGGATGA AAGGCTCAAC AAGCTTGCCG AGATGTACAA ACCGGAAAAG
GTCACACCTA CCACAATAGA ATTTGTGGAT ATAGCCGGGC TTGTAAAAGG AGCCAGCAAA
GGCGAAGGTC TTGGCAACAA GTTTTTGTCC CATATAAGAG AGGTTGATGC AATAGTCCAT
GTCGTCCGTT GCTTTGAAGA CAGCAATATC GTGCATGTCG AAGGCTCGGT AGACCCTGTT
CGCGATGTGG AAACCATTGA AATGGAATTA ATTCTGGCGG ACATGGAAGT TTTGGAAAGA
AGGATAGACA GAACACGCAA AATGTTAAAG TCCGGAGATA AAAAGTATCA AGTGGAGCTT
GACATTTACG AGCGTATCAT GAAAACCTTT GAAGAAGGCA AACCGGTTCG CTCAATGAGC
TTTAGCGAGG AAGAGAAAAA AATTGTGGAC CAGCTGTTTC TTCTGACATC AAAGCCGGTA
TTGTACGCAG CAAACGTTTC CGAAGATGAC ATAAATTCCG ACAAACCAAA TCCGTTGGTA
GAAAAGCTTG TTAATTATGC AAAAAACGAA GGTTCGGAAG TAATGGTTAT ATGTGCAAAA
ATCGAAGAAG AAATTGCTCA GCTCGATGAC GAGGAAAAAG CGGAATTCTT AAAAGAACTG
GGACTGTCGG AATCTGGACT TGACCGGTTG ATAAAAGCAA GTTACAGGCT TTTGGGCCTT
ATCAGCTTCC TTACCGCCGG ACCGCAGGAA GTCAGGGCAT GGACTATAGT CAAGGGCACA
AAGGCGCCCC AGGCGGCCGG AAAAATTCAC AGTGACTTTG AAAAAGGCTT TATCCGTGCT
GAAGTCGTCG CCTATGATGA CCTTATAAAG GCCGGTTCAT ATACCATTGC GAAGGAAAAA
GGCCTGGTGC GTTCCGAAGG AAAGGACTAC GTGATGCAGG ACGGCGACGT TACTCTCTTT
AGATTTAATG TATAA
 
Protein sequence
MKMGIVGLPN VGKSTLFNAI TKAGAESANY PFCTIEPNVG IVAVPDERLN KLAEMYKPEK 
VTPTTIEFVD IAGLVKGASK GEGLGNKFLS HIREVDAIVH VVRCFEDSNI VHVEGSVDPV
RDVETIEMEL ILADMEVLER RIDRTRKMLK SGDKKYQVEL DIYERIMKTF EEGKPVRSMS
FSEEEKKIVD QLFLLTSKPV LYAANVSEDD INSDKPNPLV EKLVNYAKNE GSEVMVICAK
IEEEIAQLDD EEKAEFLKEL GLSESGLDRL IKASYRLLGL ISFLTAGPQE VRAWTIVKGT
KAPQAAGKIH SDFEKGFIRA EVVAYDDLIK AGSYTIAKEK GLVRSEGKDY VMQDGDVTLF
RFNV