Gene Information |
Locus tag | Cthe_0877 |
Symbol | |
ID | 4810495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1051833 |
End bp | 1052927 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640106293 |
Product | GTP-dependent nucleic acid-binding protein EngD |
Protein accession | YP_001037304 |
Protein GI | 125973394 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0012] Predicted GTPase, probable translation factor |
TIGRFAM ID | [TIGR00092] GTP-binding protein YchF |
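The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the 1095 bp gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1051833
END_BP = 1052927
REPORTED_GENE_LENGTH = 1095      # bp
REPORTED_PROTEIN_LENGTH = 364    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length, computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both computed values agree with the reported 1095 bp and 364 aa.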
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00007173 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGG GAATAGTCGG ACTTCCTAAT GTTGGAAAAA GCACTCTTTT TAATGCGATA ACAAAAGCCG GTGCAGAATC GGCAAACTAT CCTTTTTGTA CGATTGAGCC GAATGTGGGA ATTGTGGCCG TCCCGGATGA AAGGCTCAAC AAGCTTGCCG AGATGTACAA ACCGGAAAAG GTCACACCTA CCACAATAGA ATTTGTGGAT ATAGCCGGGC TTGTAAAAGG AGCCAGCAAA GGCGAAGGTC TTGGCAACAA GTTTTTGTCC CATATAAGAG AGGTTGATGC AATAGTCCAT GTCGTCCGTT GCTTTGAAGA CAGCAATATC GTGCATGTCG AAGGCTCGGT AGACCCTGTT CGCGATGTGG AAACCATTGA AATGGAATTA ATTCTGGCGG ACATGGAAGT TTTGGAAAGA AGGATAGACA GAACACGCAA AATGTTAAAG TCCGGAGATA AAAAGTATCA AGTGGAGCTT GACATTTACG AGCGTATCAT GAAAACCTTT GAAGAAGGCA AACCGGTTCG CTCAATGAGC TTTAGCGAGG AAGAGAAAAA AATTGTGGAC CAGCTGTTTC TTCTGACATC AAAGCCGGTA TTGTACGCAG CAAACGTTTC CGAAGATGAC ATAAATTCCG ACAAACCAAA TCCGTTGGTA GAAAAGCTTG TTAATTATGC AAAAAACGAA GGTTCGGAAG TAATGGTTAT ATGTGCAAAA ATCGAAGAAG AAATTGCTCA GCTCGATGAC GAGGAAAAAG CGGAATTCTT AAAAGAACTG GGACTGTCGG AATCTGGACT TGACCGGTTG ATAAAAGCAA GTTACAGGCT TTTGGGCCTT ATCAGCTTCC TTACCGCCGG ACCGCAGGAA GTCAGGGCAT GGACTATAGT CAAGGGCACA AAGGCGCCCC AGGCGGCCGG AAAAATTCAC AGTGACTTTG AAAAAGGCTT TATCCGTGCT GAAGTCGTCG CCTATGATGA CCTTATAAAG GCCGGTTCAT ATACCATTGC GAAGGAAAAA GGCCTGGTGC GTTCCGAAGG AAAGGACTAC GTGATGCAGG ACGGCGACGT TACTCTCTTT AGATTTAATG TATAA
|
Protein sequence | MKMGIVGLPN VGKSTLFNAI TKAGAESANY PFCTIEPNVG IVAVPDERLN KLAEMYKPEK VTPTTIEFVD IAGLVKGASK GEGLGNKFLS HIREVDAIVH VVRCFEDSNI VHVEGSVDPV RDVETIEMEL ILADMEVLER RIDRTRKMLK SGDKKYQVEL DIYERIMKTF EEGKPVRSMS FSEEEKKIVD QLFLLTSKPV LYAANVSEDD INSDKPNPLV EKLVNYAKNE GSEVMVICAK IEEEIAQLDD EEKAEFLKEL GLSESGLDRL IKASYRLLGL ISFLTAGPQE VRAWTIVKGT KAPQAAGKIH SDFEKGFIRA EVVAYDDLIK AGSYTIAKEK GLVRSEGKDY VMQDGDVTLF RFNV