Gene Information |
Locus tag | Cthe_1024 |
Symbol | engA |
ID | 4811318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1225839 |
End bp | 1227161 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106442 |
Product | GTP-binding protein EngA |
Protein accession | YP_001037449 |
Protein GI | 125973539 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA |
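
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table.
start_bp = 1225839
end_bp = 1227161
gene_length_bp = 1323
protein_length_aa = 440


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the table: 1323 bp and 440 aa.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the three fields are mutually consistent: 1227161 - 1225839 + 1 = 1323 bp, and 1323 / 3 - 1 = 440 aa.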
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.127427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCGAAGC CTGTTGTTGC GGTGGTGGGA AGGCCTAATG TGGGCAAATC GACTTTTTTT AACTATCTTG TGGGAAAAAG AATATCCATT GTTGAAGATA CACCGGGTGT AACCCGTGAC AGAATATATG CGGAAGTCGA ATGGAGAAAC AAAAAGTTTA CATTGATAGA TACGGGCGGT ATAGAACCCT ATTCTGAAGA CAAGATAATG CAGCAAATGA AAAGACAGGC TGAGATAGCG ATTGAGACGG CTGACATTAT TATATTTATG GTGGATGTAA AAGATGGAGT TACCGCGTCG GACAAAGAAG TGGCTACTTT GCTCAGAAAA ACCAAAAAGC CTGTTATAGT GGCTGTCAAC AAAGTGGACA AAATTGGTGA ACTTCCTGCG GATTTCTACG AATTTTACAA CCTTGGTTTT GGTGAACTAA TGGCCATTTC TTCCATTCAC GGACTGGGAA TGGGAGATTT GCTTGATGAG ATATTCAAAT ATTTCCCGGA AGAAGATGCT GAGGATTATG ATGAAGATGT AATAAAAGTT GCGGTGGTGG GAAAGCCAAA TGTCGGAAAA TCGTCTCTCA TCAACAGGAT TTTAGGAGAA GAAAGAGTAA TTGTGAGCGA TATTCCGGGA ACAACGAGAG ATGCTATTGA CACCTTTGTT GAAAATGAAC ACGGCAAATT TGTTTTTATT GATACTGCCG GTATCAGAAG GCAGAGCAAA ATTAATGAGA AAATTGAAAA ATACAGTATC ATAAGATCCT GGACGGCAAT TGAGAGGGCG GATGTTTGTC TGATATTGAT AGACGCAAAA GAAGGTGTTA CCGAGCAGGA TACAAAAATA GCGGGGTACG CTCACGAACA GGGCAAGGCT TCCATTATAG TGGTGAACAA GTGGGATTTG ATTGAAAAAC AGACCGGAAC CCTTGAAGAA TACAGAAGAA CGGTTCATGA AAAACTGGGT TTTATGCTCT ATGCTCCGGT AATCTTTATA TCGGCTTTGA CAGGACAGCG AGTCGACAGG ATTTACGGAC TCATAAAGCA TGTGGCGGAT CAGGCGGCCA TGAGAATTTC CACCGGAGTG TTAAATGACC TTCTGAATGA AGCTACTGCG ATGGTGCAGC CTCCTTCGGA CAAAGGGAAA AGACTCAAAA TTTATTATAT GACTCAATCA TCTGTCAAAC CGCCTTCATT TGTTCTTTTT ATAAACAATA TGGAGCTTAT GCATTACTCG TATGAGAGGT ATCTGGAAAA CCAGCTTAGA AAGAGTTTTG GATTTGAGGG TACGCCTATA AAATTCATAT TGAGGGAAAA AGAAAAGGAG TGA
|
Protein sequence | MPKPVVAVVG RPNVGKSTFF NYLVGKRISI VEDTPGVTRD RIYAEVEWRN KKFTLIDTGG IEPYSEDKIM QQMKRQAEIA IETADIIIFM VDVKDGVTAS DKEVATLLRK TKKPVIVAVN KVDKIGELPA DFYEFYNLGF GELMAISSIH GLGMGDLLDE IFKYFPEEDA EDYDEDVIKV AVVGKPNVGK SSLINRILGE ERVIVSDIPG TTRDAIDTFV ENEHGKFVFI DTAGIRRQSK INEKIEKYSI IRSWTAIERA DVCLILIDAK EGVTEQDTKI AGYAHEQGKA SIIVVNKWDL IEKQTGTLEE YRRTVHEKLG FMLYAPVIFI SALTGQRVDR IYGLIKHVAD QAAMRISTGV LNDLLNEATA MVQPPSDKGK RLKIYYMTQS SVKPPSFVLF INNMELMHYS YERYLENQLR KSFGFEGTPI KFILREKEKE
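
The gene sequence starts with TTG while the protein starts with M. This is consistent with translation table 11, where TTG is an accepted initiation codon and the initiator is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence and includes a small GC-fraction helper that could be applied to the full sequence. The function names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and the code assumes the sequence is shown in coding orientation.

```python
from itertools import product

# Amino acid assignments for translation table 11, codons ordered with
# bases T, C, A, G at each position (third position varying fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons allowed by table 11; each is read as M at position 1.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS in coding orientation, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
prefix = "TTGCCGAAGC CTGTTGTTGC GGTGGTGGGA"
print(translate_cds(prefix))  # MPKPVVAVVG, the first block of the protein
```

Passing the complete gene sequence to `translate_cds` and `gc_fraction` would be one way to compare against the listed protein sequence and the reported GC content.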