Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2477 |
Symbol | |
ID | 3831211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2583112 |
End bp | 2584185 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637830396 |
Product | GTP-dependent nucleic acid-binding protein EngD |
Protein accession | YP_431302 |
Protein GI | 83591293 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0012] Predicted GTPase, probable translation factor |
TIGRFAM ID | [TIGR00092] GTP-binding protein YchF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000165994 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0012744 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGCCCCTGA CCTGTGGTAT TATTGGTTTA CCCCTGGTAG GCAAAACGAC CCTGTTTAAC CTTTTAACCC AGGCTGAGGC GGAAACCTCG GCCTTTGCCG GCCGTACTAA AACCAACATC CGGACGGCGC CCATACCCGA TGCCCGCCTG GATTTCCTGG CGGCCCTTTA CCATCCCCGC AAGGTTACCC CCGCCACCTT GGAGATTATC GATGTCCCGG GGTTGACCCG GGGGGCCGGT GCGGCCTTCC TGGCCGCTGT CCGGGAAGTA GACGCCCTGA TCCATGTAGT CCGGGCCTTT CGGAACGATA GTATAATCCA CGTAGAAGGT AACCTCAACC CGGTGCGGGA CCTGGAGACT ATTAATGCCG AGCTCCTCCT GGCCGATCTG CAACTGGTCG AAACCCGTCT GGAGCGAATT GCCGCCAGCA AGAAAATCAA GCCGGAAATG CAGGCCGAAC GGGAGGCCCT GGAGCATTGC CGCCAAGCCC TGGAAGCCGA AAAGCCCCTG CTGGAAGCCG GCCTAACGGA AGAGGAATGG CAGACCCTGC GCCATATGGG CTTTTTGACA ACTAAGCCCA TGATCATAGT GGTCAATATC GATGAAGACC AGCTCCGCTC CGGGCATTAT GCCGGTGAAG AAGAGGTCAA GGCCTATGCC CAACCGAAGG GTTTACCGAT ATTGACTCTC TGCGCCGAAC TGGAAGCGGA GATTGCCCGC CTGGAACCGG GCGACAGGGA AGACTTCCTG CGGGAAATGG GCATTACCGA ACCGGGCATC GACCGTCTGG CCCGGGCCAT TTACCACCGC CTGGGATTAA TCTCTTTCTT AACCGCTGGC GAAGACGAAG TCCGGGCCTG GACCATCCAG GCCGGCACCA ACGCCCGGGC GGCGGCCGGT AAAATCCATA GCGATATCGA GCGAGGCTTT ATCCGCGCCG AGGTGGTTAA CTTTGCCGAC CTGGAGCGGT GCGGCAATAT GAATAAAGTC AAGGAACAAG GTCTGGCGCG CCTGGAAGGC AAGGATTATA TTGTGCAGGA CGGCGATATC ATCAACTTCC GCTTTAATGT TTAG
|
Protein sequence | MPLTCGIIGL PLVGKTTLFN LLTQAEAETS AFAGRTKTNI RTAPIPDARL DFLAALYHPR KVTPATLEII DVPGLTRGAG AAFLAAVREV DALIHVVRAF RNDSIIHVEG NLNPVRDLET INAELLLADL QLVETRLERI AASKKIKPEM QAEREALEHC RQALEAEKPL LEAGLTEEEW QTLRHMGFLT TKPMIIVVNI DEDQLRSGHY AGEEEVKAYA QPKGLPILTL CAELEAEIAR LEPGDREDFL REMGITEPGI DRLARAIYHR LGLISFLTAG EDEVRAWTIQ AGTNARAAAG KIHSDIERGF IRAEVVNFAD LERCGNMNKV KEQGLARLEG KDYIVQDGDI INFRFNV
|
| |