Gene Moth_2477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2477 
Symbol 
ID3831211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2583112 
End bp2584185 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content58% 
IMG OID637830396 
ProductGTP-dependent nucleic acid-binding protein EngD 
Protein accessionYP_431302 
Protein GI83591293 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0012] Predicted GTPase, probable translation factor 
TIGRFAM ID[TIGR00092] GTP-binding protein YchF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000165994 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0012744 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGCCCCTGA CCTGTGGTAT TATTGGTTTA CCCCTGGTAG GCAAAACGAC CCTGTTTAAC 
CTTTTAACCC AGGCTGAGGC GGAAACCTCG GCCTTTGCCG GCCGTACTAA AACCAACATC
CGGACGGCGC CCATACCCGA TGCCCGCCTG GATTTCCTGG CGGCCCTTTA CCATCCCCGC
AAGGTTACCC CCGCCACCTT GGAGATTATC GATGTCCCGG GGTTGACCCG GGGGGCCGGT
GCGGCCTTCC TGGCCGCTGT CCGGGAAGTA GACGCCCTGA TCCATGTAGT CCGGGCCTTT
CGGAACGATA GTATAATCCA CGTAGAAGGT AACCTCAACC CGGTGCGGGA CCTGGAGACT
ATTAATGCCG AGCTCCTCCT GGCCGATCTG CAACTGGTCG AAACCCGTCT GGAGCGAATT
GCCGCCAGCA AGAAAATCAA GCCGGAAATG CAGGCCGAAC GGGAGGCCCT GGAGCATTGC
CGCCAAGCCC TGGAAGCCGA AAAGCCCCTG CTGGAAGCCG GCCTAACGGA AGAGGAATGG
CAGACCCTGC GCCATATGGG CTTTTTGACA ACTAAGCCCA TGATCATAGT GGTCAATATC
GATGAAGACC AGCTCCGCTC CGGGCATTAT GCCGGTGAAG AAGAGGTCAA GGCCTATGCC
CAACCGAAGG GTTTACCGAT ATTGACTCTC TGCGCCGAAC TGGAAGCGGA GATTGCCCGC
CTGGAACCGG GCGACAGGGA AGACTTCCTG CGGGAAATGG GCATTACCGA ACCGGGCATC
GACCGTCTGG CCCGGGCCAT TTACCACCGC CTGGGATTAA TCTCTTTCTT AACCGCTGGC
GAAGACGAAG TCCGGGCCTG GACCATCCAG GCCGGCACCA ACGCCCGGGC GGCGGCCGGT
AAAATCCATA GCGATATCGA GCGAGGCTTT ATCCGCGCCG AGGTGGTTAA CTTTGCCGAC
CTGGAGCGGT GCGGCAATAT GAATAAAGTC AAGGAACAAG GTCTGGCGCG CCTGGAAGGC
AAGGATTATA TTGTGCAGGA CGGCGATATC ATCAACTTCC GCTTTAATGT TTAG
 
Protein sequence
MPLTCGIIGL PLVGKTTLFN LLTQAEAETS AFAGRTKTNI RTAPIPDARL DFLAALYHPR 
KVTPATLEII DVPGLTRGAG AAFLAAVREV DALIHVVRAF RNDSIIHVEG NLNPVRDLET
INAELLLADL QLVETRLERI AASKKIKPEM QAEREALEHC RQALEAEKPL LEAGLTEEEW
QTLRHMGFLT TKPMIIVVNI DEDQLRSGHY AGEEEVKAYA QPKGLPILTL CAELEAEIAR
LEPGDREDFL REMGITEPGI DRLARAIYHR LGLISFLTAG EDEVRAWTIQ AGTNARAAAG
KIHSDIERGF IRAEVVNFAD LERCGNMNKV KEQGLARLEG KDYIVQDGDI INFRFNV