Gene Mext_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4043 
SymbolengA 
ID5834513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4499007 
End bp4500347 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content71% 
IMG OID641369834 
ProductGTP-binding protein EngA 
Protein accessionYP_001641484 
Protein GI163853441 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03594] ribosome-associated GTPase EngA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0947633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.421854 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTGC CGACCGTCGC GATCGTCGGA CGCCCGAATG TCGGCAAGTC GACCCTGTTC 
AACCGGCTGG TCGGGCGCAA GCTCGCCCTG GTGGATGACC GCCCCGGCGT GACCCGCGAC
CGCCGCGAGG GCGAGGGCTT CATCGGCGAC GTCGCCTTCC GCGTCATCGA CACCGCGGGC
CTCGAAGAGG CGGACGCCGA CTCGCTGCTC GGCCGCATGC GCGCCCAGAC CGAGGCCGCC
ATTCTCGAAG CCGACGCGGT GCTGTTCGTC ATCGACGCCC GCGCCGGCGT CCTGCCGTCC
GACCGGCCCT TTGCCGAGCT GGTGCGCCGC TCCGGCTGCC CCGTCATCCT CATCGCCAAC
AAGGCCGAGG GCGGCGCCGG CATGGCCGGC GCCTACGACG CGTTCTCGCT GGGGCTCGGC
GATCCGATCC CGTTCTCGGC CGAGCACGGC GAGGGCCTGG GCTCGCTGCA GGATGCCCTG
CGCGAGGTTC TGCCCGAACC CGACGAGGAG GACGAGGACG GGGAGGGCGG CAAGGGCCTG
CGCGTCGCCA TCGTCGGGCG CCCGAACGCC GGCAAGTCCA CCCTGATCAA CCGGATGATT
GGCGAGGATC GCCTGCTGGT CGGCCCCGAG GCCGGCATTA CCCGCGATTC GATCTCCCTC
GATTGGGAGT GGCGCGGGCG CCGGATCAAG CTGCACGACA CCGCCGGCAT GCGCCGCCGG
GCGCGCATCG ACGACAAGCT CGAAAAGCTC GCGGTCTCGG ACGGCTTGCG CGCCGTGCGC
TTCGCCGAGG TCGTGGTCGT GCTCCTCGAT GCGACGATCC CGTTCGAGAA GCAGGATCTC
ACCATCGTCG ATCTCGTCGA GAGCGAGGGC CGCGCGGTGG TGATCGGCCT CAACAAGTGG
GATCTCGTGG CCGACCAGCC GGGCCTGCTC AAGACCCTCC GGGAAGACTG CACCCGCCTG
CTGCCGCAGG TGCGCGGCGT CTCGGTGGTG TCGCTCTCGG GGCTCGCCGG CGACGGCATC
GACAAGCTGA TGCAGGCCGT GGTCGATGCC TCCGAGGTGT GGAGCCGGCG CGTCTCGACG
GCGCGGATCA ATGCGTGGCT CACCGACGCG CTCCAGCGCA ACCCGCCGCC CGCGGTCTCC
GGCCGGCGCA TCAAGATCCG CTACGCGACC CAGGTGAAGA GCCGCCCGCC GCACTTCGCC
CTGTTCGGCA ACCAGCTCGA CGCCCTGCCG AAATCCTACA CCCGCTACCT CGTCAACGGC
CTGCGCGAGG CCTTCGATCT GCCCGGCACG CCGATCCGGC TGTCCCTGCG CACCACGAAG
AACCCGTTCG AGAAGGGCTA A
 
Protein sequence
MDLPTVAIVG RPNVGKSTLF NRLVGRKLAL VDDRPGVTRD RREGEGFIGD VAFRVIDTAG 
LEEADADSLL GRMRAQTEAA ILEADAVLFV IDARAGVLPS DRPFAELVRR SGCPVILIAN
KAEGGAGMAG AYDAFSLGLG DPIPFSAEHG EGLGSLQDAL REVLPEPDEE DEDGEGGKGL
RVAIVGRPNA GKSTLINRMI GEDRLLVGPE AGITRDSISL DWEWRGRRIK LHDTAGMRRR
ARIDDKLEKL AVSDGLRAVR FAEVVVVLLD ATIPFEKQDL TIVDLVESEG RAVVIGLNKW
DLVADQPGLL KTLREDCTRL LPQVRGVSVV SLSGLAGDGI DKLMQAVVDA SEVWSRRVST
ARINAWLTDA LQRNPPPAVS GRRIKIRYAT QVKSRPPHFA LFGNQLDALP KSYTRYLVNG
LREAFDLPGT PIRLSLRTTK NPFEKG