Gene Mext_3674 details

Gene Information

Locus tag: Mext_3674
Symbol: mdoG
ID: 5833130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4064087
End bp: 4065757
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 73%
IMG OID: 641369467
Product: glucan biosynthesis protein G
Protein accession: YP_001641123
Protein GI: 163853080
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
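
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of this database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4064087, 4065757)
print(length)                  # 1671
print(protein_length(length))  # 556
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of this record.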


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTGA GAGCCGACGA GGGAATGACG CATCCGACCG ACCTTTCGAC CCGCAATGCG 
GGTCCGGCGC CGATCGACCG CCGCCGGCTC CTCGGCGGGG TCGCGGCCGG CGCCGCCCTG
GCCCTGCTGC CCGGCCGGGT GGAGGCGCAG GCCGCGCCGC CGGCCCTGCC CGCGGCGGGC
TCGCCCTTCT CGGATGCCAC CGTGCCCGAC CTCGCCCGGG CGCTCGGCAA CCGGCCCTTC
GTGGCCCAGA CCGCGAACGA CGTCCCAGAC GCGCTCAAGA ACCTGCCGCG GGAAGCCTAC
GAGGCGATCC GCATCCGGCC CGAGGCGCTG ATCTGGGGCG GCGAGCCGCA CGGTTTCGCG
GTCGAGCCGC TGCCCCGCGG CTTCTACTTC ACCGACCGGG TCGCCCTGTT CCTCGTCGAG
GACGGCGTGG TCCGCCCCGT CGTCTACGAC CGCACGCATT ACGATGCGGG CACGGAGGCG
GGGGCGGCGG CTCTGCCGGA GGCCGGTCGC GAGCCGGGCT TTTCCGGCCT GCGCATCCGC
GCACGCTTCG GCGAGCGGCA CCAGGATTTC GCGGTGTTCC AGGGCGCCTG CTTCTACCGG
CTGGTAGGAC AGGGCCAGGA ATTCGGCGTC GATGGGCGCG CGCTGATGCT GCGCCCGGCC
GATCCGCGCG GCGAGGAATT CCCCCGCTGG CGCGCCCTGT TCGTGGAGCG CCCGAAGACG
CCCGACGGCC CGCTGGTGAT CCACGCCCTG CTCGATTCCG ATTCGCTTGC CGCCGCCCTG
CGCCTGGAAC TGCGCCCCGG CGAGACCTCG ACCGCGGCGA TCACCGCCAC CCTGGTCACC
CGCAAGGCCG TCGATCACCT CGGGCTCAGC GGCATGCAGG CACCGTTCCT GTTCGGCCCG
CACGACCGGC GCAGCGCGGA CGATGCCCGG GCCGCGGTCT ATGCCGCGGG CGGCCTGCAG
ATCCGCAATG GCAGCGGGGA GGCGATCTGG CGCCCGGTGC GCAACCCGGA GACGCTCCAG
ATCTCGGGCT TCCTCGACAA CCGCCCGCAG GGCTTCGGCC TGACGCAACG GGACCGGTCC
TTCACAACCT TCGAGGATGA CGGCCGCCAC TGGGAGCGCT GCCCCTCGCT CTGGGTCGAG
CCGGGCGAGG CCACCGGCGC CGCCGAGGCA GGCGCCGAGA GCCTGTGGGG GGAGGGCGCC
GTGACGCTCC TCGAAATCCC GAGCGATTCG GAAGTCAACG AGAACGTCAT CGCCTATTGG
CGGCCGAAGG CGGCCCTGCC CGCGGGCCAG GAGGTCCGTC TCGCCTACCG GCAGAACTGG
GGCCGCGAGC CAGCCTCGGC ATCACCGCAG GGCAGCCCGC TCGCGCGGGT CACGAGCACC
CGCAGCGGCC GCGGCACGGC CAATCCCCGG CGCCTGTTCC TCGTCGACTT TACCGGCGAC
GGGCTGTTCA CCGCCGAGGG CGCCCTCGTG CCCGTCGAGA CCGTGCTGAT CGCCGGCCCC
GGCCGGATCG TCGAGGGCGC GACCCGCTGG ATCGCCCATC CGGAGACCCG TACCGTCCGC
GTCGCCTTCG AGCTCGATCC GGGCAGCGAG CGGGCCTGCG AGTTGCGCCT TGCCCTCAAG
ACGGAAGGGC GACAGATCAC GGAAACGTGG CTGTACCGCT GGACGCCGTA A
 
Protein sequence
MTVRADEGMT HPTDLSTRNA GPAPIDRRRL LGGVAAGAAL ALLPGRVEAQ AAPPALPAAG 
SPFSDATVPD LARALGNRPF VAQTANDVPD ALKNLPREAY EAIRIRPEAL IWGGEPHGFA
VEPLPRGFYF TDRVALFLVE DGVVRPVVYD RTHYDAGTEA GAAALPEAGR EPGFSGLRIR
ARFGERHQDF AVFQGACFYR LVGQGQEFGV DGRALMLRPA DPRGEEFPRW RALFVERPKT
PDGPLVIHAL LDSDSLAAAL RLELRPGETS TAAITATLVT RKAVDHLGLS GMQAPFLFGP
HDRRSADDAR AAVYAAGGLQ IRNGSGEAIW RPVRNPETLQ ISGFLDNRPQ GFGLTQRDRS
FTTFEDDGRH WERCPSLWVE PGEATGAAEA GAESLWGEGA VTLLEIPSDS EVNENVIAYW
RPKAALPAGQ EVRLAYRQNW GREPASASPQ GSPLARVTST RSGRGTANPR RLFLVDFTGD
GLFTAEGALV PVETVLIAGP GRIVEGATRW IAHPETRTVR VAFELDPGSE RACELRLALK
TEGRQITETW LYRWTP
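
The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The sketch below strips the whitespace and computes a GC fraction, which is presumably how the GC content field is derived. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, copied verbatim.
first_line = "ATGACCGTGA GAGCCGACGA GGGAATGACG CATCCGACCG ACCTTTCGAC CCGCAATGCG"
seq = clean_sequence(first_line)

print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(f"{gc_fraction(seq):.0%}")
```

To check the whole gene, pass the complete Gene sequence block to clean_sequence; under the assumptions above, the cleaned length should equal the listed Gene Length.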