Gene Mext_3951 details


Gene Information

Locus tag: Mext_3951
Symbol:
ID: 5834221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4387546
End bp: 4389741
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 72%
IMG OID: 641369742
Product: TonB-dependent receptor plug
Protein accession: YP_001641393
Protein GI: 163853350
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
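
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and that the CDS ends in a stop codon, which matches the gene sequence ending in TGA. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4387546
end_bp = 4389741
gene_length_bp = 2196
protein_length_aa = 731


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the 2196 bp span corresponds to 732 codons, one of which is the stop codon, giving the 731 aa protein length.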


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.478156
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.620506
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGGCG GACGTTCAGG ACGGATGCTG CTGGCGGCGG GGGCAGTGCT GACGGGCGTG 
CCCGGGACCG TACTCTCGGC GCAGGCGCAG GAAGCGGCCA CGCTGGAGGA GATCAGCGTG
ACCTCGGTGA GCCCGATCCA GGGTCGGCCC GCCGCTCCGG CGGCTCCCTC CCCCGCCGCG
CCGTTCGCCC GGGCGGCCGA GGTGCTGCCG GTGGTGACGA ACACCTTCTC GCCGGTCACG
GTGGTGCCGC AGGAGCGGAT CGCCCGCGAC CAGCCGCGCA CGCTCGGCGA CGCGCTGTTC
GACCGGCCCG GCATCTCCGC CTCGACCTAC GCGCCGGGGG CGGCCTCGCG GCCGATCATC
CGCGGCCTCG ACAACGCCCG GGTGCGCATC CAGGAGAACG GCATCGTCAA CGGCGGCGTG
TCGGATCTCG GCGAGGATCA CGCGGTGCCG GTCAACCCGC TCAACGCCAG CCGGATCGAG
GTGATCCGCG GCCCCGCGAC CCTGCGCTAC GGCTCGGGGG CGATCGGCGG CGTGGTCTCG
GCCGAGAACA ACCAAGTGCC GACCTTCATC CCCGCCCGCG GGATCACGGG GCAGGTCACG
ACCGGCTACA CCACCGTCGA CAACGGCCGG CTCGGAGCGG CGAGCGTCGA TGCGGGCGCG
AACGGCATCG CGGTCCACGC CGACGGGTTC GCGACCGCGA CCGATTCCTA CGCGATCCCC
GGCGGCATCC AGCGCAACTC GGCGACCGAG ACGCAAGGGG GCTCGGTCGG CATCTCGGCG
ATCGGCGACC GCGGCTTCCT CGGCATCTCC TACAGCCACT TCAACGCGCT CTATCAGATC
CCCGGCGGCG AGGCGGCGGA AGCGCGCACC CGGCTCAACC CGAACCAGGA CCGCGTGCTC
GCCCGCGGCG AATACCGCCC GCTGGAAGGC CCGTTCGAGG TGCTGCGGTT CTGGGCCGGC
GGCTCGGTCT ACCGCCACGA GGAGATCGGC TTCGGGCACG CCCATCACGA TCACGGGGAC
GATGACGACC ACGCGCATGA GGGCCCGGCC GGCGAGGGCG TGCAGGCGAT CTTCAAGAAC
CGCGAGGTCG AAGCCCGGTT CGAGGCGCAG CACGTGCCGG TCTTCACCGC GCTCGGCACG
CTCACCGGCG CGGTCGGCGT CCAGACGAGC CGGCGGGTGC TCAACTCGCA GCTCGAGAGC
TTCCTGCCGC CGACCGAGTC GCGGGTGCTG GCCGCCTACC TGTTCGAGGA ACTGGCGGTC
GGCGGGGGGC TCCGGTTCCA GGCCGCCGGC CGGATCGAGG GCGACCGGCT CAACAGCATC
GCGACGCAGT TCCCCGGCAG CTACCTGCCG ACGGACGGGG ACCCGCTCAG CTATGCCCTG
ACGCGCCGCT TCGCTCCCAA GAGCCTCAGC TTCGGCGCCT TGCAGGATCT GCCCTACGGC
TTCGTGGCGA GCCTCAACGG CTCCTATGTC GAGCGTGCGC CCACCGGCGC CGAGCTGTTC
TCGCAGGGGC CGCACCACGC CTCGGCGACC TTCGAGATCG GCGATCCGAC CCTGCGGCTG
GAGCGCGCCC GCACCGCCGA GATCTCCTTG CGCCGTGCGG ACGGGCCGCT GCGGCTCGAC
GCCACCGGCT ACGTCACGCG CTACACCGGC TTCATCTACC GGCGCGACAC CGGGAACCGC
TGCGACGACG ATTTCGGCTC CTGCGGCTCC GGCGACGAGC TGCGCCAGGT CGTCTACTCG
CAGGCCAATG CCAGCTTCTA CGGCGCCGAG ATCGGCGCGC AGCTCGATCT CTTCGCCGTC
GGCGACGGCT GGGCCGGAAT CGAGGCGCAG TACGACTTCG TCCGCGCCCA GTTCGACGAC
GGCTCCTACG TCCCGCGCAT CCCGCCGCAC CGGCTCGGCG GCGGCGCCTT CGTGCGGGCC
AACGGCTGGT TCGCGCGGGT GAACCTGCTC CACGCCTTCG ACCACACCGA GATCGCCCCG
TTCGAGACCA CGACGCCGGG CTGGAACGAT CTTCGCGCCG AACTCGCCTA CACCCAGGCG
CTCGACCCGA CGGTCTACGG CGCCACCGAG GTGACGCTGG GGCTGCAAGG CCGCAACCTG
CTCGATGACG ACATCCGCAA CTCGGCCTCG TTCAAGAAGG ACGAGATCCT GCTGCCGGGC
CGCAACCTCC GCCTGTTCCT GACGGCACGA TTCTGA
 
Protein sequence
MRGGRSGRML LAAGAVLTGV PGTVLSAQAQ EAATLEEISV TSVSPIQGRP AAPAAPSPAA 
PFARAAEVLP VVTNTFSPVT VVPQERIARD QPRTLGDALF DRPGISASTY APGAASRPII
RGLDNARVRI QENGIVNGGV SDLGEDHAVP VNPLNASRIE VIRGPATLRY GSGAIGGVVS
AENNQVPTFI PARGITGQVT TGYTTVDNGR LGAASVDAGA NGIAVHADGF ATATDSYAIP
GGIQRNSATE TQGGSVGISA IGDRGFLGIS YSHFNALYQI PGGEAAEART RLNPNQDRVL
ARGEYRPLEG PFEVLRFWAG GSVYRHEEIG FGHAHHDHGD DDDHAHEGPA GEGVQAIFKN
REVEARFEAQ HVPVFTALGT LTGAVGVQTS RRVLNSQLES FLPPTESRVL AAYLFEELAV
GGGLRFQAAG RIEGDRLNSI ATQFPGSYLP TDGDPLSYAL TRRFAPKSLS FGALQDLPYG
FVASLNGSYV ERAPTGAELF SQGPHHASAT FEIGDPTLRL ERARTAEISL RRADGPLRLD
ATGYVTRYTG FIYRRDTGNR CDDDFGSCGS GDELRQVVYS QANASFYGAE IGAQLDLFAV
GDGWAGIEAQ YDFVRAQFDD GSYVPRIPPH RLGGGAFVRA NGWFARVNLL HAFDHTEIAP
FETTTPGWND LRAELAYTQA LDPTVYGATE VTLGLQGRNL LDDDIRNSAS FKKDEILLPG
RNLRLFLTAR F
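
The sequences above are laid out in space-separated blocks of ten characters, so the layout has to be stripped before any computation. The following is a minimal sketch of that cleanup, together with a GC-percentage calculation of the kind that could be compared against the GC content field. The helper names are hypothetical, and how the record's own GC value was computed or rounded is not stated.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to lay out a sequence."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above, used as a small example.
first_line = "ATGCGGGGCG GACGTTCAGG ACGGATGCTG CTGGCGGCGG GGGCAGTGCT GACGGGCGTG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60 bases once the spaces are removed
print(round(gc_percent(seq), 1))  # GC percentage of this line only
```

To check the whole gene, the full Gene sequence block would be passed to `clean_sequence` in place of `first_line`; its cleaned length should equal the 2196 bp listed under Gene Length.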