Gene Mext_2102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2102 
Symbol 
ID5833209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2357322 
End bp2359481 
Gene Length2160 bp 
Protein Length719 aa 
Translation table11 
GC content70% 
IMG OID641367899 
ProductTonB-dependent receptor plug 
Protein accessionYP_001639568 
Protein GI163851525 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00121164 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGAGAA AGCGCTATGG GCTTGGCCCG TGGTTCGGAT CGTTCGCCGG GATCGGCATC 
GCCGGCCTCG TCGTGTCGCC GATCCAGGCG GAAGAGGTCG CCCTCGACGA GATCAGCGTC
GTCTCGGACA AGCCGGCACG GAGCGCGGGT CCGGGTGCGG CGGGGCCGGC GCCGAAGCCG
GCCGACCCGG TCGGCGTCCT GCCCGTGGTG ACCGACCGGT TCTCGACGGT CACCGTCGTC
TCCGGGGCCG AACTCGCCCG CTCGCAGCCG ACCTCCCTCG GCGAGGCTCT GTTCGACAAG
CCGGGCCTGT CCGCGACCAC CTTTGCGCCG GGCTCGGCCT CGCGGCCGAT CATCCGCGGC
CTCGACAACC ACCGTGTCCG CATCCTCGAA AACGGCACCG GTGTGCAGGA CATGTCGGAT
CTGGGCGAGG ACCACGCCGT CCCGATCAAC CCGCTGATCA ACGACCGCAT CGAGGTGATC
CGCGGGCCCG CGGGCCTGCG CTACGGGTCC CAGGCCGTCG GTGGCGTGGT CTCGGTCGAG
AACAACCGCA TCCCGACAAG CATTCCCCCC GGCGGCGTCG CGGGACGGGT GACCACCGGC
TACTCGGCCG TCGACAATGG CCGCAACGCC GCCGCCACGG TCGATGCCGG CAGCGGCAAT
GTCGCCGTCC ATGCCGATGC GTTCCGGAGC GCGGCGGACG ACTACAACAC GCCGCTCGGC
ATCCAGCGCA ATTCCTTCAA CGAATCGCAG GGGGGCGCTT TCGGCACCTC CTACCTGTTC
GATCGCGGCT TCGTCGGGAT GTCCTTCAGC CACTTCGATT CGGTCTACGG CATCCCCGGA
ATCAGCTCCG CGGCGCAGCG CACCCGCCTC GATCCGGTGC AGGACAAGCT CCAGGCCCGG
GGCGAGTACC GGCCGCTGGA CGGGCCGTTC GCGGCGATCC GGTTCTGGGC GGGCGGCTCG
AACTACCGCC ACAACGAGTT CGGCACCGAT TCCAACGGCG TCGAGGGGAT CGAGGCGGTG
TTCAAGAACC GCGCGGCGGA AGGGCGGATC GAATTCGACC ACGTGCCGGT CGAGACCGCT
TTCGGCACCT TCAGCGGCCA GCTCGGATTC CAGGCCGACC GGCGCAAGCT CCGCATCTCG
GGGGCGGAAG GGGGCCTGTT GCGCCCGACC GACACGCGGG TGCAGGCCGC TTACCTGTTC
GAGGAACTGG CGCTCGGCGG CGGCCTGCGG TTCCAGGCGG CGGGCCGCAT CGAGGGCAAC
CGCGTCGCCG GCACCCAGGC GCTGTTCCCC TCGAACTTCC TGCCCGACGG CCCCGGCGAC
GAGCCGGTGG AATCGCGGCG TATCCGCCGC TTCGCCCCGA AAAGCGCGAG CCTCAGCGCG
CTCCAGGACC TGCCCCACGG GTTCGTGGGG AGCCTGACCG GCTCCTATGT CGAGCGCGCC
CCGACCTCGC CCGAACTGTT CTCACGCGGC CCGCACGAGG CCTCCGCGAC CTTCGAGATC
GGCGATCCCA ACCTTAAGTC GGAGCGGGCC CGCACCGTCG AGGTGGCGAT CCGGCGGGCG
GAGGGGCCGT TCCGCCTCGA CGCGACCGGC TACATCACGC GCTATACCGG CTTCGTCTAC
AAGCGCCTGA CCGGCTTCAC CTGCGGCGAC ACTTTCGACA GTTGCGGCAT CGGCGGCGGG
GATCTGCGGC AGGTCGCCTA TTCGCAGGCC GGCGCCACCT TCGCCGGCGC CGAGATTGCG
AGCCAGATCG ACATCGTGCC GGTCGGTGAC GGGTTTGCCG GCATCAGCGC CCAGTACGAT
TTCGTGCGCG CCCAGTTCGA CGACGGTCGC TTCGTGCCGC GCATCCCGCC CCACCGCGTC
GGTGGCGGCG TCTTCCTGCG GGCCGACGGC TGGTTCGCTC AACTCAGCCT GCTGCACGCC
TTCGCTCAGA CACAGACCGG CACGCTGGAA ACGCCGACGC CGGGCTACAA CGACCTCAAG
GCGGAGATCG CCTACAGCCG TCCGCTCGAT CCGGCGGTCC ATGGTCTCAG CGAGGTGACG
CTCGGCCTGC GCGGCACGAA CCTGCTCGAC GACGTGATCC GCAACGCCGC CTCCTTCCGC
AAGGACGAGG TCGTGCTGCC GGGCCGCAAC GTGCGGCTCT TCCTCACGGC ACGGTTCTGA
 
Protein sequence
MSRKRYGLGP WFGSFAGIGI AGLVVSPIQA EEVALDEISV VSDKPARSAG PGAAGPAPKP 
ADPVGVLPVV TDRFSTVTVV SGAELARSQP TSLGEALFDK PGLSATTFAP GSASRPIIRG
LDNHRVRILE NGTGVQDMSD LGEDHAVPIN PLINDRIEVI RGPAGLRYGS QAVGGVVSVE
NNRIPTSIPP GGVAGRVTTG YSAVDNGRNA AATVDAGSGN VAVHADAFRS AADDYNTPLG
IQRNSFNESQ GGAFGTSYLF DRGFVGMSFS HFDSVYGIPG ISSAAQRTRL DPVQDKLQAR
GEYRPLDGPF AAIRFWAGGS NYRHNEFGTD SNGVEGIEAV FKNRAAEGRI EFDHVPVETA
FGTFSGQLGF QADRRKLRIS GAEGGLLRPT DTRVQAAYLF EELALGGGLR FQAAGRIEGN
RVAGTQALFP SNFLPDGPGD EPVESRRIRR FAPKSASLSA LQDLPHGFVG SLTGSYVERA
PTSPELFSRG PHEASATFEI GDPNLKSERA RTVEVAIRRA EGPFRLDATG YITRYTGFVY
KRLTGFTCGD TFDSCGIGGG DLRQVAYSQA GATFAGAEIA SQIDIVPVGD GFAGISAQYD
FVRAQFDDGR FVPRIPPHRV GGGVFLRADG WFAQLSLLHA FAQTQTGTLE TPTPGYNDLK
AEIAYSRPLD PAVHGLSEVT LGLRGTNLLD DVIRNAASFR KDEVVLPGRN VRLFLTARF