Gene Mext_4272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4272 
Symbol 
ID5834328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4754476 
End bp4756074 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content70% 
IMG OID641370063 
Productextracellular solute-binding protein 
Protein accessionYP_001641712 
Protein GI163853669 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0380913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.650765 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCC CCCGCCGCAC CCTCCTGCAG ACCGGCGCGG CGCTCGCCGC CGGGCTCGCC 
CTCCCCGCTC CCGCGCGGGC GGCCTCCCCC GTCTATCGCC GCGGCAACGA CGCCGACCCG
GAGACGCTCG ATCCGCACAA AACCTCGACG GTGGCCGAGG CGCATCTCCT GCGCGACCTG
TTCGAGGGGC TGCTGACCTA CGACAACCGC GGCACGATCA TTCCCGGCAT GGCCGAGCGT
TGGACCGTCT CCGATGACCG CCTCACCTAC CGCTTCACCC TGCGGCCGGA CGGGCGCTGG
TCGAACGGCG ATGCCGTGAC GGCCGACGAC TTCCTGTTCT CCCTGCGCCG CATCCTCGAT
CTGAAGACGG CGGCGAAATA CGCCGAGGTG CTGTTCCCGA TCCGGGGGGC GGCCGCCGTC
AATGCGGGTG AGCAACCGCC GGAGACGCTG GGGGTGACGG CCCCCGATCC CCGCACCCTG
GAAATCGGGC TCGCCGAGCC GGTGCCCTAC CTCCTCGAAC TCCTGACGCA CCAGACCTCG
CTGCCGGTCC ACCGCCCCTC GCTGGAGCGC TGGGGCGACG CCTTCGCGCG GCCCGGCAAC
CTCGTCTCGA ACGGCCCCTA CGCCCTGGTC GATTGGGTGC CGAACGACCG CATCACCCTG
ACGAAAAACC CGCATTACCG CGACGCCGCC GCGATCCCGA TCGAGCGGGT GGACGTCATC
CCGACTCCCG ACCTCGCCGC GGCGGTGCGG CGCTATGCGG CCGGCGAGAT CGATTCCCTC
TCGGACCTGC CCGCCGACCA GATCGCTTCG CTCAAGAGCC GCTTCGACCG CCAAGTGCAG
CTCGGACCGG GGCTCGGCCT GCTCGCCATC GCCTTCAACC TGCGAAAGAA ACCCTTCGAC
GACGCGCGGG TGCGCCGGGC CCTGTCGCTG GCCATCGACC GGGAATTTCT GGCCGAGATC
GTCTGGGGGC AGACCATGGC CCCGGCCTAT TCCTTCTGCC CGCCCGGCCT CGACAACGCC
CTGCCGCCCC CGCTCCTGCC GGGGCGCGAG GATGGGCCGA TCGACCGCGA GGAGGAGGCG
TTGCGGCTGC TGGCAGAAGC CGGCTACGGG CCGGGCAACC CGCTGACGGT CGAGTATCGC
TTCAACGTCA CCGACAACAA CCGCAACACG GCGATCGCGC TCGCGGATGC GTGGCGCGGC
ATCGGCGTCG TGACCCGCTT CGTCTCCACC GACGCCAAGA CCCACTTCGC GTATCTCCGC
GACGGCGGCC CCTTCGACCT CGCCCGGATG TCCTGGGTCG CCGACTATTC CGATCCGCAG
AATTTTCTCT TTTTGCTCCG CACCGGCAAT GACGGGTTCA ATGCCGGGCG CTGGTCGAAC
GCGCGCTTTG ACGAACTGCT GACGCGGGCG GCGCAGGAGC GCGACGTGCC GGCCCGCGCG
CGGATGCTGT TCGACGCCGA AACCCTCGTG CTCGACGAAC TGCCCTGGGT GCCGCTGCTG
CATTACCGCT CGAAGGCGCT CGTCTCGCCG CGGCTGCACG GGATGCACCC GAACATCCGC
AACGTCGCCC CCACCCGCTA TCTCCGGCTC GATCCATGA
 
Protein sequence
MSLPRRTLLQ TGAALAAGLA LPAPARAASP VYRRGNDADP ETLDPHKTST VAEAHLLRDL 
FEGLLTYDNR GTIIPGMAER WTVSDDRLTY RFTLRPDGRW SNGDAVTADD FLFSLRRILD
LKTAAKYAEV LFPIRGAAAV NAGEQPPETL GVTAPDPRTL EIGLAEPVPY LLELLTHQTS
LPVHRPSLER WGDAFARPGN LVSNGPYALV DWVPNDRITL TKNPHYRDAA AIPIERVDVI
PTPDLAAAVR RYAAGEIDSL SDLPADQIAS LKSRFDRQVQ LGPGLGLLAI AFNLRKKPFD
DARVRRALSL AIDREFLAEI VWGQTMAPAY SFCPPGLDNA LPPPLLPGRE DGPIDREEEA
LRLLAEAGYG PGNPLTVEYR FNVTDNNRNT AIALADAWRG IGVVTRFVST DAKTHFAYLR
DGGPFDLARM SWVADYSDPQ NFLFLLRTGN DGFNAGRWSN ARFDELLTRA AQERDVPARA
RMLFDAETLV LDELPWVPLL HYRSKALVSP RLHGMHPNIR NVAPTRYLRL DP