Gene Mext_0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0343 
Symbol 
ID5832886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp388520 
End bp390394 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content69% 
IMG OID641366128 
Productextracellular solute-binding protein 
Protein accessionYP_001637838 
Protein GI163849795 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGG GCGCGACCCG CCGAAGCGTC GTCCTCGGCA CCGGGGTGAT AGCGCTTGCG 
GGCGCCCTGC CCCGTTCGGC CCGTGCCGAG GATGTCCACA AGGCTCATGG CGGGATTCAT
GGCCTATCGA GCTTCGGCGA GCTGAAATAC GCGCCCGACT TCCCGAACTT CGACTACGTG
AACCCGATGG CGCCGCGCGG CGGGCGCTTC TCGACAACTC TGGTCCAGAC CTTCGGCAAT
CAGGCGTTCG ACACCTTCGA CACGCTCAAC CCCTACGTCT TCCGCGGCAA CGGTGCGGCC
GGGATCAATC TCACCTTCGA CAGCCTGATG GTGCGCGCCC TCGACGAGCC GGACGCGCTC
TACGGCCTCG TCGCCCGCTC GGTCGAGATC AGCCCCGACG GCCTGACCTA TCGCTTCGCC
CTGCGTCCCC AGGCGCATTT CCACGACGGC TCACCCCTGA CCGCGCGGGA CGCGGCCTTC
TCCCTCACCA TCCTCAAGGA GAAGGGGCAC CCGACGATCG CCCAGGTGAT CCGCGACGTC
GCGGAGGCGA CGGCGGAGGG CGACGAGACC CTCGTCGTCC GCTTCGCCCC CGGCCGCAGC
CGCGACCTGC CGCTGATCGT CGCCGGACTG CCGATCTTCT CGGCGAAGTT CTTCGAAGGG
CGCGACTTCG AGGCCCAGAC TCTCAAGCCC CTGCTCGGCT CCGGCCCGTA TCAGGTCGGG
CGGATCGATA TCGGCCGCTT CATCGAACTG GAGCGTGTGA CCGATTACTG GGCGGCGGAT
CTTCCGGTGA TGGTCGGGCA AAACAACTTC GACCAGCTCC GCTACGAGTA TTTCCGCGAT
CGGCAGGTCG CCTTCGAGGC GTTCAAAGGC GGCGCCTACA CCTATCGCCA GGAATTCACC
TCGCGGATCT GGGCAACGGG CTACGACTTC CCCGCCGCGC GCGAGGGCCG CGTCAAGCGC
GAGACCCTGC CCGACACCTC GCCCGCCGCC ATCCAGGGCT GGTTCTTCAA CACCCGCCGC
GAGGTGTTCA AGGATCCGCG CGTGCGCGAG GCGATCGGCC TGTGCTTCGA CTTCCCCTGG
ACCAACCGCA CAGCGATGTT CGGCTCCTAC GAGCGCACCG TCTCGTTCTT CCAGAAGACC
GACCTGATGG CGACGGGCAA GCCCTCCGCG GAGGAGCTGG CCCTGCTCGA ACCCTTCAGC
GGGCAGGTGC CGGCCGAGGT GTTCGGCGAG GCCTGGACGC CGCCGGTTCC GGACGGCTCG
GGCCAGGACC GGGCGCTGCT CGCCCGCGCG GTGGCCCTGC TTAAGGAGGC GGGCTGCACC
CGCGAGGGCG GCGCCTTGCG GCTGCCGAGC GGCAAGCCGA TCGAGTTCGA ATTCCTCGAT
TCGGATTCCG TCTGGGAGCC GATCGTCCAG CCCTTCATCC GCAATCTCGG GTTGATCGGC
ATCAAGGCGC GCCAGCGGGC GGTCGATGCC GCGCAGTATC AGGCGCGGGT GCGCGACTTC
GACTTCGACA TCACCGCCCG CGCCGCCTCG GGCGACGCGA CGCCGGGGCC GGAGCTGCGC
GAGGCCTATG GCTCCCGCGC GGCGGCGATC CCCGGCTCCA ACAACCTCGC CGGCATCACC
GATCCGGTGA TCGACGCGCT GCTTGACCGC ATTGCCAACG CGGATTCGCG CGCGAGCCTC
ACCGTGGCCT GCCGCGCCCT CGACCGGGTG ATGCGGGCCG GCCGCTACTG GATCCCGATG
TGGTACTCGC CCGAGTACCG CCTCGCCCTG TGGGACATGT ACGGCCGCCC GGCGAAGCTG
CCGACCTATG GGCTCGGCGT GCCGGGCCTG TGGTGGTACG ACGAGGCCAA GGCGCGCCGG
ATCGGCCGGG GCTGA
 
Protein sequence
MSAGATRRSV VLGTGVIALA GALPRSARAE DVHKAHGGIH GLSSFGELKY APDFPNFDYV 
NPMAPRGGRF STTLVQTFGN QAFDTFDTLN PYVFRGNGAA GINLTFDSLM VRALDEPDAL
YGLVARSVEI SPDGLTYRFA LRPQAHFHDG SPLTARDAAF SLTILKEKGH PTIAQVIRDV
AEATAEGDET LVVRFAPGRS RDLPLIVAGL PIFSAKFFEG RDFEAQTLKP LLGSGPYQVG
RIDIGRFIEL ERVTDYWAAD LPVMVGQNNF DQLRYEYFRD RQVAFEAFKG GAYTYRQEFT
SRIWATGYDF PAAREGRVKR ETLPDTSPAA IQGWFFNTRR EVFKDPRVRE AIGLCFDFPW
TNRTAMFGSY ERTVSFFQKT DLMATGKPSA EELALLEPFS GQVPAEVFGE AWTPPVPDGS
GQDRALLARA VALLKEAGCT REGGALRLPS GKPIEFEFLD SDSVWEPIVQ PFIRNLGLIG
IKARQRAVDA AQYQARVRDF DFDITARAAS GDATPGPELR EAYGSRAAAI PGSNNLAGIT
DPVIDALLDR IANADSRASL TVACRALDRV MRAGRYWIPM WYSPEYRLAL WDMYGRPAKL
PTYGLGVPGL WWYDEAKARR IGRG