Gene Mpop_4787 details


Gene Information

Locus tag: Mpop_4787
Symbol:
ID: 6310277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium populi BJ001
Kingdom: Bacteria
Replicon accession: NC_010725
Strand:
Start bp: 5115018
End bp: 5116616
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 70%
IMG OID: 642653466
Product: extracellular solute-binding protein family 5
Protein accession: YP_001927418
Protein GI: 188583973
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
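
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the coding sequence includes a terminal stop codon (the gene sequence on this page ends in TGA). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return length_bp // 3 - 1


length = gene_length_bp(5115018, 5116616)
print(length, protein_length_aa(length))
```

With the values from this record, the result matches the listed 1599 bp and 532 aa.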


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.690041
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTCC CCCGCCGCAC CCTCCTGCAG ACCGGCGCGG CGCTCGCCGC CGGGCTCGCC 
CTCCCCGCGC CCGCGCGGGC GGCCTCCCCC GTCTATCGAC GCGGCAACGA CGCCGATCCC
GAGACTCTCG ACCCGCACAA GACCTCGACG GTGGCCGAAG CGCACATCCT GCGCGACCTG
TTCGAGGGAC TTCTGACCTA CGACAACCGC GGCACCATCA TCCCCGGCAT GGCCGAGCGC
TGGACCGTCT CCGACGACCG CCTCACCTAC CGCTTCCATC TCCGGCCCGA CGGGCGCTGG
TCGAACGGCG ATGCCGTCAC GGCCGAGGAT TTCCTGTTCT CGCTACGCCG CATCCTCGAT
CCGAAGACGG CGGCGAAATA CGCGGAGGTG CTGTTCCCGA TCCGCGGGGC GGCCGCCGTC
AATGCGGGCG AGCAGCCGCC GGAGACCCTC GCGGTCGCCG CCCCCGAGGC CCGGACGCTG
GAGATCGGGC TCGCCGAGCC GGTGCCCTAT CTCCTCGAAC TCCTCACCCA CCAGACCTCG
CTGCCGGTCC ACCGTCCCTC GCTGGAGCGC TGGGGCGATG CCTTCGCGCG GCCCGGCAAC
CTCGTCTCGA ACGGCCCCTA CGCCCTGGTC GATTGGGTGC CGAACGACCG CATCACCCTG
ACCAAGAACC CGCATTACCG CGACGCCGCC GCGATCCCGA TCGAGCGGGT CGACGTCATC
CCGACCCCGG ACCTCGCCGC GGCGGTGCGG CGCTATGCGG GCCGTGAGAT CGATTCGCTG
TCCGATCTGC CCGCCGATCA GATCGCCTCG CTGACACAGC GCTTCGGCAC CCAGGTGCAG
CTCGGGCCGG GGCTCGGCCT GCTCGCCATC GCCTTCAACC TGCGAAAAGA ACCCTTCGGC
GATGTCCGGG TGCGCCGCGC CCTGTCGATC GCCATCGACC GGGAGTTCCT GGCGGACATC
GTATGGGGGC AGACCATGGC CCCGGCCTAT TCCTTCTGCC CGCCGGGCCT CGACAACGCC
CTGCCGCCCC CGCTCCTGCC GGGGCGCGAG GACGGGCCGA TCGACCGCGA GGAGGAGGCG
CTGCGGCTGC TGGCGGAGGC CGGTTACGGG CCGGGCAACC CGCTGGCGAT CGAGTACCGC
TTCAATGTCA CCGACAACAA CCGCAACACC GCGATCGCGC TGGCGGACGC GTGGCGCGGC
ATCGGCGTCG AGACCCGCTT CGTCTCCACC GACGCCAAGA CCCATTTCGC CTATCTGCGC
GACGGCGGCA CCTTCGACCT CGCCCGGATG TCTTGGGTCG CCGACTATTC CGACCCGCAG
AACTTCCTAT TCCTGCTGCG CACCGGCAAT GACGGGTTCA ATGCCGGGCG CTGGTCGAAC
CCGGGCTTCG ACGGGCTGCT GACGCGGGCG GCGGGGGAGC GCGACGTTCA GGCCCGCGCG
CGGATGCTGT TCGACGCCGA AAAAATCGTG CTCGACGAAC TGCCCTGGCT GCCGCTGCTG
CATTACCGCT CGAAGGCGCT GGTCTCGCCG CGGCTGCACG GGATGCACCC GAACATCCGC
AACGTCGCCC CCACCCGCTA CCTCCGGCTC GATCCGTGA
 
Protein sequence
MSLPRRTLLQ TGAALAAGLA LPAPARAASP VYRRGNDADP ETLDPHKTST VAEAHILRDL 
FEGLLTYDNR GTIIPGMAER WTVSDDRLTY RFHLRPDGRW SNGDAVTAED FLFSLRRILD
PKTAAKYAEV LFPIRGAAAV NAGEQPPETL AVAAPEARTL EIGLAEPVPY LLELLTHQTS
LPVHRPSLER WGDAFARPGN LVSNGPYALV DWVPNDRITL TKNPHYRDAA AIPIERVDVI
PTPDLAAAVR RYAGREIDSL SDLPADQIAS LTQRFGTQVQ LGPGLGLLAI AFNLRKEPFG
DVRVRRALSI AIDREFLADI VWGQTMAPAY SFCPPGLDNA LPPPLLPGRE DGPIDREEEA
LRLLAEAGYG PGNPLAIEYR FNVTDNNRNT AIALADAWRG IGVETRFVST DAKTHFAYLR
DGGTFDLARM SWVADYSDPQ NFLFLLRTGN DGFNAGRWSN PGFDGLLTRA AGERDVQARA
RMLFDAEKIV LDELPWLPLL HYRSKALVSP RLHGMHPNIR NVAPTRYLRL DP
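
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC fraction calculation of the kind that could underlie the "GC content" field. The helper names are hypothetical, and how this database actually computes or rounds its GC value is not stated on the page.

```python
def clean_sequence(text: str) -> str:
    """Join block-formatted sequence text into one uppercase string."""
    # str.split() with no arguments drops spaces, tabs, and line breaks.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full "Gene sequence" block in place of this short excerpt.
excerpt = clean_sequence("ATGAGCCTCC CCCGCCGCAC")
print(len(excerpt), gc_fraction(excerpt))
```

Running this on the full gene sequence rather than the excerpt gives a value that can be compared with the GC content listed under Gene Information.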