Gene Mpop_4012 details


Gene Information

Locus tag: Mpop_4012
Symbol:
ID: 6312040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium populi BJ001
Kingdom: Bacteria
Replicon accession: NC_010725
Strand:
Start bp: 4282574
End bp: 4284145
Gene length: 1572 bp
Protein length: 523 aa
Translation table: 11
GC content: 66%
IMG OID: 642652710
Product: extracellular solute-binding protein family 5
Protein accession: YP_001926669
Protein GI: 188583224
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
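
The coordinate and length fields above are consistent with each other, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode a residue.
    return gene_length // 3 - 1


length = gene_length_bp(4282574, 4284145)
print(length)                     # 1572
print(protein_length_aa(length))  # 523
```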


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGA CGATCGACCG CAGGCGGTTT CTTCAGGCGA GCGCCGCCGC TGTGGGTTTT 
GCCCAGATCA ACCCCGATTT CCTGATCTCC TCGGCCTTCG CCCAATCGGG CAAGCCGCTG
GTCTTCCTCT CGGCCGAGAA CATCACCGGC AACTGGGATC CGACCGCCCA TACCACGCTC
TCGCAGAAGA ACATCGAGGG CTTCGTGATG GGCTTCCTGA CCCGCACGCC GATGACCCTC
GATGACCCCG GCAAGGTGGT CTACGAACTC GCCACCGACA TCAAGCTCCT CGATCCGCAC
CGCCTCCAGA TCAAGCTGCG CAAGGGCGTC CACTTCCACG ACGGCAAGCC GTTCGGGCCC
GAGGACGTGA AGGCGACCTT CGAATACGGC GCGGGCAAGG ACCGGCCGGC GCAATGGTAT
CCCGGTCCGA CCGAGACGCT GACGATCACC ACGCCCGACG ACGAGACCGT GATCGTCGAC
ACCTCGAAGG GCGGCTACCC GGCCCACCTC TTCATCTTCC TGGCCTCGTT CCTGCCGATG
ATGTCGGCCA AGGACATCGC CGAGGGGCCG GGCGGCGCCC TCACCCGGCG CCTGAACGGC
ACCGGCCCGT TCCGCTTCGT CGAGCAGCGC GGCAACGACA CCGTGCTCGC GGCCTATGAC
GGCTATTTCC GCGGCAAGCC GGGGATTCCG GGGATCAACT TCACCTTCAC CGGCGACTCG
ACCACGCGGA TGCTGTCGCT GATGAACGGC CAGGCCTCGA TCGTCGAGCG GCTCGAACCC
GAGCAGGTCG AGACGGTCAA GGGCAATCCG AAGATCGCGA TCAACGAGGT CGTCTCGGTC
GAGAACAAGT ATCTCTGGTT CCGCTGCTCC AAGCCGCCCT TCAACGACGT GCGGGTGCGC
ATGGCCGCCT GCCACTCCAT CGACCGGGCG ATGCTCCTGG AGATCCTGGG CGCGGCGGGC
CACGCCTCGG CGAACTTCAT CTCGCCGGTG AAGTTCGGCT ACATCGATCT CAAGAACTAC
CCGGCCTACG ACCCGGCCAA GGCCCAGGCG CTGCTGGCCG AGGCGGGCTT CCCCAAGGGC
AAGGGGCTGC CGCCGCTCGA ATACATCACC TCGGTCGGCT TCTACCCGAA GACCAAGGAG
TACGGCGAGG TCATCACCGC GATGCTCAAC GAGCAGGGCT TCCCGGTGAG TCTCACGGTG
CTGGAGCCGG CGGCCTGGAA CGAGCGGCTC TATCACCGCC CCGGCGGCGG GCCGGGCCAC
ATGGTCGATT GCGGCTGGTC CACCGGTTCG CCCGAGCCGG ATCTGGTGCT GCGCACCCAT
TTCCACTCCT CCTCGCACCG GATCACCGGC ATCGAGGATG CGGAGATCGA CGCCAGCCTC
GACAAGGAGC GCGCGGCGCC GACTCTGGAG GAGCGCAAGG CCGTCCTCCA GAACGAGACC
ATGCCGCTTC TGGCGGCCAA GATGCCGGCG CTGTCGCTGT TCACCTCGGT GATGATCCAC
GCGATGCAGC GGGAATTGAA GGGCCTCTAC ATCTACCCGG ACGGCTCCAT CGACGCGTCC
AAGACCGCCT GA
 
Protein sequence
MAKTIDRRRF LQASAAAVGF AQINPDFLIS SAFAQSGKPL VFLSAENITG NWDPTAHTTL 
SQKNIEGFVM GFLTRTPMTL DDPGKVVYEL ATDIKLLDPH RLQIKLRKGV HFHDGKPFGP
EDVKATFEYG AGKDRPAQWY PGPTETLTIT TPDDETVIVD TSKGGYPAHL FIFLASFLPM
MSAKDIAEGP GGALTRRLNG TGPFRFVEQR GNDTVLAAYD GYFRGKPGIP GINFTFTGDS
TTRMLSLMNG QASIVERLEP EQVETVKGNP KIAINEVVSV ENKYLWFRCS KPPFNDVRVR
MAACHSIDRA MLLEILGAAG HASANFISPV KFGYIDLKNY PAYDPAKAQA LLAEAGFPKG
KGLPPLEYIT SVGFYPKTKE YGEVITAMLN EQGFPVSLTV LEPAAWNERL YHRPGGGPGH
MVDCGWSTGS PEPDLVLRTH FHSSSHRITG IEDAEIDASL DKERAAPTLE ERKAVLQNET
MPLLAAKMPA LSLFTSVMIH AMQRELKGLY IYPDGSIDAS KTA
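
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It builds the codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, which this sketch does not model. The helper names (`clean`, `gc_content`, `translate`) are illustrative.

```python
from itertools import product

# Codon table in TCAG order. The amino acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCCAAGA CGATCGACCG CAGGCGGTTT CTTCAGGCGA GCGCCGCCGC TGTGGGTTTT"
print(translate(first_line))  # MAKTIDRRRFLQASAAAVGF
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field listed in this record.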