Gene Mchl_4051 details


Gene Information

Locus tag: Mchl_4051
Symbol:
ID: 7118056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 4267947
End bp: 4269518
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 66%
IMG OID: 643526770
Product: extracellular solute-binding protein family 5
Protein accession: YP_002422779
Protein GI: 218531963
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
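
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a stop codon not counted in the protein length. The function names are illustrative and not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(4267947, 4269518)
print(length_bp)                  # 1572
print(protein_length(length_bp))  # 523
```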


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.957525
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGGTA CAATGGACCG CAGACGGTTT CTTCAGGCGA GCGCCGCCGC TGTGGGTTTT 
GCCCAGATCA ATCCCGACTT CCTGGTCTCC TCGGCCTTCG CCCAATCGGG CAAGCCGCTG
GTCTTCCTCT CGGCCGAGAA CATCACCGGC AACTGGGACC CCACCGCCCA CACCACGCTC
TCGCAGAAGA ACATCGAGGG CTTCGTGATG GGCTTCCTGA CCCGCACGCC GATGACCCTC
GATGACCCCG GCAAGGTCGT CTACGAACTC GCTACCGACA TCCGGTTGCT CGATCCGTAC
CGCCTGCAGA TCAAGCTGCG CAAGGACGTG CAGTTTCACG ACGGCAAGCC GTTCGGGCCC
GAGGACGTCA AGGCGACCTT CGAGTACGGC GCGGGCAAGG ACCGGCCGGC GCAGTGGTAT
CCCGGCCCGA CCGAGACGCT GACGATCACC ACGCCCGACG ACGAGACCGT GATCGTCGAC
ACCTCGAAGG GCGGCTACCC CGCCCACCTC TTCATCTTCC TCGCCTCGTT CCTCCCGATC
CTCTCGGCCA AGGACGTGGC CGAGGGCCCG GGCGGCGCCC TCACCCGGCG CCTGAACGGC
ACCGGCCCGT TCCGCTTCGT CGAGCAGCGC GGCAACGACA CCGTGCTCAA GGCCCATGAC
GGCTATTTCC GCGGCAAACC CGGCATTCCC GGCATCAACT TCACCTTCAC CGGCGACTCG
ACCACGCGGA TGCTGTCGCT GATGAACGGC CAAGCCTCGA TCGTCGAGCG GCTCGAACCC
GAGCAGGTCG AGACAGTCAA GAACAATCCG AAGATCGCGA TCAACGAGGT CGTCTCGGTC
GAGAACAAGT ATCTCTGGTT CCGCTGCTCC AAGCCGCCCT TCAACGACGT GCGGGTGCGC
ATGGCCGCCT GCCACTCGAT CGACCGGGCG ATGCTCTTGG AGATCCTCGG CGCGGCGGGC
CACGCCTCGG CCAACTTCAT CTCGCCGGTG AAGTTCGGCT ACGTCGACCT GAAGAACTAC
CCGGCCTACG ACCCGGCCAA GGCCCAGGCG CTGCTGGCCG AGGCGGGCTT CCCCAAGGGC
AAGGGGCTGC CGCCGCTCGA ATACATCACC TCGGTCGGGT TCTACCCGAA GACCAAGGAG
TACGGCGAGG TCATCACCGC GATGCTCAAC GAGCAGGGCT TTCCGGTGAG CCTCACGGTG
CTGGAGCCGG CGGCCTGGAA CGAGCGGCTC TACCATCGCC CCGGCGGCGG ACCCGGCCAC
ATGGTCGATT GCGGCTGGTC CACCGCCTCG CCCGAGCCGG ATCTGGTGCT GCGCACCCAC
TTCCACTCCT CCTCGCACCG CATCACCGGC ATCGAGGATG CGGAGATTGA CGCGAGCCTC
GACAAGGAGC GCGCGGCGCC GACGCTGGAG GAGCGCAAGG CCATCCTCCA GAACGAGACG
ATGCCGCTGC TGGCCGCCAA GATGCCGGCG CTGTCGCTGT TCACCTCGGT GATGATCCAC
GCGATGCAGC AGGAGCTGAA GGGCCTCTAC ATCTATCCGG ACGGGTCGAT CGACGCCTCC
AAAACCGCCT GA
 
Protein sequence
MAGTMDRRRF LQASAAAVGF AQINPDFLVS SAFAQSGKPL VFLSAENITG NWDPTAHTTL 
SQKNIEGFVM GFLTRTPMTL DDPGKVVYEL ATDIRLLDPY RLQIKLRKDV QFHDGKPFGP
EDVKATFEYG AGKDRPAQWY PGPTETLTIT TPDDETVIVD TSKGGYPAHL FIFLASFLPI
LSAKDVAEGP GGALTRRLNG TGPFRFVEQR GNDTVLKAHD GYFRGKPGIP GINFTFTGDS
TTRMLSLMNG QASIVERLEP EQVETVKNNP KIAINEVVSV ENKYLWFRCS KPPFNDVRVR
MAACHSIDRA MLLEILGAAG HASANFISPV KFGYVDLKNY PAYDPAKAQA LLAEAGFPKG
KGLPPLEYIT SVGFYPKTKE YGEVITAMLN EQGFPVSLTV LEPAAWNERL YHRPGGGPGH
MVDCGWSTAS PEPDLVLRTH FHSSSHRITG IEDAEIDASL DKERAAPTLE ERKAILQNET
MPLLAAKMPA LSLFTSVMIH AMQQELKGLY IYPDGSIDAS KTA
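
As a rough cross-check of the two sequence blocks, the sketch below computes GC content and translates a nucleotide sequence into protein. It is a minimal illustration rather than the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the amino acid assignments of translation table 11 matching the standard genetic code. Table 11 also permits alternative start codons, which are not modeled here. Only the first row of the gene sequence is used as input; pasting the full gene sequence in its place lets the result be compared with the protein sequence and the reported 66% GC content.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; return upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    # A trailing incomplete codon, if any, is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_row = "ATGGCCGGTA CAATGGACCG CAGACGGTTT CTTCAGGCGA GCGCCGCCGC TGTGGGTTTT"
print(translate(first_row))  # MAGTMDRRRFLQASAAAVGF
print(f"{gc_content(first_row):.1f}% GC in the first row")
```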