Gene Mchl_4639 details

Gene Information

Locus tag: Mchl_4639
Symbol:
ID: 7115207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 4918875
End bp: 4920473
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 70%
IMG OID: 643527337
Product: extracellular solute-binding protein family 5
Protein accession: YP_002423341
Protein GI: 218532525
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
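
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are both inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the terminal stop codon encodes no residue


# Start bp and End bp as listed in the record.
length = gene_length_bp(4918875, 4920473)
print(length, protein_length_aa(length))
```

Under these assumptions the two results agree with the Gene Length and Protein Length fields of the record.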


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTCC CCCGCCGCAC CCTCCTCCAG ACCGGCGCGG CGCTTGCTGC CGGGCTCGTC 
CTCCCCGCTC CCGCGCGGGC GGCCTCCCCT GTCTACCGCC GCGGCAACGA CGCCGACCCG
GAAACGCTCG ATCCGCACAA GACCTCGACG GTGGCCGAGG CGCATCTCCT GCGCGACCTG
TTCGAGGGGC TGCTGACCTA CGACAACCGC GGCACGATCA TCCCCGGCAT GGCCGAGCGT
TGGACTGTCT CCGACGACCG CCTCACCTAC CGCTTCACCC TGCGGCCGGA CGGGCGCTGG
TCGAACGGTG ATGCCGTGAC GGCCGACGAT TTCCTGTTCT CCCTGCGCCG CATCCTCGAT
CCGAGGACGG CGGCGAAATA CGCCGAGGTG CTGTTCCCGA TCCGGGGAGC GGCCGCCGTC
AATGCGGGCG AGCAGCCGCC GGAGACGCTG GGGGTGACGG CCCCCGATCC CCGCACCCTG
GAGATCGGGC TCGCCGAGCC GGTGCCCTAC CTCCTCGAAC TCCTGACGCA CCAGACCTCG
CTGCCGGTCC ACCGCCTCTC GCTGGAGCGC TGGGGCGATG CCTTCGCGCG GCCCGGCAAC
CTCGTCTCGA ACGGCCCCTA CGCCCTGGTC GATTGGGTGC CGAACGACCG CATCACCCTG
ACGAAAAACC CGCATTACCG CGACGCCGCG GCGATCCCGA TCGAGCGGGT GGACGTCATC
CCGACCCCCG ACCTCGCTGC GGCGGTGCGG CGCTATGCGG CCGGCGAGAT CGATTCCCTC
TCGGACCTGC CCGCCGACCA GATCGCCTCG CTGAAACAGC GCTTCGGCCC CCAGGTGCAG
CTCGGACCGG GGCTCGGCCT GCTCGCCATC GCCTTCAACC TGCGCAAGAA ACCCTTCGAC
GACGCGCGGG TGCGCCGCGC CTTGTCGCTG GCCATCGACC GGGAATTTCT GGCCGAGATC
GTCTGGGGGC AGACCATGGC CCCGGCCTAT TCGTTCTGCC CGCCGGGCCT CGACAACGCC
CTGCCGCCCC CGCTCCTGCC GGGGCGCGAG GATGGGCCGA TCGACCGCGA GGAGGAGGCG
TTGCGGCTGC TGGCGGAAGC CGGCTACGGG CCGGGCAACC CGCTGACGGT CGAGTACCGC
TTCAACGTCA CCGACAACAA CCGCAACACG GCGATCGCGC TCGCGGATGC GTGGCGCGGC
ATCGGCGTCG TGACCCGCTT CGTCTCCACC GACGCCAAGA CCCACTTCGC GTATCTCCGC
GACGGCGGCC CCTTCGACCT CGCCCGGATG TCCTGGGTCG CCGACTATTC CGATCCGCAG
AATTTTCTGT TTTTGCTCCG CACCGGCAAT GACGGCTTCA ATGCCGGGCG CTGGTCGAAC
GCGCGCTTTG ACGAACTGCT GACGCGGGCG GCGCAGGAGC GCGACGTGCC GGCCCGCGCG
CGCATGCTGT TCGAGGCCGA AACCCTCGTG CTCGACGAAC TGCCCTGGGT GCCGCTGCTG
CATTACCGCT CGAAGGCGCT CGTCTCGCCG CGGTTGCACG GGATGCACCC GAACATCCGC
AACGTCGCCC CCACCCGCTA TCTCCGGCTC GATCCATGA
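
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted exactly as printed here, in space-separated groups of ten bases across several lines; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the database may round or compute its own figure differently.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence only; paste the full block
# to compare against the GC content field of the record.
first_line = "ATGAGCCTCC CCCGCCGCAC CCTCCTCCAG ACCGGCGCGG CGCTTGCTGC CGGGCTCGTC"
print(f"{gc_fraction(first_line):.0%}")
```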
 
Protein sequence
MSLPRRTLLQ TGAALAAGLV LPAPARAASP VYRRGNDADP ETLDPHKTST VAEAHLLRDL 
FEGLLTYDNR GTIIPGMAER WTVSDDRLTY RFTLRPDGRW SNGDAVTADD FLFSLRRILD
PRTAAKYAEV LFPIRGAAAV NAGEQPPETL GVTAPDPRTL EIGLAEPVPY LLELLTHQTS
LPVHRLSLER WGDAFARPGN LVSNGPYALV DWVPNDRITL TKNPHYRDAA AIPIERVDVI
PTPDLAAAVR RYAAGEIDSL SDLPADQIAS LKQRFGPQVQ LGPGLGLLAI AFNLRKKPFD
DARVRRALSL AIDREFLAEI VWGQTMAPAY SFCPPGLDNA LPPPLLPGRE DGPIDREEEA
LRLLAEAGYG PGNPLTVEYR FNVTDNNRNT AIALADAWRG IGVVTRFVST DAKTHFAYLR
DGGPFDLARM SWVADYSDPQ NFLFLLRTGN DGFNAGRWSN ARFDELLTRA AQERDVPARA
RMLFEAETLV LDELPWVPLL HYRSKALVSP RLHGMHPNIR NVAPTRYLRL DP
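
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch in plain Python, assuming the amino acid assignments of translation table 11 (which match the standard code; table 11 differs in its permitted start codons, which do not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (bases T, C, A, G), paired with the
# amino acid string shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS printed in 10-base groups; a trailing stop codon is dropped."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Simplification: alternative start codons are not converted to M.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence and its last 9 bases.
print(translate("ATGAGCCTCC CCCGCCGCAC CCTCCTCCAG"))
print(translate("GATCCATGA"))
```

Translating the first 30 bases and the last 9 bases reproduces the first ten and last two residues of the protein sequence listed above.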