Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_4639 |
Symbol | |
ID | 7115207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 4918875 |
End bp | 4920473 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643527337 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002423341 |
Protein GI | 218532525 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTCC CCCGCCGCAC CCTCCTCCAG ACCGGCGCGG CGCTTGCTGC CGGGCTCGTC CTCCCCGCTC CCGCGCGGGC GGCCTCCCCT GTCTACCGCC GCGGCAACGA CGCCGACCCG GAAACGCTCG ATCCGCACAA GACCTCGACG GTGGCCGAGG CGCATCTCCT GCGCGACCTG TTCGAGGGGC TGCTGACCTA CGACAACCGC GGCACGATCA TCCCCGGCAT GGCCGAGCGT TGGACTGTCT CCGACGACCG CCTCACCTAC CGCTTCACCC TGCGGCCGGA CGGGCGCTGG TCGAACGGTG ATGCCGTGAC GGCCGACGAT TTCCTGTTCT CCCTGCGCCG CATCCTCGAT CCGAGGACGG CGGCGAAATA CGCCGAGGTG CTGTTCCCGA TCCGGGGAGC GGCCGCCGTC AATGCGGGCG AGCAGCCGCC GGAGACGCTG GGGGTGACGG CCCCCGATCC CCGCACCCTG GAGATCGGGC TCGCCGAGCC GGTGCCCTAC CTCCTCGAAC TCCTGACGCA CCAGACCTCG CTGCCGGTCC ACCGCCTCTC GCTGGAGCGC TGGGGCGATG CCTTCGCGCG GCCCGGCAAC CTCGTCTCGA ACGGCCCCTA CGCCCTGGTC GATTGGGTGC CGAACGACCG CATCACCCTG ACGAAAAACC CGCATTACCG CGACGCCGCG GCGATCCCGA TCGAGCGGGT GGACGTCATC CCGACCCCCG ACCTCGCTGC GGCGGTGCGG CGCTATGCGG CCGGCGAGAT CGATTCCCTC TCGGACCTGC CCGCCGACCA GATCGCCTCG CTGAAACAGC GCTTCGGCCC CCAGGTGCAG CTCGGACCGG GGCTCGGCCT GCTCGCCATC GCCTTCAACC TGCGCAAGAA ACCCTTCGAC GACGCGCGGG TGCGCCGCGC CTTGTCGCTG GCCATCGACC GGGAATTTCT GGCCGAGATC GTCTGGGGGC AGACCATGGC CCCGGCCTAT TCGTTCTGCC CGCCGGGCCT CGACAACGCC CTGCCGCCCC CGCTCCTGCC GGGGCGCGAG GATGGGCCGA TCGACCGCGA GGAGGAGGCG TTGCGGCTGC TGGCGGAAGC CGGCTACGGG CCGGGCAACC CGCTGACGGT CGAGTACCGC TTCAACGTCA CCGACAACAA CCGCAACACG GCGATCGCGC TCGCGGATGC GTGGCGCGGC ATCGGCGTCG TGACCCGCTT CGTCTCCACC GACGCCAAGA CCCACTTCGC GTATCTCCGC GACGGCGGCC CCTTCGACCT CGCCCGGATG TCCTGGGTCG CCGACTATTC CGATCCGCAG AATTTTCTGT TTTTGCTCCG CACCGGCAAT GACGGCTTCA ATGCCGGGCG CTGGTCGAAC GCGCGCTTTG ACGAACTGCT GACGCGGGCG GCGCAGGAGC GCGACGTGCC GGCCCGCGCG CGCATGCTGT TCGAGGCCGA AACCCTCGTG CTCGACGAAC TGCCCTGGGT GCCGCTGCTG CATTACCGCT CGAAGGCGCT CGTCTCGCCG CGGTTGCACG GGATGCACCC GAACATCCGC AACGTCGCCC CCACCCGCTA TCTCCGGCTC GATCCATGA
|
Protein sequence | MSLPRRTLLQ TGAALAAGLV LPAPARAASP VYRRGNDADP ETLDPHKTST VAEAHLLRDL FEGLLTYDNR GTIIPGMAER WTVSDDRLTY RFTLRPDGRW SNGDAVTADD FLFSLRRILD PRTAAKYAEV LFPIRGAAAV NAGEQPPETL GVTAPDPRTL EIGLAEPVPY LLELLTHQTS LPVHRLSLER WGDAFARPGN LVSNGPYALV DWVPNDRITL TKNPHYRDAA AIPIERVDVI PTPDLAAAVR RYAAGEIDSL SDLPADQIAS LKQRFGPQVQ LGPGLGLLAI AFNLRKKPFD DARVRRALSL AIDREFLAEI VWGQTMAPAY SFCPPGLDNA LPPPLLPGRE DGPIDREEEA LRLLAEAGYG PGNPLTVEYR FNVTDNNRNT AIALADAWRG IGVVTRFVST DAKTHFAYLR DGGPFDLARM SWVADYSDPQ NFLFLLRTGN DGFNAGRWSN ARFDELLTRA AQERDVPARA RMLFEAETLV LDELPWVPLL HYRSKALVSP RLHGMHPNIR NVAPTRYLRL DP
|
| |