Gene Information |
Locus tag | Mchl_4051 |
Symbol | |
ID | 7118056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 4267947 |
End bp | 4269518 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643526770 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002422779 |
Protein GI | 218531963 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
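The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and only serve this illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 4267947
END_BP = 4269518
GENE_LENGTH_BP = 1572
PROTEIN_LENGTH_AA = 523


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the assumptions stated above.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not affect this check, since the length of the interval is the same on either strand.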
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.957525 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGTA CAATGGACCG CAGACGGTTT CTTCAGGCGA GCGCCGCCGC TGTGGGTTTT GCCCAGATCA ATCCCGACTT CCTGGTCTCC TCGGCCTTCG CCCAATCGGG CAAGCCGCTG GTCTTCCTCT CGGCCGAGAA CATCACCGGC AACTGGGACC CCACCGCCCA CACCACGCTC TCGCAGAAGA ACATCGAGGG CTTCGTGATG GGCTTCCTGA CCCGCACGCC GATGACCCTC GATGACCCCG GCAAGGTCGT CTACGAACTC GCTACCGACA TCCGGTTGCT CGATCCGTAC CGCCTGCAGA TCAAGCTGCG CAAGGACGTG CAGTTTCACG ACGGCAAGCC GTTCGGGCCC GAGGACGTCA AGGCGACCTT CGAGTACGGC GCGGGCAAGG ACCGGCCGGC GCAGTGGTAT CCCGGCCCGA CCGAGACGCT GACGATCACC ACGCCCGACG ACGAGACCGT GATCGTCGAC ACCTCGAAGG GCGGCTACCC CGCCCACCTC TTCATCTTCC TCGCCTCGTT CCTCCCGATC CTCTCGGCCA AGGACGTGGC CGAGGGCCCG GGCGGCGCCC TCACCCGGCG CCTGAACGGC ACCGGCCCGT TCCGCTTCGT CGAGCAGCGC GGCAACGACA CCGTGCTCAA GGCCCATGAC GGCTATTTCC GCGGCAAACC CGGCATTCCC GGCATCAACT TCACCTTCAC CGGCGACTCG ACCACGCGGA TGCTGTCGCT GATGAACGGC CAAGCCTCGA TCGTCGAGCG GCTCGAACCC GAGCAGGTCG AGACAGTCAA GAACAATCCG AAGATCGCGA TCAACGAGGT CGTCTCGGTC GAGAACAAGT ATCTCTGGTT CCGCTGCTCC AAGCCGCCCT TCAACGACGT GCGGGTGCGC ATGGCCGCCT GCCACTCGAT CGACCGGGCG ATGCTCTTGG AGATCCTCGG CGCGGCGGGC CACGCCTCGG CCAACTTCAT CTCGCCGGTG AAGTTCGGCT ACGTCGACCT GAAGAACTAC CCGGCCTACG ACCCGGCCAA GGCCCAGGCG CTGCTGGCCG AGGCGGGCTT CCCCAAGGGC AAGGGGCTGC CGCCGCTCGA ATACATCACC TCGGTCGGGT TCTACCCGAA GACCAAGGAG TACGGCGAGG TCATCACCGC GATGCTCAAC GAGCAGGGCT TTCCGGTGAG CCTCACGGTG CTGGAGCCGG CGGCCTGGAA CGAGCGGCTC TACCATCGCC CCGGCGGCGG ACCCGGCCAC ATGGTCGATT GCGGCTGGTC CACCGCCTCG CCCGAGCCGG ATCTGGTGCT GCGCACCCAC TTCCACTCCT CCTCGCACCG CATCACCGGC ATCGAGGATG CGGAGATTGA CGCGAGCCTC GACAAGGAGC GCGCGGCGCC GACGCTGGAG GAGCGCAAGG CCATCCTCCA GAACGAGACG ATGCCGCTGC TGGCCGCCAA GATGCCGGCG CTGTCGCTGT TCACCTCGGT GATGATCCAC GCGATGCAGC AGGAGCTGAA GGGCCTCTAC ATCTATCCGG ACGGGTCGAT CGACGCCTCC AAAACCGCCT GA
|
Protein sequence | MAGTMDRRRF LQASAAAVGF AQINPDFLVS SAFAQSGKPL VFLSAENITG NWDPTAHTTL SQKNIEGFVM GFLTRTPMTL DDPGKVVYEL ATDIRLLDPY RLQIKLRKDV QFHDGKPFGP EDVKATFEYG AGKDRPAQWY PGPTETLTIT TPDDETVIVD TSKGGYPAHL FIFLASFLPI LSAKDVAEGP GGALTRRLNG TGPFRFVEQR GNDTVLKAHD GYFRGKPGIP GINFTFTGDS TTRMLSLMNG QASIVERLEP EQVETVKNNP KIAINEVVSV ENKYLWFRCS KPPFNDVRVR MAACHSIDRA MLLEILGAAG HASANFISPV KFGYVDLKNY PAYDPAKAQA LLAEAGFPKG KGLPPLEYIT SVGFYPKTKE YGEVITAMLN EQGFPVSLTV LEPAAWNERL YHRPGGGPGH MVDCGWSTAS PEPDLVLRTH FHSSSHRITG IEDAEIDASL DKERAAPTLE ERKAILQNET MPLLAAKMPA LSLFTSVMIH AMQQELKGLY IYPDGSIDAS KTA
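The gene and protein sequences above can be related with a short script. This is a sketch under stated assumptions, not the database's own method: it uses only the Python standard library, builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard table and differs in the permitted start codons), and assumes the gene sequence is already given in coding orientation, which the leading `ATG` suggests. The function names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence above give the first 7 residues.
print(translate("ATGGCCGGTA CAATGGACCG C"))  # MAGTMDR
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence field (after `clean`) is one way to check the record for internal consistency; `gc_fraction` on the same input can be compared with the reported GC content. Note that this sketch does not handle alternative start codons, which table 11 permits and which are translated as methionine when they initiate a protein.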
|
| |