Gene Information |
Locus tag | M446_2974 |
Symbol | |
ID | 6129920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3298924 |
End bp | 3300516 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641643165 |
Product | extracellular solute-binding protein |
Protein accession | YP_001769820 |
Protein GI | 170741165 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
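The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates (so length is end minus start plus one) and that the CDS includes its stop codon, which the record implies because 1593 bp is 531 codons for a 530 aa protein. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if includes_stop else codons


# Values copied from the record: Start bp, End bp.
length_bp = gene_length(3298924, 3300516)
length_aa = protein_length(length_bp)
print(length_bp)  # 1593
print(length_aa)  # 530
```

The strand field does not affect this calculation: a gene on the minus strand spans the same number of bases between its start and end coordinates.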
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0342651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAGAC GGACCCGACC GGCCCCCCGC CTCGCCCGCA CCGCCCCGCG CGGTCGCGGA GCGGCGGCCC TGCTCCTTGC CGGGACGGCC TTCCTGGCCG GAACGGCCTT CCTGGCCGGG GCGGCGGGCC CGGCCCTCGC CGGCAAGGCC AACGACACCC TGGTCTACGC CTCGGACAGC GAGCCCGAGA ACGTCAGCCC CTATCACAAC AGCGCCCGCG AGGGGGTCAT CCTGGCCCGC AACGCCTGGG ACACGCTGCT CTACCGCGAT CCCGCCACCG GCACCTACCA GCCGATGCTG GCGACCGCCT GGACATGGGC CGATCCCGTC ACCCTCGACC TCACCATCCG GGAGGGCGTG GTCTTCCACA ACGGCGATCC GCTCACGCCG GAGGACGTCG CCTTCACCTT CAACTACGTG CTCACGCCCG AGGCGCGCAC GGTCACCAAG CAGAACGTCG ACTGGATGAA GTCCACCGAG GTGCTGGGCC CGCACACGGT GCGCATCCAC CTGAAGGCGC CGTTCCCGGC CGCGCTCGAA TACCTCGCGG GTCCGACCCC GATCTTCCCG GCGGCCTATT TCAAGAAGGT GGGGCTCGAC GGCTTCGCCA AGGCGCCGGT CGGCACGGGG CCCTACCGGA TCGTCAGCGT CGAGAGCGGG CGCGGCGTCA AGCTCGAGCG CTTCGAGAAG TACTGGTCCG GCAGCCCGAT CGGGCGGCCG AAGATCGGCA AGCTCGAATT CCGGGTCATC CCGGACGCCG ACAGCCGGAT GGCCGAGCTC GTCACCGGCG GCATCGACTG GATCTGGCGC GTGCCGAGCG ACCAGGCCGA TCAGCTGCGC GCGGCGCCCG GGATCACGGT GCTGAGCGCC GAGACGATGC GGGTCGGCTT CCTGCAATTC GACGTCGGCG GCCGGGCGAT GGAGAAGTCG CCCCTCAGGG ACGTGCGGGT GCGCCGGGCG ATCTCCTACG CGATCGACCG CAAGGCGATG GTCGACAACC TCGTGCGCGG CGGCGCGCGC GTGATGAACG TGCTGTGCTT CTCCGGGCAG TTCGGCTGCG TCGAGGAGGG CGCCCCGCGC TACGCCTACG ATCCCGCCAA GGCCAAGGCG CTGCTCAAGG AGGCGGGCTA CCCGGACGGG TTCGAGATCG ACCTCGCGGC CTATCGCGAG CGCGATTACG CCGAGGCCGT GATCGGCTAC CTGCGGGCGG TCGGCATCCG GGCGCGGCTC AACTACCTGC GCTACGCCGC CTTCCGGGAC GCGCTGCGCG GCGGCAAGGT CTCGATCGGC TTCCAGACCT GGGGCTCGTT CTCGGTCAAC GACGTCTCGG CCTTCACGGG CGTGTATTTC CGCGGCGGCG ACGAGGATCT GACCCGCGAC CCGGCGGTGA TCGCGGCGCT CCAGGCCGGC GACACCGCGA GCGACCCGGG CGAGCGCAAG GCGAAGTACG CCGAGGCGCT CTCGCGCATC GCCGGCGAGG CCTACGCGCT GCCGATGTTC TCCTACCCGT CGAACTACGC CTTCACCCAG GACCTGAACT TCACGGCGCA GCCCGACGAG GTGCCGCGCT TCTACGCCGC CTCCTGGAAG TGA
|
Protein sequence | MLRRTRPAPR LARTAPRGRG AAALLLAGTA FLAGTAFLAG AAGPALAGKA NDTLVYASDS EPENVSPYHN SAREGVILAR NAWDTLLYRD PATGTYQPML ATAWTWADPV TLDLTIREGV VFHNGDPLTP EDVAFTFNYV LTPEARTVTK QNVDWMKSTE VLGPHTVRIH LKAPFPAALE YLAGPTPIFP AAYFKKVGLD GFAKAPVGTG PYRIVSVESG RGVKLERFEK YWSGSPIGRP KIGKLEFRVI PDADSRMAEL VTGGIDWIWR VPSDQADQLR AAPGITVLSA ETMRVGFLQF DVGGRAMEKS PLRDVRVRRA ISYAIDRKAM VDNLVRGGAR VMNVLCFSGQ FGCVEEGAPR YAYDPAKAKA LLKEAGYPDG FEIDLAAYRE RDYAEAVIGY LRAVGIRARL NYLRYAAFRD ALRGGKVSIG FQTWGSFSVN DVSAFTGVYF RGGDEDLTRD PAVIAALQAG DTASDPGERK AKYAEALSRI AGEAYALPMF SYPSNYAFTQ DLNFTAQPDE VPRFYAASWK
|
| |
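The sequence fields above are printed in blocks of ten residues separated by spaces, so they need to be cleaned before any computation. The following is a minimal sketch of stripping that grouping and computing a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not expected to equal the 71% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGCTGAGAC GGACCCGACC GGCCCCCCGC")
print(len(prefix))          # 30
print(prefix[:3])           # ATG
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

Passing the complete gene sequence through the same two functions would give the value to compare against the GC content field.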