Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_0822 |
Symbol | |
ID | 6130709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 934336 |
End bp | 935646 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641641136 |
Product | extracellular solute-binding protein |
Protein accession | YP_001767810 |
Protein GI | 170739155 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTCT TCGACCGCCG TCAGATCCTC AAGGCGGGCG CCGCGCTCGG CCTGTCCGGG GCCGCTTGCC TCGACGGCTT CGCCAGGGCC TGGGCCCAGG AGAGCCAGTG GAAGCCCGAG CCCGGCGCCT CGCTGAAGCT CCTGCGCTGG AAGCGCTTCA TCCCCTCGGA GGACGAGGCC TTCATGCGCC TCGTCGACGC CTTCACCAAG GCGACGGGCG TGCCGGTGAG CGTCACGAGC GAGTCCTTCG ACGACATCCA GCCGAAGGCC TCGGTCGCCG CCAATACCGG CCAGGGCCCC GACATGGTCT GGGGCCTCTA CTCCTTCCCG GCTCTGTTCC CGTCGAAATG CCTCGAGGTC GGCGACGTCG CCGACTACCT CGGCAAGAAG TACGGCGGCT GGCTGCCCGC CGCCGAGGCC TACGGCAAGG TGAAGGGCAA GTGGATCGCC ATCCCGATGG CCTTCAACGG CGGCTACCTC AACTACCGGA TCGCGGCCAT GCAGAAGGCC GGGTTCAGCA AGTTCCCGGA GGATCTCGGC GGCTTCCTCG AACTCTGCCG CGCGCTCAAG AAGAACAACA CGCCGTCCGG CTTCGCCCTC GGCCACGCCA CCGGCGACGC CAATTCCTGG ATCCACTGGG CGCTGTGGTC GCACGACGCC TACCTGGTCG ACGCCAACGA GAAGATCATC ATCAACTCGC CGGAGACCGC CAAGGCGCTC GAATACGTCA AGAGCCTCTA CGAGACCTTC ATCCCCGGGA CGGTCTCGTG GAACGATTCC TCGAACAACA AGGCGTTCCT GTCCGGCGAG CTCTACCTGA CCAATAACGG CATCTCGATC TACGCCGCAG CCAAGACCGA GCGCAAGGAC ATCGCCGAGG ACATGGACCA CGCGGTCTAC CCGGTCGGCA AGTCCGGCAA GCCGACCGAG TTCCAGCTCG CCTTCCCGAT CCTGGCCTAC ACCTACACGA AGGCCCCGAA CGCCTGCAAA GCCTTCATGG CCTTCGCGCT GGAGGCGCAG AACTACAATG CGTGGCTGCA GGCGGCGCAG GGCTACCTCT GCCACCCGCT GAAGGCCTAC GCCAACAACC CGATCTGGAC CTCGGACCCG AAGAACAAGG TCTTCGGCGA GGCCTCGACC CGGACGCTGG CGGCGGGCGG CCTCGCCCCG GTGAGCGAGA AGGTGGCGGC CGTGCTCGCC GACTTCGTCA TCGTCGACAT GTTCGCCTCC TACTGCACCG GCCGCGAGGA CGTGAAGGGC GCCATCCGCA CGGCGGAGCG GCAGGCCCAG CGCATCTTCC GGTCGGCGTG A
|
Protein sequence | MTFFDRRQIL KAGAALGLSG AACLDGFARA WAQESQWKPE PGASLKLLRW KRFIPSEDEA FMRLVDAFTK ATGVPVSVTS ESFDDIQPKA SVAANTGQGP DMVWGLYSFP ALFPSKCLEV GDVADYLGKK YGGWLPAAEA YGKVKGKWIA IPMAFNGGYL NYRIAAMQKA GFSKFPEDLG GFLELCRALK KNNTPSGFAL GHATGDANSW IHWALWSHDA YLVDANEKII INSPETAKAL EYVKSLYETF IPGTVSWNDS SNNKAFLSGE LYLTNNGISI YAAAKTERKD IAEDMDHAVY PVGKSGKPTE FQLAFPILAY TYTKAPNACK AFMAFALEAQ NYNAWLQAAQ GYLCHPLKAY ANNPIWTSDP KNKVFGEAST RTLAAGGLAP VSEKVAAVLA DFVIVDMFAS YCTGREDVKG AIRTAERQAQ RIFRSA
|
| |