Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_2701 |
Symbol | |
ID | 6129041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3000255 |
End bp | 3001859 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641642915 |
Product | extracellular solute-binding protein |
Protein accession | YP_001769574 |
Protein GI | 170740919 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGCCAT GGTCTTTCCG CAGCCCGATC CTGGGTCTCG TTCTCGCTGC CGTCCTGGCC CTGCCGGCCG CGGCGCAGGG CGTGCTCCGC ATCGGGATGA CCGCCTCCGA CATCCCGCTG ACCACCGGCC AGGCGGACAA TGGCGGCGAG GGCATGCGCT TCATGGGCTA CACGGCCTAT GACGCGCTCA TCAACTGGGA CCTCGGCAGT GCCGACAAGG CCTCCGAACT GACCCCGGGC CTCGCCACCG CCTGGAAAAC CGACCCCGAC GACACGCGCA AGTGGATCTT CACCCTGCGC GAGGGCGTGA CCTTCCACGA CGGCTCGCCC TTCGACGCCG ACGCGGTGGT CTGGAACCTC GACAAGCTCC TCAAGAACGA CAGCCCGCAA TTCGACCCGC GCCAATCGGC GCAGGGGCGC ACCCGCATCC CGAGCGTGGC GAGCTACCGG GCCGTCGACC CCAAGACGGT CGAGATCGTC ACCAAGACCC CGGACGCCAC CTTCCCGTTC CAGATCGCCT GGATCCTGAT GTCCTCGCCG GCGAACTGGG AGAAGCAGGG CCGCAGCTGG GACGCGGTCG CGAAGGCGCC CTCCGGCACC GGCCCGTGGA AGATCACCAC CTTCGTGCCC CGGGAGCGGG CGGAGCTCAC CCCGAACAAG GCGTACTGGG ACAAGGCGCG GGTGCCCAAG CTCGACAAGC TGGTGCTGAT CCCCCTGCCC GAGGCCAATG CCCGCGTGGC GGCCCTGCGC TCGGGCCAGG TCGACTGGAT CGAGGCGCCG GCCCCGGACG CGCTCGCGTC GCTCAAGGCG GCGGGCTTCC GGATCGTCAC GAACCTCTAC CCGCACAATT GGACGTGGCA CCTGTCGCGG GTCGAGGGCT CGCCCTGGAA CGACATCCGG GTCCGCAAGG CGGCGAACCT CGCGGTCGAC CGCGAGGGGC TCAAGGAGTT CCTGGGCGGC CTCGCGGTGC CGGCCGAGGG CTTCATGACC CCGGGCCATC CCTGGTTCGG CACGCCGGCC TTCAAGGTGA AGTACGACCC CGAACAGGCC AAGGTGCTCC TGAAGGAGGC CGGCTACGGC CCGAACAAGC CGGTCACCAC CAAGATCCTG ATCTCGGCCT CCGGCTCGGG TCAGATGCAG CCTCTGCCGA TGAACGAGTT CATCCAGCAG AACCTCGCCG AGGTCGGCAT CAAGGTCGAT TTCGAGGTCG TGGAATGGAA CACGCTCATC AACATCTGGC GCGCCGGCGC CAAGAGCGAG AGCGCGCGGG GCGCCACCGG CATGAACTAC TCCTACCTGA TCCAGGATCC GTTCACCGCC TTCATCCGCC ACGCCCAGTG CAACCTGGCG CCCCCGAACG GCACCAACTG GGGCTCCTAC TGCGACCCTG AGATGGACAA GCTGTTCGAT CAGGTTCGCA CCACCTTCGA TCCGGCGGCG CAGACCGCCG TGCTGCGGAA GATCCACGAG AAATTCGTCG ACGACGCGCT CTTCCTGATG GTGACGCACG ACGTGAACCC GCGGGCGATG AGCCCGAAGG TGAAGGGCTT CGTCCAGGCG CAGAGCTGGT TCCAGAACTT CTCGTCGATC TCGATGGACA AGTGA
|
Protein sequence | MRPWSFRSPI LGLVLAAVLA LPAAAQGVLR IGMTASDIPL TTGQADNGGE GMRFMGYTAY DALINWDLGS ADKASELTPG LATAWKTDPD DTRKWIFTLR EGVTFHDGSP FDADAVVWNL DKLLKNDSPQ FDPRQSAQGR TRIPSVASYR AVDPKTVEIV TKTPDATFPF QIAWILMSSP ANWEKQGRSW DAVAKAPSGT GPWKITTFVP RERAELTPNK AYWDKARVPK LDKLVLIPLP EANARVAALR SGQVDWIEAP APDALASLKA AGFRIVTNLY PHNWTWHLSR VEGSPWNDIR VRKAANLAVD REGLKEFLGG LAVPAEGFMT PGHPWFGTPA FKVKYDPEQA KVLLKEAGYG PNKPVTTKIL ISASGSGQMQ PLPMNEFIQQ NLAEVGIKVD FEVVEWNTLI NIWRAGAKSE SARGATGMNY SYLIQDPFTA FIRHAQCNLA PPNGTNWGSY CDPEMDKLFD QVRTTFDPAA QTAVLRKIHE KFVDDALFLM VTHDVNPRAM SPKVKGFVQA QSWFQNFSSI SMDK
|
| |