Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_6476 |
Symbol | |
ID | 6131701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 7121581 |
End bp | 7123512 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641646566 |
Product | extracellular solute-binding protein |
Protein accession | YP_001773169 |
Protein GI | 170744514 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.864629 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTCT GGCGGCGGGA GCGGCCTCTC GGGGGGCGGG GCGGGCGGGA TGGCGGCGCG GGGCGCCCGC GGGCGGCCGC GGCGCTCGCC GCCCTGGCGC TCCTCCTGCC GGGCCTCGCC CGCGCCGCCG ACGAGACCGC CGCGGCCCGC CTGCCCGCCG AGCCCCTGGT CGTCGACCTC GCGGCGGCCG GCCGAGAGCT CGGCCGGCCG GGCGGCGAGA TCGTCACCCT CGTCGCCAAG CCCCGGGACA TCCGCTACAT CTCGGCCTAC AGCTACGCCC GGCTCGTCGG CTACGACGAG AGCCTCGCCC TCAAGGCCGA CCTCCTCGCG CAGTACGAGT CGCAGGACGA CAAGGTCTTC ACCTTCACGC TGCGGGCCGG CCACCGCTGG TCGGACGGGC AGCCCTTCAC GGCGGAAGAC TTCCGCTACT GGTGGGAGGA CGTCGCCCTC AACAAGGAGC TGAGCCCCTC CGGCCCGCCG GAATTCATGA TCGTGGACGG GCAATTGCCG CGCTTCGAGG TGCTCGACCC GCTGCGCGTC CGCTTCACCT GGGACAGGCC GAACCCGCGC TTCCTGCCGG AACTCGCCTC GCCGCGCGAC CCCTTCATCT ACCGGCCGGC CCATTACCTG AAGCAGTTCC ACGCCCGCTA CGCGCCGAAG AACGACCTCG ACGCCGCCGT CAAGGCGCAG AAGGTCAAGG GCTGGGCCAC CCTGCACAAC CGGCTCGACG CGATGTTCGA GCAGACCAAT CCGGACCTGC CGACGCTGCA GCCCTGGCGC GTGACCAACA CGGCCCCCGC CGCGCGGTTC GTCTTCGTGC GCAACCCCTA CTACCATCGG GTGGACCGCA ACGGCCAGCA GCTCCCCTAC GTCGACCGGG TGCTGATGGA CATCGCGGCG GCCGGGCTCT TCGCCGCCAA GGCCAATGCC GGCGAGGCGG ACCTGCTCTT CCGCGGCCTC ACGATGGAGG ACATCCCGAT CCTGCGCGAG GGCGAGCGGG CCAAGGGCTA CCGCACGCAT CTCTGGCCGA GCGCCCGCGG CTCCGAGATC GCCCTCTACC CGAACCTGAC CGCGGCCGAT CCGGTCTGGC GCAGGCTCAA CCGCGACGTG CGCTTCCGGC GCGCCCTGTC GCTCGCCATC GACCGGCGCA CCCTCAACAA CGCGCTCCTG TTCGGGCTCG GCACCGAGGG GAACGACACG GTCGTGGAGG GCAGCCCGCT CTACAAGCCG GAATACCGCA CGCTCCACGC CCAGTACGAT CCCGAGGAGG CCTCCCGGCT CCTCGACACG GTCGGGCTCG GCGCGCGCAA CGGCGCCGGG ATCCGGCTCC TGCCGGACGG GCGGGAGCTC GAGATCGTGG TCGAGACCGA CGGCGAGAGC ATGATGCTCA CCGACGGGCT CACCCTGATC GCGGAATTCT GGCGCGAGGT CGGGGTCAAG CTCTTCGTGA AGCCCCAGGA CCGCACGGTG CTGCGCAACC GGGCCTATGC GGGGCTCACC ACCATGGTGG CGGCGCAGGG GCTCGACCTC GCCATGCCGA CCGCCGAGAT GCCGCCGGAC GAGCTCGCCC CCGTCCACCA GGACACCTAC GCGTGGCCGA AATGGGGGCA GTTCGTCGAG ACCAAGGGCA AGGAGGGCGA GCCGGTCGAC ATCCCGGAAG CCAAGACCCT CCTCGACCTC AACGCGCAGT GGATGGCGAC CGGCGACGCC GCCATGCAGG CGCGGATCTG GGGCGAGATG CTGCGCAACC GGGCCGAGAA CCAGTGGGTG ATCGGCACGG TCGCGGGAGC GCTGCAGCCG GTCGTCGCCA AGACCCGCCT CGTCAACCTT CCTGACCGGG CGACCTACAG CTGGAAGCCC ACCGCCATGA TCGGGGTCTA CCGCATGGAT GAGTTCTTCT GGGGCCCCGA GCGCAAAGAG GCCGCCCGGT GA
|
Protein sequence | MTVWRRERPL GGRGGRDGGA GRPRAAAALA ALALLLPGLA RAADETAAAR LPAEPLVVDL AAAGRELGRP GGEIVTLVAK PRDIRYISAY SYARLVGYDE SLALKADLLA QYESQDDKVF TFTLRAGHRW SDGQPFTAED FRYWWEDVAL NKELSPSGPP EFMIVDGQLP RFEVLDPLRV RFTWDRPNPR FLPELASPRD PFIYRPAHYL KQFHARYAPK NDLDAAVKAQ KVKGWATLHN RLDAMFEQTN PDLPTLQPWR VTNTAPAARF VFVRNPYYHR VDRNGQQLPY VDRVLMDIAA AGLFAAKANA GEADLLFRGL TMEDIPILRE GERAKGYRTH LWPSARGSEI ALYPNLTAAD PVWRRLNRDV RFRRALSLAI DRRTLNNALL FGLGTEGNDT VVEGSPLYKP EYRTLHAQYD PEEASRLLDT VGLGARNGAG IRLLPDGREL EIVVETDGES MMLTDGLTLI AEFWREVGVK LFVKPQDRTV LRNRAYAGLT TMVAAQGLDL AMPTAEMPPD ELAPVHQDTY AWPKWGQFVE TKGKEGEPVD IPEAKTLLDL NAQWMATGDA AMQARIWGEM LRNRAENQWV IGTVAGALQP VVAKTRLVNL PDRATYSWKP TAMIGVYRMD EFFWGPERKE AAR
|
| |