Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3428 |
Symbol | |
ID | 6132055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3804262 |
End bp | 3805539 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641643598 |
Product | extracellular solute-binding protein |
Protein accession | YP_001770250 |
Protein GI | 170741595 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.390518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.012309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGGA CCCTTTCCCT GGCCGCGATG GCGGCGGCGG CGATGCTCGC CGCCAGCGCC GCGTCGGCGG CATCGTTGAA GATCCTGGAC CACGGCAGCC GCGGCGCCGC CGAACTCGAC GCCATCGCGG CCCAGGTCGC GGCCTGGAAC CGATCGCACC CGGACATCCC GGCCGAACTC GTGACCCTGC CCAAGCCGAT CGAGAACCAG ACGGTTCAGG CGAAGGCCCT GGCCGGCACC TGGCCGGACA TCCTCGACTT CGACGGGCCG AGCTTCGCGA ACGCGGCCTG GGCCGGCCTG CTGGCGCCCC TCGACGACCT GCTCCCGCCC GACCTGATGC GCGCCCTGCT GCCGTCGATC CGCGCACAGG GCCTCTACGC CCCGGACGGC AAGATCTACG CCCTCGGCCA GTTCGATTCC GGGCTCGGAC TCTGGGCCTC CCGCTCGGCC CTGCGCCAGG CCGGGATCCG GATCCCCAGC GGCCTCGACG ATGCCTGGAC CGGCGAGGAG TTCGAGGCCG CGCTCGCCGC CCTCAAGCGC GCCGGCTACC CGACGCCCCT CGACATGAAG CTGAATTACG GCGTCGGCGA GTGGTACACC TACGGCTTCG CGCCGATCCT GCAATCGTAC GGCGGGGATC TGATCAACCG CACGACCTGG CAGGCCGAAG GGACGATCAA TTCCGAGGCG TCGATCGCGG CGTTGGGCCG GATCCAGTCC TGGATGAAGG CCGGGTACAT CGTGCCCGCC TCGGAAGGCG ACGACGCCTT CTACGGCAAG CGGAGCGCGG CCCTGGCTTT GGTCGGGCAC TGGATGTGGC CGACCCACAG CGCCGCCCTC GGCTCCGACC TGGTGCTGCT GCCGATGCCG CGCTTCGGCG CGCGCCACGT CACCGGGATG GGGAGCTGGA ACTGGGGGAT CTGGTCGGGC TCCCCGAACA AGGAGGCGGC CGCCAAGTTC CTGGAATTCC TGATGTCCGA GCCGGCGATG GAGGCGGTGG CCGGGGCGGC GGGCGCGATC CCGTCGCGCC AGGCGGCGGC GGAGCGCAAC CCGCTCTTCC GCCAGGGCGG GCCGATGGCG CTCTACCGCG AGCAGCTGAC CCGCATCGCG GTGCCCCGCC CGCCGCATCC GGCCTACCCG GTGATCTCGC GGGCCTTCGC CGCCGCGGTC AATTCGGTCA TGAAGGGCGA GGATCCGAAG CGGGCGCTCG ACCGGGCGGC GGCGGCGATC GACCAGGAGA TCGAGCAGAC CAACGGCTAC AAGCCCTTCG GCGGCTGA
|
Protein sequence | MKRTLSLAAM AAAAMLAASA ASAASLKILD HGSRGAAELD AIAAQVAAWN RSHPDIPAEL VTLPKPIENQ TVQAKALAGT WPDILDFDGP SFANAAWAGL LAPLDDLLPP DLMRALLPSI RAQGLYAPDG KIYALGQFDS GLGLWASRSA LRQAGIRIPS GLDDAWTGEE FEAALAALKR AGYPTPLDMK LNYGVGEWYT YGFAPILQSY GGDLINRTTW QAEGTINSEA SIAALGRIQS WMKAGYIVPA SEGDDAFYGK RSAALALVGH WMWPTHSAAL GSDLVLLPMP RFGARHVTGM GSWNWGIWSG SPNKEAAAKF LEFLMSEPAM EAVAGAAGAI PSRQAAAERN PLFRQGGPMA LYREQLTRIA VPRPPHPAYP VISRAFAAAV NSVMKGEDPK RALDRAAAAI DQEIEQTNGY KPFGG
|
| |