Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2875 |
Symbol | |
ID | 5835081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3222326 |
End bp | 3223255 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641368676 |
Product | periplasmic solute binding protein |
Protein accession | YP_001640336 |
Protein GI | 163852293 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.279263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTGG GCGTGGGGCG CGGCGTCGCC ATCGTGGTTC TGCTGTGGTT CGGCCTTGCA CCGTCGATGG CCGCGGCGGA CGGCGTGCGG CTGAAGGCGG TGGCGACCTT CTCGATCCTG GCCGACCTCG TGGCGCAGGT CGGCGGCGAG CAGGTCGCCG TGACGAGCCT CGTCGGGCCG GATGCCGACG CTCACGGCTA CTCCCCCACG CCGGGCGACG CCCGCAAGCT GGCTGAGGCG AATCTTGTCG TCGTCAACGG CCTCGGATTC GAGGGCTGGA TGGAGCGGCT GATCAAGGCT TCAGGCACCA AGGCGCCGGT CACCGTCGCC TCCAAGGGGG TGAAGACGGT CGCGGGCAGC CACGACCACG ATCACGCCGA GGATCACGGA CACGATCACG GCGACCACCC CGATCCGCAT GCCTGGCAGA ACGTGGCGAA CGCCAAGCTC TACGTCGCCA ACATCCGCGA CGGCCTCAGC GCCGCCGACC CGGACCACGC CTCGCTCTAT GCCGCGAACG CCGCCGCCTA CACGCAGAAG CTCGACGCGC TGGACGCGGA GATCCGCGCG GCTTTGGGTG CAATCCCGGA GGAGCGGCGG CGCATCATCA CCACGCACGA TTCCTTCGGC TATTTCAGCG CCGCCTACGG CATGCGCTTC CTGGCGCCGC AGGGCATCTC CACAGATAGC GAGGCCGGCC CGAAGGATGT CGCCCGCATC ATCCGCCAGA TCCGCCGGGA CAAGGTGCCG GCGGTGTTCG TGGAGAGCAT CGCCGACCCG CGGCTGATGC AGCAGATCGC CCGCGAGAGC GGCGCCAAGG TCGGCGGGCG GATCTACTCC GACGCGCTGA GCGCGCCGGG GGGACCGGCC CCCGGCTATC TGGAGATGAT GCGCGCCAAT CTCAGCGCGT TCCGGGACGC GCTGAGCTAA
|
Protein sequence | MGLGVGRGVA IVVLLWFGLA PSMAAADGVR LKAVATFSIL ADLVAQVGGE QVAVTSLVGP DADAHGYSPT PGDARKLAEA NLVVVNGLGF EGWMERLIKA SGTKAPVTVA SKGVKTVAGS HDHDHAEDHG HDHGDHPDPH AWQNVANAKL YVANIRDGLS AADPDHASLY AANAAAYTQK LDALDAEIRA ALGAIPEERR RIITTHDSFG YFSAAYGMRF LAPQGISTDS EAGPKDVARI IRQIRRDKVP AVFVESIADP RLMQQIARES GAKVGGRIYS DALSAPGGPA PGYLEMMRAN LSAFRDALS
|
| |