Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3951 |
Symbol | |
ID | 5834221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4387546 |
End bp | 4389741 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641369742 |
Product | TonB-dependent receptor plug |
Protein accession | YP_001641393 |
Protein GI | 163853350 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.478156 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.620506 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGGCG GACGTTCAGG ACGGATGCTG CTGGCGGCGG GGGCAGTGCT GACGGGCGTG CCCGGGACCG TACTCTCGGC GCAGGCGCAG GAAGCGGCCA CGCTGGAGGA GATCAGCGTG ACCTCGGTGA GCCCGATCCA GGGTCGGCCC GCCGCTCCGG CGGCTCCCTC CCCCGCCGCG CCGTTCGCCC GGGCGGCCGA GGTGCTGCCG GTGGTGACGA ACACCTTCTC GCCGGTCACG GTGGTGCCGC AGGAGCGGAT CGCCCGCGAC CAGCCGCGCA CGCTCGGCGA CGCGCTGTTC GACCGGCCCG GCATCTCCGC CTCGACCTAC GCGCCGGGGG CGGCCTCGCG GCCGATCATC CGCGGCCTCG ACAACGCCCG GGTGCGCATC CAGGAGAACG GCATCGTCAA CGGCGGCGTG TCGGATCTCG GCGAGGATCA CGCGGTGCCG GTCAACCCGC TCAACGCCAG CCGGATCGAG GTGATCCGCG GCCCCGCGAC CCTGCGCTAC GGCTCGGGGG CGATCGGCGG CGTGGTCTCG GCCGAGAACA ACCAAGTGCC GACCTTCATC CCCGCCCGCG GGATCACGGG GCAGGTCACG ACCGGCTACA CCACCGTCGA CAACGGCCGG CTCGGAGCGG CGAGCGTCGA TGCGGGCGCG AACGGCATCG CGGTCCACGC CGACGGGTTC GCGACCGCGA CCGATTCCTA CGCGATCCCC GGCGGCATCC AGCGCAACTC GGCGACCGAG ACGCAAGGGG GCTCGGTCGG CATCTCGGCG ATCGGCGACC GCGGCTTCCT CGGCATCTCC TACAGCCACT TCAACGCGCT CTATCAGATC CCCGGCGGCG AGGCGGCGGA AGCGCGCACC CGGCTCAACC CGAACCAGGA CCGCGTGCTC GCCCGCGGCG AATACCGCCC GCTGGAAGGC CCGTTCGAGG TGCTGCGGTT CTGGGCCGGC GGCTCGGTCT ACCGCCACGA GGAGATCGGC TTCGGGCACG CCCATCACGA TCACGGGGAC GATGACGACC ACGCGCATGA GGGCCCGGCC GGCGAGGGCG TGCAGGCGAT CTTCAAGAAC CGCGAGGTCG AAGCCCGGTT CGAGGCGCAG CACGTGCCGG TCTTCACCGC GCTCGGCACG CTCACCGGCG CGGTCGGCGT CCAGACGAGC CGGCGGGTGC TCAACTCGCA GCTCGAGAGC TTCCTGCCGC CGACCGAGTC GCGGGTGCTG GCCGCCTACC TGTTCGAGGA ACTGGCGGTC GGCGGGGGGC TCCGGTTCCA GGCCGCCGGC CGGATCGAGG GCGACCGGCT CAACAGCATC GCGACGCAGT TCCCCGGCAG CTACCTGCCG ACGGACGGGG ACCCGCTCAG CTATGCCCTG ACGCGCCGCT TCGCTCCCAA GAGCCTCAGC TTCGGCGCCT TGCAGGATCT GCCCTACGGC TTCGTGGCGA GCCTCAACGG CTCCTATGTC GAGCGTGCGC CCACCGGCGC CGAGCTGTTC TCGCAGGGGC CGCACCACGC CTCGGCGACC TTCGAGATCG GCGATCCGAC CCTGCGGCTG GAGCGCGCCC GCACCGCCGA GATCTCCTTG CGCCGTGCGG ACGGGCCGCT GCGGCTCGAC GCCACCGGCT ACGTCACGCG CTACACCGGC TTCATCTACC GGCGCGACAC CGGGAACCGC TGCGACGACG ATTTCGGCTC CTGCGGCTCC GGCGACGAGC TGCGCCAGGT CGTCTACTCG CAGGCCAATG CCAGCTTCTA CGGCGCCGAG ATCGGCGCGC AGCTCGATCT CTTCGCCGTC GGCGACGGCT GGGCCGGAAT CGAGGCGCAG TACGACTTCG TCCGCGCCCA GTTCGACGAC GGCTCCTACG TCCCGCGCAT CCCGCCGCAC CGGCTCGGCG GCGGCGCCTT CGTGCGGGCC AACGGCTGGT TCGCGCGGGT GAACCTGCTC CACGCCTTCG ACCACACCGA GATCGCCCCG TTCGAGACCA CGACGCCGGG CTGGAACGAT CTTCGCGCCG AACTCGCCTA CACCCAGGCG CTCGACCCGA CGGTCTACGG CGCCACCGAG GTGACGCTGG GGCTGCAAGG CCGCAACCTG CTCGATGACG ACATCCGCAA CTCGGCCTCG TTCAAGAAGG ACGAGATCCT GCTGCCGGGC CGCAACCTCC GCCTGTTCCT GACGGCACGA TTCTGA
|
Protein sequence | MRGGRSGRML LAAGAVLTGV PGTVLSAQAQ EAATLEEISV TSVSPIQGRP AAPAAPSPAA PFARAAEVLP VVTNTFSPVT VVPQERIARD QPRTLGDALF DRPGISASTY APGAASRPII RGLDNARVRI QENGIVNGGV SDLGEDHAVP VNPLNASRIE VIRGPATLRY GSGAIGGVVS AENNQVPTFI PARGITGQVT TGYTTVDNGR LGAASVDAGA NGIAVHADGF ATATDSYAIP GGIQRNSATE TQGGSVGISA IGDRGFLGIS YSHFNALYQI PGGEAAEART RLNPNQDRVL ARGEYRPLEG PFEVLRFWAG GSVYRHEEIG FGHAHHDHGD DDDHAHEGPA GEGVQAIFKN REVEARFEAQ HVPVFTALGT LTGAVGVQTS RRVLNSQLES FLPPTESRVL AAYLFEELAV GGGLRFQAAG RIEGDRLNSI ATQFPGSYLP TDGDPLSYAL TRRFAPKSLS FGALQDLPYG FVASLNGSYV ERAPTGAELF SQGPHHASAT FEIGDPTLRL ERARTAEISL RRADGPLRLD ATGYVTRYTG FIYRRDTGNR CDDDFGSCGS GDELRQVVYS QANASFYGAE IGAQLDLFAV GDGWAGIEAQ YDFVRAQFDD GSYVPRIPPH RLGGGAFVRA NGWFARVNLL HAFDHTEIAP FETTTPGWND LRAELAYTQA LDPTVYGATE VTLGLQGRNL LDDDIRNSAS FKKDEILLPG RNLRLFLTAR F
|
| |