Gene Information |
Locus tag | Mpe_A3200 |
Symbol | tonB |
ID | 4786539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3404370 |
End bp | 3405260 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640091773 |
Product | periplasmic protein/ biopolymer transport |
Protein accession | YP_001022388 |
Protein GI | 124268384 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain |
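
The coordinate and length fields above agree with one another, which a little arithmetic confirms. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3404370, 3405260)
print(length_bp, protein_length(length_bp))  # 891 296
```

Both values match the Gene Length (891 bp) and Protein Length (296 aa) rows of the record.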
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.917881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0459006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGCTGA TTCCGCAACG CTGGGCAGAG CGCCTCCGGC GACTCACCGC ACTGCACGTC GCGCTGCTGA TCTCCTTCGG TGTTCACGCG GTGCTGCTGA CGGTGCGCTT CGTCGATCCG GAAGGCTTCA ACCGCGTGTT CAAGGACACG CCGCTCGAGG TCATCCTGGT CAACGCGCGC TCGACCGAGC CGCCGCTGAA GGCGCAGGCG ATCGCACAGG CCGCGCTGGC GGGCGGCGGC GAGGCCCAGG CCGGCCGCGC CACCTCGCCG CTGCCACCGT CCGCCCTGAC CGAGGTCGGC GATTCGCTGG AAGACGCGCG CCAGCAGATC CAGTCGCTGC AGGAGCAGCA GATGCAGCTG CTGGCGCAGA TCCGCCGCGA GCTGGCGCGC CTGCCGCCGC CCGACCCGCA ACGCGACGAG GGGAGCCCGG ACGCCCGCGC CCAGGCCGAA CAGCGCCAGC AGATGCTGCA GATGCTGGCC GAGATCGAGA AGCGCATCAA CGACGCCAAT GCCCGTCCGA AGAAGCGCTA CGTGAGCCCG TCGACCAGCG AGGCGGTGTA CGCCGTCTAC TACGACGTGC TGCGCCGCAA GATCGAGGAA CGCGGCACCC TGAACTTCCC CGAGGACAAG GGCCGCAAGC TCTACGGCGA GCTGACGATG ATCGTCACCG TCGACGCGCT CGGCCAGGTG CTCGACACGG AGATCGTCCA GAGCTCGGGC CAGCGCACGC TCGATCGTCG CGCTCAGGCC ATCGTGCGCG CGGCATCGCC GTTCGGCCCC TTCACCGACG CGATGCGCAA GCAGGCCGAC CAGATCGTCG TCGTGTCGCG CTTCCGCTTC ACGCGAGACG AGGGCTTCGA GACGCGGCTC CTGCAGTCGT CGCCGCAATG A
Protein sequence | MKLIPQRWAE RLRRLTALHV ALLISFGVHA VLLTVRFVDP EGFNRVFKDT PLEVILVNAR STEPPLKAQA IAQAALAGGG EAQAGRATSP LPPSALTEVG DSLEDARQQI QSLQEQQMQL LAQIRRELAR LPPPDPQRDE GSPDARAQAE QRQQMLQMLA EIEKRINDAN ARPKKRYVSP STSEAVYAVY YDVLRRKIEE RGTLNFPEDK GRKLYGELTM IVTVDALGQV LDTEIVQSSG QRTLDRRAQA IVRAASPFGP FTDAMRKQAD QIVVVSRFRF TRDEGFETRL LQSSPQ
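
The protein sequence can be derived from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for this gene (the gene starts with ATG, so alternative start codons do not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical. Whitespace is stripped first because the record prints sequences in 10-character blocks.

```python
# Codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGAAGCTGA TTCCGCAACG CTGGGCAGAG"))  # MKLIPQRWAE
```

Passing the full gene sequence to `translate` and `gc_content` is a way to cross-check the Protein sequence and GC content fields of the record.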