Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0113 |
Symbol | ssuA |
ID | 4784515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 117100 |
End bp | 118137 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640088660 |
Product | sulfonate binding protein |
Protein accession | YP_001019310 |
Protein GI | 124265306 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00282136 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCATT CGGATTCTTC TTCCTCGTCC CGCTGGAGAC GGCTGCCGCA GTCGGCCCGT CGCCACCTGC TGCAGGCCTT CGGCGCCGCC GCCGGCGCCG CGGCGCTCGG CACCTGGCCG CTGGCGCGTG CACAGTCCTC GGGCGCGGCG CGCGAGCCGC TGCGCGTGGG CTACCAGAAG TCGGCCAGCC TGTTCGTGCT GCAGAAGGCG CAGGGCTCGC TGGAGAAGAA GCTCGCGCCG CTGGGCGTGG GTGTGAAGTG GATCGAGTTC CCGGCCGGGC CGCAGTTGCT GGAAGGCCTG AACGTCGGCT CGGTCGACAT CGGCCATGTG GGCGAGGCGC CGCCGATCTT CGCGCAGGCG GCCGGCGCCG ACTTCGTCTA CATCGGCCAC GACCCGGCCG CGCCGGAGGC CGAGGCCATC GTCGTGCCGC AGGGCTCGGC GATCAGGAGC GTGGCCGAGC TCAACGGCAG GAAGGTCGCG CTGAACAAGG GCTCGAACGT GCACTACCTG CTGGTGCGCG CGCTCGAGAA GGCCGGCCTG AAGTACGCCG ACATCCAGCC GGTCTTCCTG CCGCCGGCCG ATGCGCGTGC CGCCTTCGAG AAGGGCGCGG TCGATGCCTG GGCGATCTGG GATCCCTTCC TCGCCGCGGT CGAGAAGCAG ACCGGTGCGC GCGTGCTGGT CGACGGCCGC AACGGCGTCG CCAACAACTA CCTGTTCTAC CTGGCCGAGC GCAAGTTCGT GCAGAAGAAC GGCGACGTGA TCCAGGCGCT GTTCGCCGAT TCGCAGGAGC AGGGCCGCTG GCTGAAGGCC GACTTGAAGC GCGCCGCGGC GATCATTGCG CCACTGCAGG GCCTGGACCC GGAGATCGTC GAGCTCGCGC TGCGCCGCTA CAACTTCAAT GTCACGCCGC TCAGCGAGCA GGTCGCGGCG CAGCAGCAGC AGATCGCCGA CGTGTTCCAC GAGCTCAAGC TGATCCCCAA GCCGATCCGC GTGGCCGACG CGCTGCCCGC GGTGCGCGTC GCGCAGAAGC AGCCCTGA
|
Protein sequence | MSHSDSSSSS RWRRLPQSAR RHLLQAFGAA AGAAALGTWP LARAQSSGAA REPLRVGYQK SASLFVLQKA QGSLEKKLAP LGVGVKWIEF PAGPQLLEGL NVGSVDIGHV GEAPPIFAQA AGADFVYIGH DPAAPEAEAI VVPQGSAIRS VAELNGRKVA LNKGSNVHYL LVRALEKAGL KYADIQPVFL PPADARAAFE KGAVDAWAIW DPFLAAVEKQ TGARVLVDGR NGVANNYLFY LAERKFVQKN GDVIQALFAD SQEQGRWLKA DLKRAAAIIA PLQGLDPEIV ELALRRYNFN VTPLSEQVAA QQQQIADVFH ELKLIPKPIR VADALPAVRV AQKQP
|
| |