Gene Information |
Locus tag | Mpe_A0114 |
Symbol | ssuA |
ID | 4784516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 118148 |
End bp | 119143 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640088661 |
Product | sulfonate binding protein |
Protein accession | YP_001019311 |
Protein GI | 124265307 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
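The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mpe_A0114 (ssuA).
length_bp = gene_length(118148, 119143)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 996 331
```

Both results match the Gene Length (996 bp) and Protein Length (331 aa) fields.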
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.802065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00280198 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCTTCC ATCGACATGC GTTGTTCGCC GGCGTGCTGG CGCTGGCCAC CGGGCTGCTG GGGTTCGCGC CGGGGGCGCA CAGCCAGCCG GCCGCACCGA AGGAGATCCG CATCGGCTTC CAGAAGAGCG CCGTCAACCT GGTGATCCTC AAGCAGCAGG GTGCGCTGGA GAAGCGCTTT CCCGACAGCA AGGTGTCGTG GATCGAGTTC CCGGCCGGGC CGCAGCTGCT GGAGGCACTG GCGGTCGGCA GCCTGGAGAT CGGTCTGACC GGCGACTCGC CGCCGGTGTT CGCGCAGGCG GCCGGCAAGG ACCTGCGCTA CGTCGGCGCC GAGCCGCCCA AGCCGCAGAG TTCGGCCATC CTCGTGAAGC CCGACTCGCC GCTGCGCACG CTGGCCGACC TGAAGGGCAG GAAGGTCGCG TTCCAGAAGG GCTCCAGCGC GCATTACCTC GTGGTGCGCG CGCTGGCGCA GGCCGGGCTG CAGTGGAGCG ACATCACGCC GATCTACCTG CCGCCGTCGG ACGCGCGTGC CGCCTTCGAG CGCGGCAGCG TCGACGCCTG GGCCATCTGG GACCCCTACT ACGCCGCGAC CGAGCTCGAC ATCCAACCGC GCGTGCTGAG CAATGGCGTG GGCCTGTCGG GCAACAACTC CTTCTACCTG GCATCGACCG CGTTCACGCA GAACCACCCG CAAGCGGTGC AGGTCCTGCT CGACGAGCTG ACGCGGGCCG ATGCCTACGT GCAGTCGCAC CGCAAGGAGT CCGCGCAGTT CATCGCCGAC TTCAGCGGCC TGAGCCTGGC GACGGTGCAC CTGTTCATTT CGCGCCGCCC GCCATCGCCG GTGAAGCCGC TGTCGCCGGC GCTGGTGGCC GACCAGCAGC GTGTGGCCGA TGCCTTCCAG CAGCTCGGGC TGATCCCCAA GCCGGTGGCG GTGGCCGAGA TCGTGTGGCA GCCCGGCGCC CCGGGGGCGG CGCGCCTCGC GAACGCCGCC CGCTGA
|
Protein sequence | MSFHRHALFA GVLALATGLL GFAPGAHSQP AAPKEIRIGF QKSAVNLVIL KQQGALEKRF PDSKVSWIEF PAGPQLLEAL AVGSLEIGLT GDSPPVFAQA AGKDLRYVGA EPPKPQSSAI LVKPDSPLRT LADLKGRKVA FQKGSSAHYL VVRALAQAGL QWSDITPIYL PPSDARAAFE RGSVDAWAIW DPYYAATELD IQPRVLSNGV GLSGNNSFYL ASTAFTQNHP QAVQVLLDEL TRADAYVQSH RKESAQFIAD FSGLSLATVH LFISRRPPSP VKPLSPALVA DQQRVADAFQ QLGLIPKPVA VAEIVWQPGA PGAARLANAA R
|
| |