Gene Information |
Locus tag | Mpe_A0123 |
Symbol | cysP |
ID | 4784525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 125660 |
End bp | 126673 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640088670 |
Product | thiosulfate binding protein |
Protein accession | YP_001019320 |
Protein GI | 124265316 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
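
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(125660, 126673)
print(length_bp)                  # 1014
print(protein_length(length_bp))  # 337
```

Under these assumptions the computed values agree with the Gene Length (1014 bp) and Protein Length (337 aa) fields.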
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000442607 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCTCTCC TGCCTTCGTC GCTGCGGCGC GCACTGATCG TTTCTCCCCT GTTGCTGGCC GCCGGCTGGG CCACGGCCGC CGGGCCGACG CTGCTGAACG TCAGCTACGA CGTGGCGCGC GAGTTCTACA AGGAGATCAA CCCGGCCTTC CAGAAACAAT GGAAGCAGGC CAGCGGCGAG GACGTGACGA TCAACCAATC CCACGGCGGC TCGAGCAAGC AGGCCCGCGC GGTGATCGAC GGTCTGGAGG CCGACGTCGT GACGATGAAC CAGTCGAGCG ACATCGACGC CATCGCCACC CGCGGCAAGC TGATCCCCGA AGACTGGGCG AAGCGGCTGC CGAACAACGC CGCGCCCACC ACCTCGGTCA CGGTGATCCT GGTGCGCAAG GGCAACCCGA AGAAGATCGC CGACTGGCCC GATCTCGCCA AGCCCGGCGT CAGCGTTGTG ATCCCGAATC CCAAGGTCAC CGGCAACGGT CGCTACAGCT ATGTGGCCGC CTGGGGCTCG GCCATCAAGG CCGGCGGTAC GCCGGCCCAG GCGCGCGAGC TGGTGCAGAA GATCTTCGCC AACGTGCCGG TGTTCGACGG TGGCGGCCGT GCGGCGACCA CCACCTTCGC ACAGCGCAAC ATCGGCGACG CGCTGGCGAC CTTCGAGTCG GAGGTCAACC TGATCAAGGC CGAGTTCGGC GACAACTTCG ACGTGGTCTA CCCCAAATCG ACCATCCTGG CCGAGAACCC GGTGAGCGTG ATCGACACCG TGGTCGACAA GAAGGGCACG CGCAAGCAGG CCGAGGCCTA CCTGAAGTTC CTGTGGTCCG ACGAGGCCCA GGAGATCGCC GCCAAGCACC AGTTGCGCCC GCGCTCGCCC GCGCTGCTGG CGAAGTACGC GAAGGACTTC CCGAAGGTCA ACACCTTCAC GATCGACGAG ATCTTCGGCG GCTGGAGCAA GGCGCAGAAG GAGCACTTCG ACGACGGCGG GATCTACGAC CAGATCCTGG CGGCGCGCAA GTGA
Protein sequence | MSLLPSSLRR ALIVSPLLLA AGWATAAGPT LLNVSYDVAR EFYKEINPAF QKQWKQASGE DVTINQSHGG SSKQARAVID GLEADVVTMN QSSDIDAIAT RGKLIPEDWA KRLPNNAAPT TSVTVILVRK GNPKKIADWP DLAKPGVSVV IPNPKVTGNG RYSYVAAWGS AIKAGGTPAQ ARELVQKIFA NVPVFDGGGR AATTTFAQRN IGDALATFES EVNLIKAEFG DNFDVVYPKS TILAENPVSV IDTVVDKKGT RKQAEAYLKF LWSDEAQEIA AKHQLRPRSP ALLAKYAKDF PKVNTFTIDE IFGGWSKAQK EHFDDGGIYD QILAARK
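
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC fraction and a start/stop codon check. The helper names are hypothetical, and the start codon set is only the common subset (ATG, GTG, TTG) of those permitted by translation table 11, which is an assumption made for brevity.

```python
# Common start codons for translation table 11 (subset, assumed for this sketch)
# and the three standard stop codons.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(raw: str) -> bool:
    """True if the length is a multiple of 3 and the CDS begins and ends as expected."""
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Small illustrative input, not the full gene sequence.
print(clean_sequence("ATGTCTCTCC tga"))  # ATGTCTCTCCTGA
print(gc_fraction("ATGC GGCC"))          # 0.75
```

Passing the full Gene sequence field to `gc_fraction` and `has_start_and_stop` would allow a comparison against the GC content and Type fields of the record.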