Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0122 |
Symbol | cysP |
ID | 4784524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 124475 |
End bp | 125488 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640088669 |
Product | thiosulfate binding protein |
Protein accession | YP_001019319 |
Protein GI | 124265315 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.964738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000712106 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCCGTT TCGCTTCCGT CCGTGGCGCG CTCGCCGCCT CGATCCTTGT TTTGGCCGCC GGTGCCGCCG CCGCCAAGGA CGTGACCCTG CTCAATGTCT CCTACGACCC GACGCGCGAG CTCTACGTCG AGTACAACGC CGCCTTCGCC AAGTACTGGA AGGGCAAGAC CGGCGACAAC GTGACCGTGA AGCAGTCGCA CGGCGGCTCG GGCAAGCAGG CTCGCTCGGT GATCGACGGC ATCGACGCCG ACGTGGTGAC GCTGGCGCTG GCCTACGACA TCGACGAGAT CGGCGAGAAG GCCAAGCTGC TGCCGGCCGA CTGGCAGAAG CGCCTGAAGC ACAACAGCTC GCCCTACACC TCCACCTACA TCTTCCTGGT GCGCAAGGGC AACCCGAAGG GCATCAAGAA CTGGGACGAC CTGGTGAAGC CGGGCGTGTC GGTGATCACC GCGAACCCCA AGACCTCGGG TGGCGCCCGC TGGGGCTACC TGGCGGCCTA CGGCTTCGCG CTCAAGCAGC CCGGAGGCGA CGATGCCAAG GCGCGCGAGT TCGTCGGCAA CCTGTTCAAG AACGTGCCGG TGCTGGATTC CGGCGCCCGC GGCTCCACGG TGACCTTCGC CGAGCGTGGT ATCGGTGACG TGCTGCTGGC CTGGGAGAAC GAGGCTCACC TTTCGCTGAA GGAGTTCGGC GTCGACAAGT TCGACATCGT CTACCCGCCG CTGAGCATCC TGGCCGAGCC GCCGGTGACG GTGGTCGACA AGGTGGTCGA CAAGAAGGGC ACCCGCGACG TGGCCCAGGC CTACCTCGAG TACCTGTACA CCGCGGAGGG CCAGGAAATC GCCGCGCGCA ACTTCTACCG GCCGATCGAC GAGAAGGTCG CGGCGAAGTA CGCGAAGAAC TTCCCGAAGG TCAACCTGTT CACCATCGAC GAGGTGTTCG GCGGGTGGGC CAAGGCGCAG AAGACCCACT TCGCCGACGG CGGTGTGTTC GATCAGATCT ATACGAAGAA GTAA
|
Protein sequence | MTRFASVRGA LAASILVLAA GAAAAKDVTL LNVSYDPTRE LYVEYNAAFA KYWKGKTGDN VTVKQSHGGS GKQARSVIDG IDADVVTLAL AYDIDEIGEK AKLLPADWQK RLKHNSSPYT STYIFLVRKG NPKGIKNWDD LVKPGVSVIT ANPKTSGGAR WGYLAAYGFA LKQPGGDDAK AREFVGNLFK NVPVLDSGAR GSTVTFAERG IGDVLLAWEN EAHLSLKEFG VDKFDIVYPP LSILAEPPVT VVDKVVDKKG TRDVAQAYLE YLYTAEGQEI AARNFYRPID EKVAAKYAKN FPKVNLFTID EVFGGWAKAQ KTHFADGGVF DQIYTKK
|
| |