Gene Information |
Locus tag | Mpe_A0425 |
Symbol | sulP |
ID | 4785415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 460464 |
End bp | 462269 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640088983 |
Product | sulfate transporter |
Protein accession | YP_001019622 |
Protein GI | 124265618 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
|
|
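The coordinate and length fields in the table above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 460464
end_bp = 462269
reported_gene_length = 1806      # bp
reported_protein_length = 601    # aa

# Assumption: Start bp and End bp are 1-based, inclusive coordinates,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print("gene length matches:", gene_length == reported_gene_length)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions the reported gene length and protein length are consistent with the listed coordinates.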
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCAC CCGCTGCGCC GGCTGCCGGC CCGCCCTGGC TACAACGCTG CTTCGGTCCG TGGGTCAGGC TCGTGTCGCG CGAGACGCTG CGCGCCGACC TGCTGGCCGG CCTGCTGGGC GCCGTGCTGG TACTGCCTCA GGGCATCGCC TTCGCCTCGC TCGCCGGCCT GCCGCCGCAG TACGGCCTGG CCACCGCCAT CCTGCCGTGC ATCGTCGCGG CGCTGTTCGG CTCCAGCCTT CACGTGATGT CGGGGCCGAC GAACGCGAAC TCGCTGGCGC TGGCGGCCAT GCTCACGCCG CTCGCCTGGG TGCGCAGCCC CGACTACATC GAGCTCGCGC TGACCGTCAC GCTGCTGGTC GGCGTGATGC AGACCCTGAT CGGCGCGCTG CGCCTGGGCA GCATCGCCAA CTTCATCTCG CCGGCGGCCT TGCTCGGCTT CACCGCCGGC GCCTCGGTGC TGATTGCGCT GCATGCCCTG CCCGACCTGC TGGGCATGAG TTCCGGCACC GGGTTGAGGC CGATGCTCGA GGCCCTCTGG CAGCGGCCGC TCGAGGTGGT GCACCTCGGT TCGCTGGTCG TCGGCGTGGT GGCCCTCGCG GTCACGCTCG CCGTGCGGCA CTGGCAGCGG CGCTGGCCCG CGCTTCTGCT GGGGCTGGCG GCCGGCACGC TGGTCGCGGT GCTGCTGAAC GCCGGCCATG AGGACGGCAC GTTCTGGCAC GTCGAGCAGA TCGGTGAGGT GCCGCTGCCC TGGCCCCGCT GGCACTGGCC CGACATCGAC ATCTCGCGGC TGCGCGACCT GGTCAGCATC GCCTTCGCCC TGACGCTGGT GGCGCTCGCC CAGTCGATCT CGATCGCCAA GGCCGTGGCC GCCCGCTCGG GCCAGCGCAT CGACGCCAAC CGCGAGTTCC TCGGCCAGGG CCTGTCCAAC GTGGTGGGAG GCCTGACCTC CGCCTATGTC TCCTGCGGAT CGCTGAACCG TTCGATCCCC AACCTCGAGG CCGGTGCGCG CACGCCACTG GCTTCGGTGT TCTCGGCCGG GCTGCTGCTG TTGCTGGTGC TGGTGAGCGC CCCGCTGCTG GCGCTGATCC CGAACGCGGC GATCGCCGCG GTGCTGCTGC CGGTCGCCTG GAACCTGCTC GACCTGCCCG GCTGGCGGCG CCTGATGCGG CTAGAACGCA GCGACTTCGC GATCGCTGCG GCCACCGCGG TGGCCACCGT GAGCCTCAGG CTCGAGATCG CGATACTGCT CGGCAGCATC CTGTCGCTGA GCAGCTACCT GCAACGCACG GCGCGGCCGG CCATGCGCAC GATGGGGTTC GATTCCGTGG CGCCGGACCG CCCCTTCGTC GTGCTCGACG GCCAGACCGA GGCGCTGCCG GAGTGCCCGC AGCTCAAGCT GCTGCGCATG GAGGGGGCGG TCTACTTCGG CGCCGCCCAG CACGTGTCGG ACACCCTGCA CCACCTGCGC GCGGCGCCCG CCGCGCCGCG CCACCTGCTG GTCATGAGCA AGAGCATGAA CTTCATCGAC CCGGCCGGGG CCCAGGTCTG GGACGAGGAG CTGCGCGCCC GCCGGGCCGA CGGCGGCGAC CTCTATTTCC ACCGCCCGCG CCCGCCGGTG CTGGAGCTGT GGGAACGCTC GGGCTTCCTC GACGCGCTCG GTCGGGACCA TGTGTTTTCC GACAAGCACA GTGCAATCGC ACGGATCGTG CCGCGCCTCG ACCCGGCGAT CTGCGTCGGC TGCAAGGTCA GGATCTTCGG CGAGTGCGCC CGCCAGCCCG GCGCGCCGCC GGCGCCGGAG 
ATCTGA
|
Protein sequence | MSSPAAPAAG PPWLQRCFGP WVRLVSRETL RADLLAGLLG AVLVLPQGIA FASLAGLPPQ YGLATAILPC IVAALFGSSL HVMSGPTNAN SLALAAMLTP LAWVRSPDYI ELALTVTLLV GVMQTLIGAL RLGSIANFIS PAALLGFTAG ASVLIALHAL PDLLGMSSGT GLRPMLEALW QRPLEVVHLG SLVVGVVALA VTLAVRHWQR RWPALLLGLA AGTLVAVLLN AGHEDGTFWH VEQIGEVPLP WPRWHWPDID ISRLRDLVSI AFALTLVALA QSISIAKAVA ARSGQRIDAN REFLGQGLSN VVGGLTSAYV SCGSLNRSIP NLEAGARTPL ASVFSAGLLL LLVLVSAPLL ALIPNAAIAA VLLPVAWNLL DLPGWRRLMR LERSDFAIAA ATAVATVSLR LEIAILLGSI LSLSSYLQRT ARPAMRTMGF DSVAPDRPFV VLDGQTEALP ECPQLKLLRM EGAVYFGAAQ HVSDTLHHLR AAPAAPRHLL VMSKSMNFID PAGAQVWDEE LRARRADGGD LYFHRPRPPV LELWERSGFL DALGRDHVFS DKHSAIARIV PRLDPAICVG CKVRIFGECA RQPGAPPAPE I
|
| |
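The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method: it strips the spacing, computes a GC fraction, and translates the first codons of the gene to compare them with the start of the protein sequence. The function names are hypothetical, and the codon table is deliberately partial; it contains only the codons needed for the two short fragments used here, not all of translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and uppercase the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Partial codon table (assumption: standard assignments, which translation
# table 11 shares for these codons). Only the codons used below are listed.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "TCC": "S", "TCA": "S", "CCC": "P", "CCG": "P",
    "GCT": "A", "GCG": "A", "GCC": "A", "GGC": "G",
    "ATC": "I", "TGA": "*",
}


def translate_prefix(seq: str, table: dict) -> str:
    """Translate codon by codon, stopping at the first codon missing from the table."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if codon not in table:
            break
        residues.append(table[codon])
    return "".join(residues)


# First three blocks and the final blocks of the gene sequence, as printed above.
gene_start = clean_sequence("ATGTCCTCAC CCGCTGCGCC GGCTGCCGGC")
gene_end = clean_sequence("GGCGCCGGAG\nATCTGA")

print(translate_prefix(gene_start, PARTIAL_CODON_TABLE))    # compare with the protein's first ten residues
print(translate_prefix(gene_end[-6:], PARTIAL_CODON_TABLE))  # last sense codon followed by the stop codon
print(round(gc_fraction(gene_start), 2))                     # GC fraction of this fragment only
```

Applying `clean_sequence` and `gc_fraction` to the full gene sequence would be the way to compare against the reported GC content; a complete codon table (for example from a sequence-analysis library) would be needed to translate the whole CDS.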