Gene Mpe_A0918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0918 
SymbolsulP 
ID4787300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp973248 
End bp974981 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content71% 
IMG OID640089479 
Productsulfate transporter 
Protein accessionYP_001020115 
Protein GI124266111 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAACC GGCGCAGCCT GCGCGCGGAC CTGCTCGCCG GACTCACGGG CACCATCATC 
CTGGTGCCGC AGGCGGTCGC CTACGCCAGC ATCGCCGGGC TGCCGCCGGC CTATGGCCTC
TACACCGCGA TCGTGCCGGT GATCGTGGCC GCGCTGTTCG GCTCGTCGCT GCATCTCGTG
TCCGGCCCCA CCGCCGCGTT GTCGATCGTC ATCTTCGCGA CGCTCAGCCC GCTGGCCGAG
CCGGGCAGCG CGGCCTACAT CCAGCTCGCG CTGAGCCTCA CCTTCATGAC GGGGCTGCTG
ATGCTGGCGA TGGGGCTGGC GCGCCTCGGC GTGCTGGTGA ACTTCATCTC GCACAGCGTG
GTGATCGGAT TCACCGCCGG CGCCGCGGTG CTGATCGCCA CCAGCCAGCT GAAGAACTTC
TTCGGCATCA CCGCGCCGGC CAGCGCCTCG TTCATCGAGA CCCTGCGCCT GTTCGTGCAG
CGCCTGCCGG ACACCAACGT CCACGTGCTG AGCGTGGGCA TCGTCACGCT GCTGGCCGCG
GTGGGCACGC GCACCTGGCT GCCGCGCGCG CCGCACATGA TCGTCGCCAT GGCGGTCGGC
AGCCTGCATG CGCTGGCGCT GACCGCGCTG TTCGGCCCGC AGACGGGCAT CGCGATGGTG
TCGGCCATCC CGCGCAGCCT GCCGCCGCTG TCGATGCCGA TCCCTTCGGG TGAGACCCTG
AGGCAGCTCG CACCGATCGC GCTGGCGCTC GCGATGCTCT CGCTCACCGA GGCGGTGGCC
ATCGCGCGCG CGATCGCGCT CAAGTCGGGG CAGCGCATCG ACAGCAGCCA GGAGTTCATC
GGCCAGGGCC TCGCCAACGT GGTCGGCAGC TTCGCCTCGA GCTATGTCTC CAGTGGCTCG
TTCACGCGCA GCGGGGTCAA CCACACAGCC GGCGCGAAGA CGCCGCTGGC GCCGGTGTTC
TCGGCGCTGT TCCTGGTGCT GACGCTGGTG GCGCTGGCGC CGCTGGTGCG CTACCTGCCG
ATCGCGTCGA TGGCGGCGAT CCTGCTCGTC GTCGCGTACT CGCTGGTCGA CGTGCACCAC
ATCCGCGGCA TCCTGCGGAC CAGTCGCGCC GAGGCTGCGG TGCTGGCGGC GACGTTCCTC
GCCACCCTGT TCCTGCACCT CGAGTTCGCC ATCTACGTCG GCGTGCTGCT GTCGCTGATG
GTGTTCCTGG AGCGCACCGC CCGGCCGGAG ATCCGCGACG CGGTCCCGGC GCCGGGCGCG
CACAGCTACC ACTTCGTGCC GCAGACCGAC GAGCCCGACT GCTGCCAGCT GAAGATGGTC
TTCATCGACG GGCCCATCTA CTTCGGCGCC GTCGATCACG TGCAGCGCCG GCTGCGCGAC
ATCGACGCGG CCGATCCCGG CCACAAGCAC CTGCTCGTGC TGGCGCCGGG CATCAACTTC
ATCGACAGCT CGGGGGCCGA GCTGCTGGGC CAGGAGGCGC GCCGACGGCG CCAGCTCGGC
GGCGGCCTGT ACTTCCACCG CCTGCACCCG TCGGCCGTCG ACGTGCTGGC GCGCTCCGGC
CACCTCGACG CCATCGGCCG CGAGAACCTG CATGCGATCG GCAGCAACGT GATCGACGCC
CTCTACCCGC GGCTGGACCC GCAGATCTGC CGCCGCTGCC CTGCCCGCAT CTTCAGCCAG
TGCCAGCGCA CGCTGCCCGA CGGCACGCCG CGCGAGCCGG CCGGCACCCC ATGA
 
Protein sequence
MVNRRSLRAD LLAGLTGTII LVPQAVAYAS IAGLPPAYGL YTAIVPVIVA ALFGSSLHLV 
SGPTAALSIV IFATLSPLAE PGSAAYIQLA LSLTFMTGLL MLAMGLARLG VLVNFISHSV
VIGFTAGAAV LIATSQLKNF FGITAPASAS FIETLRLFVQ RLPDTNVHVL SVGIVTLLAA
VGTRTWLPRA PHMIVAMAVG SLHALALTAL FGPQTGIAMV SAIPRSLPPL SMPIPSGETL
RQLAPIALAL AMLSLTEAVA IARAIALKSG QRIDSSQEFI GQGLANVVGS FASSYVSSGS
FTRSGVNHTA GAKTPLAPVF SALFLVLTLV ALAPLVRYLP IASMAAILLV VAYSLVDVHH
IRGILRTSRA EAAVLAATFL ATLFLHLEFA IYVGVLLSLM VFLERTARPE IRDAVPAPGA
HSYHFVPQTD EPDCCQLKMV FIDGPIYFGA VDHVQRRLRD IDAADPGHKH LLVLAPGINF
IDSSGAELLG QEARRRRQLG GGLYFHRLHP SAVDVLARSG HLDAIGRENL HAIGSNVIDA
LYPRLDPQIC RRCPARIFSQ CQRTLPDGTP REPAGTP