Gene Mpe_A1160 details

Gene Information

Locus tag: Mpe_A1160
Symbol: sulP
ID: 4785735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1239344
End bp: 1241008
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 71%
IMG OID: 640089723
Product: sulfate transporter
Protein accession: YP_001020356
Protein GI: 124266352
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00377] anti-anti-sigma factor
            [TIGR00815] high affinity sulphate transporter 1
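
The length fields above follow from the coordinates. A minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 1239344
end_bp = 1241008

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# so it contributes no amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

The result agrees with the Gene Length (1665 bp) and Protein Length (554 aa) fields.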


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.175133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.549813
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTGC ACCGCTTCCG CCCGCGCCTG CTCGATGCGC TGAAGACCTA CGACCGCGGC 
CGCTTCGCAG CCGACGCCGG CGCCGGCCTG ACGGTCGGCG TGATCGCACT GCCGCTGGCG
ATGGCCTTCG CGATCGCCAG CGGCGTCAAG CCCGAGCAGG GTATCTTCAC CGCGATCATC
GGCGGCTTCC TGGTGTCGGC GCTCGGTGGC TCCAGTGTGC AGATCGCCGG GCCGGCAGGC
GCCTTCATCG TCATCGTCTA CGGCATCGTC GAGCGCTACG GCCTGGCCAA CCTGCTGATC
GCCACGGTGC TGGCGGGCGC GATGCTGTTC GCGCTCGGGC TGTTCCGGCT CGGCGGCCTG
GTGCGCTACG TGCCGATCAC CATCGTGATC GGCTTCACCA ACGGCATCGC GGTGCTGGTG
GCGCTGTCGC AGCTCAAGGA CCTGTTCGGC CTGGTCACGC CCAAGCTGCC GGCCGACTTC
TTCACGCAGA TCGGCGTGCT GCTCGAGCAC GCCGACAGCT TCAACCCGAT GGCCTTCGCG
ATCGGCCTGC TGTCGCTGGC GCTGCTGTTC GCGTGGCCGC GGCTGCAGAA GCACCACACC
GTGGTCGAGC ACACCACGCT GCGCAGCGCC TCGCGCATGC CCGCGCCGAT GGTGGTGCTG
GTGCTGGCGA CCACGGCCAC CGCACTGCTG CACCTGCCGG TCGAGACCAT CGGCTCGCGC
TTCGGCGGCA TCCCGCAGGC GCTGCCGGCC TTCGTGTGGC CCGAGCTCAG CTGGCGCAGC
GCGAAGGAGC TGTTCATCCC GACGCTGACG ATAGCGATGC TGGGCGCGGT GGAGTCGCTG
CTGTGCGCGC GCGTGGCCGA CAACGTGGGC ACCGTGCCGA AGCACGACCC GAACCAGGAG
CTGATGGCGC AGGGCATCGC CAACGTGGTG ACGCCGTTCT TCGGCGGCAT CCCGACCACC
GGCACGATCG CGCGCACGGT CACCAATGTG CGGGCCGGCG CGACCAGTCC GGTCGCCGGC
ATCGTGCATG CGGTCACGCT GCTCGTGGTG GTGCTGGTGG CGGCGCCGCT GGCCGAGCAC
GTGCCGCTGG CGGCGCTGGC CGGCATCCTG CTGTTCGTGG CCTGGAACAT GGGCGAATGG
CACGAGTTCG CACGGCTGCG GCACTTCAGC CTGCCCTACC GCATCATCCT GGTCGGCACC
TTCCTGCTGA CGGTGATCTT CGACCTGTCG GTGGCGGTGC AGGTGGGCCT GGTGATGGCC
TGCGCGTTCT TCATCTACCG CATGAGCACG CTGTTCCGCA TCGAACCGCT GGCGGCCCCG
GCCGAGACGC CGTCGGGCGT GGTGGTCGAG CGGCTCTACG GCGCACTGTT CTTCGGCGCG
GTGGCCAAGC TCGAGGCGGT GCCGGCGCGC CTGCCGGCCG GCACGCGCGT GCTGGTGCTC
GAGGCGCAGC GTCTGATCTC GATCGACGCC AGCGGTGTCG ACGCCCTCAC CCAGCTCTAC
CGCACGCTGC AGCGCCAGGG CGTGGGACTG CGGCTGTGCG AGCTGAACGA GCAGCCGCGC
TCGCTGCTGC AGCGCAGCGG CTTCACGGCG CTGATCGGCG AGGAGCGCAT CGCGCCCACG
CTCGCCGAGG CGCTGGCGCG GTCGGTCGCG CCGACGGCCA CGTGA
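
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. A minimal Python sketch of a GC-content check, assuming the sequence has been pasted into a string (`gc_fraction` is a hypothetical helper, not something the database provides):

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    # split() with no arguments drops spaces and newlines between blocks.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Example on the first block of the gene sequence only.
print(gc_fraction("ATGCAACTGC"))
```

Applied to the full gene sequence, the value can be compared with the GC content field in the Gene Information section.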
 
Protein sequence
MQLHRFRPRL LDALKTYDRG RFAADAGAGL TVGVIALPLA MAFAIASGVK PEQGIFTAII 
GGFLVSALGG SSVQIAGPAG AFIVIVYGIV ERYGLANLLI ATVLAGAMLF ALGLFRLGGL
VRYVPITIVI GFTNGIAVLV ALSQLKDLFG LVTPKLPADF FTQIGVLLEH ADSFNPMAFA
IGLLSLALLF AWPRLQKHHT VVEHTTLRSA SRMPAPMVVL VLATTATALL HLPVETIGSR
FGGIPQALPA FVWPELSWRS AKELFIPTLT IAMLGAVESL LCARVADNVG TVPKHDPNQE
LMAQGIANVV TPFFGGIPTT GTIARTVTNV RAGATSPVAG IVHAVTLLVV VLVAAPLAEH
VPLAALAGIL LFVAWNMGEW HEFARLRHFS LPYRIILVGT FLLTVIFDLS VAVQVGLVMA
CAFFIYRMST LFRIEPLAAP AETPSGVVVE RLYGALFFGA VAKLEAVPAR LPAGTRVLVL
EAQRLISIDA SGVDALTQLY RTLQRQGVGL RLCELNEQPR SLLQRSGFTA LIGEERIAPT
LAEALARSVA PTAT
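
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration in Python, not the method the database uses. It assumes translation table 11, whose codon-to-amino-acid assignments are the same as the standard code (table 11 differs in which codons may act as starts; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence shown above.
print(translate("ATGCAACTGC ACCGCTTCCG"))  # start of the protein
print(translate("CCGACGGCCA CGTGA"))       # end of the protein, up to the TGA stop
```

The two fragments give MQLHRF and PTAT, matching the first and last residues of the protein sequence listed above.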