Gene Mmar10_1620 details


Gene Information

Locus tag: Mmar10_1620
Symbol:
ID: 4284590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1772039
End bp: 1773787
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 65%
IMG OID: 638141107
Product: sulfate transporter
Protein accession: YP_756850
Protein GI: 114570170
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
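
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1772039, 1773787)
print(length)                     # 1749, matching the Gene Length field
print(protein_length_aa(length))  # 582, matching the Protein Length field
```

Under these assumptions the listed start, end, gene length, and protein length are mutually consistent.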


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.242226
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCCTGGC TGCGGCGCCT CCTGCCCGGA CTTGAATGGC TCGACGGCTA TGACGGTCAT 
CGACTGACCC AGGACGGTCT GGCCAGCCTG GTCACCGCCA TCCTGCTGAT CCCGCAAAGC
CTGGCCTATG CCCTCCTTGC CGGCCTGCCA CCGCAAGCCG GGCTTTACGC CTCCATCGCG
CCCCTGGTCG CCTATGCCCT GTTCGGGTCC AGCCGTGTGC TGGCGGTTGG GCCTGTAGCG
GTGATCTCAT TGATGACCGC AGCCGCGATC GGTTCGCTTG GCCTGACAGA CCCGGCCGAC
CTGATGGCCG CCGCCGGCGC CCTCGCTTTG CTGTCCAGCG TCTTCCTCTT GTTGTTCGGC
GTGTTCCGGC TGGGCAGTGT CGCCAATTTC CTGTCCCGCC CGGTGGTGGA AGCCTTCATC
ACGGCATCGA CCGTACTCAT CATCGCCAGC CAGCTTCGCC ACTTTCTCGG TGTGGAGATG
GAGGGCGCGA CAATCCCGGA ACTGGTGGTT TCGCTGATCC GCCAGTTCGA CGGGATCAAT
ACCACGGCGC TGGCAATGGG CGTGATTAGC CTGGCCTTTC TCCTGGCCTC GCGCTCACTC
CTGCCCAACC TGCTGGAGCG GACCGGACTG GCGACATCCC ATATCAGCAT CCTGACCCGG
ATCGCCCCGG CCGCCCTCGT CGCGCTGACG GCGCTGACGG CCTGGGCCTT CGGACTGCAG
GAGCGGACCG GCCTGTCGAT TGTCGGCGAA CTGCCCTCCG GCCTGCCACC TTTCGCCTTT
CCGATCGTGC CACTGGAGAC CTGGCGGGCG CTGATCGGAC CGGCGGCGCT GATTTCTCTG
GTCGGCTTTG TCGAGAGCGT ATCGGTTGGA CAATCGCTCG CCGCGCGTCG CCGCGAGACC
ATCAATCCGA ACCGGGAATT GCTCGGTCTG GGCGCAGCCA ATGCGGCCGC GGCCTTCACC
GGCGGCTATC CGGTCACCGG CGGGTTCGCG CGCTCGGTGG TGAATGAGTC GGCGGGCGCT
GAAACACCGG TCGCCGGTGT CTTCACGGCG CTCATCATTC TCCTCGTCGC CGCCTTCCTC
ACTCCGCTTT TCCACCATCT CCCGAAAGCG GCCCTCGCCG CGACGATTCT GGCAGCGATC
TGGCGGATGG CGAATTTCCA TGACGCCTGG CTGGCCTGGA AATACTCCCA TGCCGACGGG
GCCGCGGCCT TCCTGACCTT GGTCGGCGTG CTTTTCCTCG GGGTCGAGAT CGGTCTGACC
CTGGGCGTCG CCCTGTCGGT CGGGCTCGTT CTCCAGCGCA CGATGCGACC GCACTGGGCG
GAGGTCGGAC AGGTCCCGCG CACCCATCAC TTCCGCAATA TCAACCGGCA TGAGGTGATC
TGTTCGCCGC ATGTGGTGTC GCTGCGGATC GACGAGGCGC TCTATTTCGC CAATGCGCGC
TTTCTGGAAG ACCTCGCCGG CGAGATCATC GCCCGTGAAA GCCGTCCGAC CGACCTCGTC
CTGCTGTTTG CCGCCGTCAA TTTCGTTGAT GCGAGCGCCC TTGGCAGCCT GCGGGTGATC
AATGCCCGTC TCGGGGATGC CGGGGTCAAA CTCCACCTGT CCGAGGTCAA GGGCCCGGTC
GCTGACAAGC TGCTCGAGGC GGGTTTCTAC GAGGAATTGT CCGGCGAGGT TTTCCTGTCC
CACTACGCCG CCATGCGGAC ACTCGACCCG GCAACCACCT TGCGCGCGGA GGGCGTTTCA
CCGGATTGA
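
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how such text might be cleaned up and its GC fraction computed follows; the helper names are hypothetical, and only the first line of the sequence is used as sample input. Running the same functions over the full sequence would allow a comparison with the GC content field listed for this gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (blocks of ten bases).
first_line = "ATGGCCTGGC TGCGGCGCCT CCTGCCCGGA CTTGAATGGC TCGACGGCTA TGACGGTCAT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```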
 
Protein sequence
MAWLRRLLPG LEWLDGYDGH RLTQDGLASL VTAILLIPQS LAYALLAGLP PQAGLYASIA 
PLVAYALFGS SRVLAVGPVA VISLMTAAAI GSLGLTDPAD LMAAAGALAL LSSVFLLLFG
VFRLGSVANF LSRPVVEAFI TASTVLIIAS QLRHFLGVEM EGATIPELVV SLIRQFDGIN
TTALAMGVIS LAFLLASRSL LPNLLERTGL ATSHISILTR IAPAALVALT ALTAWAFGLQ
ERTGLSIVGE LPSGLPPFAF PIVPLETWRA LIGPAALISL VGFVESVSVG QSLAARRRET
INPNRELLGL GAANAAAAFT GGYPVTGGFA RSVVNESAGA ETPVAGVFTA LIILLVAAFL
TPLFHHLPKA ALAATILAAI WRMANFHDAW LAWKYSHADG AAAFLTLVGV LFLGVEIGLT
LGVALSVGLV LQRTMRPHWA EVGQVPRTHH FRNINRHEVI CSPHVVSLRI DEALYFANAR
FLEDLAGEII ARESRPTDLV LLFAAVNFVD ASALGSLRVI NARLGDAGVK LHLSEVKGPV
ADKLLEAGFY EELSGEVFLS HYAAMRTLDP ATTLRAEGVS PD