Gene A9601_02331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02331 
Symbol 
ID4716917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp217052 
End bp218704 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content38% 
IMG OID640077932 
Productputative sulfate transporter 
Protein accessionYP_001008628 
Protein GI123967770 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.868992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAATAA TTAATGGATT TCATCTAAAG AATTTAAGAG GAGATATTCT TGGAGGGATC 
ACTGCTGCTG TAGTAGCTTT ACCTCTCGCT CTTGCTTTTG GTAATGCTGC ATTAGGACCT
GGCGGGGCAA TTTATGGCCT ATATGGGGCA GTAGTAGTTG GTTTTTTAGC AGCATTATTT
GGAGGAACAC CTGCTCAAGT TAGTGGACCT ACCGGTCCAA TGAGTGTAAC TGTTGCTGGC
GTAGTAGCAG GCTTAGCAGC AGTGGGGGTT CCAAGAGATC TTTCTGCAGG ACAAATTTTA
CCTTTAGTGA TGGCAGCGGT AGTAATTGGC GGCTTACTGC AAATATTATT TGGAATTCTC
AAACTAGGTA AATACATTAC TTTAGTTCCA TATTCTGTTG TGTCAGGATT CATGTCTGGT
ATTGGAGTAA TAATAATTGC ACTTCAGATT GGTCCATTAC TAGGAATCAG TACCAGAGGT
GGAGTAGTTG AATCTTTATC AACTGTATTT TCAAATTTCC AGCCAAACGG TGCTGCTATT
GGAGTAGCAA TAATGACACT AGGTATAGTA TTTCTTACTC CTAGAAAAAT AAGTCAATGG
GTTCCTTCTC CCCTCTTAGC CTTATTGATA GTAACCCCAA TATCAATATT AATTTTTGGA
GAAGGAGCTA TTGATAGAAT TGGTGAAATT CCCAGGGGAG TTCCATCTTT AAATTTCCCA
AGTTTTAATC AATATTTTCC AATCATTTTT AAGGCAGGAT TAGTCCTCGC AGTACTTGGC
GCAATTGACT CTTTACTAAC ATCACTAGTA GCAGACAATA TATCTCAAAC AAAACATAAT
TCTGATAGAG AACTTATTGG TCAAGGAATA GGAAATGCTG TTGCCGGTCT GTTTTCAGGC
TTACCTGGAG CCGGAGCAAC AATGAGAACA GTTATAAATG TTAAATCTGG AGGATCCACT
CCCATTTCTG GTATGGTTCA CTCAGTTGTC TTGTTGATAG TTTTAGTTGG CGCAGGTCCT
TTAGCCGAGC AAATACCAAC TGCGTTATTA GCAGGAATTC TTATAAAAGT TGGTCTAGAT
ATTATTGATT GGGGGTTCTT AAGGAGGGCC CACAAATTAT CTTTAAAAAC TTCAGTTGTT
ATGTACGGCG TACTCCTCAT GACTGTTTTT TGGGATTTAA TTTGGGCAGT TTTAGTCGGT
GTATTCATAG CAAATATGCT CACTATTGAT TCAATAACGG AAACTCAACT AGAAGGTATG
GATGAAGATA ATCCTTTATC AAAAGATGAT CAAGCTAAAA ATGCATTACC TGCTGATGAA
AAAGCACTAC TTGATAGATG TTCAGGAGAA GTAATGTTAT TTAGACTTAA AGGACCACTT
AGTTTTGGAG CAGCTAAAGG TATATCTGAG AGAATGATGC TAGTAAGAAA CTATAAGGTT
TTGATATTAG ATATCACTGA TGTACCAAGA CTTGGAGTGA CCGCGACTCT GGCAATAGAA
GATATGATGC AAGAAGCTAA AAATAATTCC AGAAAAGCAT TTGTTGCTGG GGCTAATGAA
AAAGTAAAGG ATAGATTAGC TAAGTTTGGA GTTGAAGGCA TCATTGAGAC AAGAAAAGAA
GCTTTAGAAA CCGCTCTAAA TGAAATAGCC TAA
 
Protein sequence
MKIINGFHLK NLRGDILGGI TAAVVALPLA LAFGNAALGP GGAIYGLYGA VVVGFLAALF 
GGTPAQVSGP TGPMSVTVAG VVAGLAAVGV PRDLSAGQIL PLVMAAVVIG GLLQILFGIL
KLGKYITLVP YSVVSGFMSG IGVIIIALQI GPLLGISTRG GVVESLSTVF SNFQPNGAAI
GVAIMTLGIV FLTPRKISQW VPSPLLALLI VTPISILIFG EGAIDRIGEI PRGVPSLNFP
SFNQYFPIIF KAGLVLAVLG AIDSLLTSLV ADNISQTKHN SDRELIGQGI GNAVAGLFSG
LPGAGATMRT VINVKSGGST PISGMVHSVV LLIVLVGAGP LAEQIPTALL AGILIKVGLD
IIDWGFLRRA HKLSLKTSVV MYGVLLMTVF WDLIWAVLVG VFIANMLTID SITETQLEGM
DEDNPLSKDD QAKNALPADE KALLDRCSGE VMLFRLKGPL SFGAAKGISE RMMLVRNYKV
LILDITDVPR LGVTATLAIE DMMQEAKNNS RKAFVAGANE KVKDRLAKFG VEGIIETRKE
ALETALNEIA