Gene P9303_12181 details

Gene Information

Locus tag: P9303_12181
Symbol:
ID: 4778503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1060793
End bp: 1062373
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 48%
IMG OID: 640086727
Product: putative sulfate transporter
Protein accession: YP_001017232
Protein GI: 124022925
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
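
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither assumption is stated on this page, and the function names are illustrative.

```python
# Values copied from the Gene Information fields above.
START_BP = 1060793
END_BP = 1062373
GENE_LENGTH_BP = 1581
PROTEIN_LENGTH_AA = 526


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1581
    print(expected_protein_length(GENE_LENGTH_BP))  # 526
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.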


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.97846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAACC GGTCAGAACC ATCGTTGATC AAACAATGGT TAGGCAACCC TCCAAAAGAT 
CTCCTCTCTG GGCTGGTGGT TGCCTTCGCA ATGATCCCAG AGGCGATTGC CTTTTCAGGC
ATCGCTGGTG TTGATCCGCA AGTGGGCCTC TTTGGAGCGT TTTGCCTTTC AGTCACCATC
GCCATAGTCG GCGGACGCAT GGGGATGATC ACCTCAGCGA CTGGCTCGAC AGCACTGTTG
ATGACAGGGA TTGTGGCGAC TGGTAATGCC GTTGGTGAAG GCCTAGGCCT TTCCTATTTG
ATGGCAGCAG GCCTATTGAC TGGGCTATTG CAAATCCTTT GGGGTTATTT GAGACTTGCC
TATCAAATGC GTTTTGTTCC TCAAGGGGTG CTGAGTGGAT TTGTGAATGC ACTTGCCTTA
TTGATTTTTC AAGCACAATT CCCTCAATTA GGCCTGAATC TGCATTACGG CGAAGACGTT
GTTGTCGATC ATGCAACCCA GGTTTTGCCT ACTGCTGGCC AGATCCCTTT GGTTTGGGGG
TTGGTGATCC TTGGCTTGGT GATTATTTAT GGCTTACCGC GCATAACTCG CCTGCTGCCT
TCTCAGCTGG TGGCCATCAT TGTCTTAACG ATGATTAGCA TCGGTTTCAA CCTCGATATT
CCTACTGTTG AGAGCCTAGG AAGCCTTCCA GATGGTCTCC CTAGTTTCAC ACTTCCATTT
GGATCATTAG CAGATGGCAA GGTGCCATTC AACCTTGAAA CATTTGGATT GGTTTTACCC
ACAGCTCTCG CAGTTTCCTT GGTTGGGTTG ATTGAGACGT TCCTCACCCA AGACATCGTT
GATGATTTAA CTGATACCAC CTCCAACAAG AATGTTGAGG CTAGAGGCCA GGGAATAGCA
AATGTTGTCT CCTCTCTTTT TGGAGGAATG GCAGGCTGTG CACTTGTGGG CCAATCTGTG
ATGAATACAG AGAATGGTGG GCGTAGCAGA CTTTCAACAT TGTTTTCTGG TGTCAGTTTA
TTGTTAATGA TCTTGTTGGG TCAAGGTTGG TTAAAACAAA TCCCTATGGC GGCATTAGTG
GCTGTGATGA TTGCGATTGC TGTCAGTACT GCTGATATCC GCGGACTCCG ACAGCTCAAA
AAGATTCCTC GCAGTGATAC AGCTGTAATG CTGATGACTT TTGCCGTCAC CATGCTCACC
ACACCCCATA ATTTAGCGCT TGGAGTGTTA GCGGGTGTTG CACTGGCAGG GGTACTCTTC
AGCCGTAAAG TTGCCAAAGT GATCCGTGTC AGTGTTATAC AAGTCAACCC TGATGAGCTT
CGCTATGAGG TAAGTGGACA ATTATTCTTC GTGAGCAAGG TGTATTTTTT ACAGGGCTTT
GATATTCATG AGCACCCTGC CAAGGTGACA GTCGACATGT CGCGAGCACA CATCTGGGAC
CAAAGCGGAG TTGCTGCTTT GGATCAGGTG ATTCGCAAGC TTCGCCTTGG AGGATCAGAG
GTGGAGGTGG TTGGCCTTAA TAAGGAGAGT CTTGATCTGT TTGAACGTAT AGGCGGCAAT
CAGGAGCCTG CTCATATCTA G
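
The gene sequence above is laid out in blocks of ten bases, so the spaces and line breaks have to be removed before it can be measured. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; `clean_sequence` and `gc_content` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence listing, used only as a short sample.
    sample = "ATGATTAACC GGTCAGAACC ATCGTTGATC AAACAATGGT TAGGCAACCC TCCAAAAGAT"
    seq = clean_sequence(sample)
    print(len(seq))
    print(f"{gc_content(seq):.1%}")
```

Pasting the full listing into `sample` should give a length that matches the Gene Length field and a GC fraction close to the reported percentage, assuming the page rounds to a whole percent.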
 
Protein sequence
MINRSEPSLI KQWLGNPPKD LLSGLVVAFA MIPEAIAFSG IAGVDPQVGL FGAFCLSVTI 
AIVGGRMGMI TSATGSTALL MTGIVATGNA VGEGLGLSYL MAAGLLTGLL QILWGYLRLA
YQMRFVPQGV LSGFVNALAL LIFQAQFPQL GLNLHYGEDV VVDHATQVLP TAGQIPLVWG
LVILGLVIIY GLPRITRLLP SQLVAIIVLT MISIGFNLDI PTVESLGSLP DGLPSFTLPF
GSLADGKVPF NLETFGLVLP TALAVSLVGL IETFLTQDIV DDLTDTTSNK NVEARGQGIA
NVVSSLFGGM AGCALVGQSV MNTENGGRSR LSTLFSGVSL LLMILLGQGW LKQIPMAALV
AVMIAIAVST ADIRGLRQLK KIPRSDTAVM LMTFAVTMLT TPHNLALGVL AGVALAGVLF
SRKVAKVIRV SVIQVNPDEL RYEVSGQLFF VSKVYFLQGF DIHEHPAKVT VDMSRAHIWD
QSGVAALDQV IRKLRLGGSE VEVVGLNKES LDLFERIGGN QEPAHI
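
The protein sequence above can be reproduced from the gene sequence by reading it codon by codon. The sketch below builds the codon table from the compact NCBI-style listing for the standard genetic code; translation table 11 shares these codon-to-amino-acid assignments and differs in which codons may serve as starts, which this sketch does not model (the gene here begins with ATG, so that does not matter). The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Standard-code amino acids in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listing.
    print(translate("ATGATTAACC GGTCAGAACC ATCGTTGATC"))  # MINRSEPSLI
```

The printed peptide matches the first block of the protein sequence listing; the same call on the full gene sequence should yield the full protein.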