Gene P9211_02341 details

Gene Information

Locus tag: P9211_02341
Symbol:
ID: 5731612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 224706
End bp: 226439
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 41%
IMG OID: 641284578
Product: putative sulfate transporter
Protein accession: YP_001550119
Protein GI: 159902775
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
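
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(224706, 226439)
print(length)                     # 1734
print(protein_length_aa(length))  # 577
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of this record.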


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGGAA CTCCGTACGA AAGTACGGTC AACGATCTAA TCTGGTTCAA CTTGGATCCT 
AAATGGAGTT TTTCCACAGT GACCCTAATT AATGGTTTTC ATCTAAAGAA TATTAGAGGT
GACATATTAG GTGGCCTCAC AGCTGCTGTA GTTGCATTAC CACTTGCACT TGCCTTTGGT
AATGCAGCCC TAGGCCCTGG AGGGGCAATA TATGGCTTAT ATGGTGCTGT TGTAGTTGGT
TTTTTGGCAG CTTTATTCGG CGGTACCCCT GCTCAAGTAA GTGGACCTAC TGGACCAATG
AGTGTCACTG TGGCAGGAGT AGTCGCTAGC CTGGCTGCTG TAGGTGTTCC TAGGGATCTA
TCTGCTCAAC AAATTCTGCC GCTCGTAATG GCTGCAGTAG TTATTGGCGG TTTATTTCAA
GTTTTATTTG GTGTTTTAAA GCTTGGGAAA TACATAACTC TTGTTCCCTA CTCGGTTGTT
TCAGGTTTTA TGTCTGGTAT TGGTGTAATT ATTATTGCAT TACAAATTGG ACCCCTACTT
GGAATTAGTA CAAGAGGAGG CGTCGTTGAA TCTTTAACAA CAGTTGCATC AAATTTTGAA
CCAAATGGTG CAGCAATAGG GGTTGCAATA ATGACTCTTG GAATAGTGTT TTTAACCCCC
CGTAAAGTTA GTCAATGGGT CCCATCCCCT CTTATGGCTT TACTAATAGT CACTCCTATT
TCTATTATTC TTTTTGGAGA TAGTGGTTTA GATCGTATAG GGGAAATACC AAGAGGGGTA
CCATCTCTTA GCCTTCCAAG TTTTAATCAA TACCTTCCAA TAATCTTAAA AGCAGGACTG
GTACTTGCTG TATTAGGTGC AATTGATTCT CTACTAACAT CGCTCGTAGC GGACAATATT
TCACAAACCA GACATAACTC TGATAGGGAA CTAATTGGTC AAGGAATAGG TAATGCAGTA
GCTGGATTGT TTTCTGGTCT GCCAGGAGCA GGCGCAACGA TGCGAACAGT CATAAATGTC
AAGTCCGGCG GGTCTACTCC TTTATCTGGA ATGGTCCATT CAATAGTCCT ATTAATAGTT
TTAGTTGGTG CTGGGCCGCT TGCTGAGCAA ATCCCTACTG CACTTCTTGC AGGCATTCTT
ATAAAGGTTG GCCTAGATAT TATCGATTGG GGATTTTTAC GCAGAGCTCA TCGTCTTTCA
CTTAAGACAG CCACTGTTAT GTATGGCGTC TTATTAATGA CTGTCTTTTG GGATCTAATA
TGGGCTGTTT TAGTTGGTGT TTTTATAGCA AATATGCTCA CAATTGACTC AATTACTCAA
ACACAATTAG AGGGAATGGA AGCAGATAAC CCTCTTCAAG GAAGTGATGA CGATCTACCA
TCATTGCCAG CCGATGAGCA ATCATTACTA GAGAGTTGCT CAGGCGAGGT AATGCTTTTT
AGGCTCAAAG GTCCTTTGAG TTTTGGTGCT GCAAAAGGAA TTACTGAGCG AATGATGCTT
GTCAGAAACT ATAAAGTCTT AATCTTAGAT ATCACGGATG TGCCTCGACT TGGTGTGACT
GCAACCTTAG CTATAGAGGA TATGATTCAA GAAGCAAAAA TTAATTCTAG AAAAGCTTAT
GTAGCTGGAG CAAGTGGCAA AGTACAAGAA AGACTATCCA AATTTGGAGT CGAAGGAGTA
GTCTCAACGA GAAAAGAAGC TTTAGAAGCT GCTGTCAGTT TAATCAAAAA TTAA
 
Protein sequence
MEGTPYESTV NDLIWFNLDP KWSFSTVTLI NGFHLKNIRG DILGGLTAAV VALPLALAFG 
NAALGPGGAI YGLYGAVVVG FLAALFGGTP AQVSGPTGPM SVTVAGVVAS LAAVGVPRDL
SAQQILPLVM AAVVIGGLFQ VLFGVLKLGK YITLVPYSVV SGFMSGIGVI IIALQIGPLL
GISTRGGVVE SLTTVASNFE PNGAAIGVAI MTLGIVFLTP RKVSQWVPSP LMALLIVTPI
SIILFGDSGL DRIGEIPRGV PSLSLPSFNQ YLPIILKAGL VLAVLGAIDS LLTSLVADNI
SQTRHNSDRE LIGQGIGNAV AGLFSGLPGA GATMRTVINV KSGGSTPLSG MVHSIVLLIV
LVGAGPLAEQ IPTALLAGIL IKVGLDIIDW GFLRRAHRLS LKTATVMYGV LLMTVFWDLI
WAVLVGVFIA NMLTIDSITQ TQLEGMEADN PLQGSDDDLP SLPADEQSLL ESCSGEVMLF
RLKGPLSFGA AKGITERMML VRNYKVLILD ITDVPRLGVT ATLAIEDMIQ EAKINSRKAY
VAGASGKVQE RLSKFGVEGV VSTRKEALEA AVSLIKN
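
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a sketch only, not the method used to produce this record: it builds the codon table from the commonly published base ordering (T, C, A, G), which translation table 11 shares with the standard code for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; the gene shown here starts with ATG, so that does not matter for this example. The helper names (clean, translate, gc_fraction) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGGAGGGAA CTCCGTACGA AAGTACGGTC AACGATCTAA TCTGGTTCAA CTTGGATCCT"
print(translate(first_line))  # MEGTPYESTVNDLIWFNLDP
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field.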