Gene Information |
Locus tag | P9211_02341 |
Symbol | |
ID | 5731612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 224706 |
End bp | 226439 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641284578 |
Product | putative sulfate transporter |
Protein accession | YP_001550119 |
Protein GI | 159902775 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGGAA CTCCGTACGA AAGTACGGTC AACGATCTAA TCTGGTTCAA CTTGGATCCT AAATGGAGTT TTTCCACAGT GACCCTAATT AATGGTTTTC ATCTAAAGAA TATTAGAGGT GACATATTAG GTGGCCTCAC AGCTGCTGTA GTTGCATTAC CACTTGCACT TGCCTTTGGT AATGCAGCCC TAGGCCCTGG AGGGGCAATA TATGGCTTAT ATGGTGCTGT TGTAGTTGGT TTTTTGGCAG CTTTATTCGG CGGTACCCCT GCTCAAGTAA GTGGACCTAC TGGACCAATG AGTGTCACTG TGGCAGGAGT AGTCGCTAGC CTGGCTGCTG TAGGTGTTCC TAGGGATCTA TCTGCTCAAC AAATTCTGCC GCTCGTAATG GCTGCAGTAG TTATTGGCGG TTTATTTCAA GTTTTATTTG GTGTTTTAAA GCTTGGGAAA TACATAACTC TTGTTCCCTA CTCGGTTGTT TCAGGTTTTA TGTCTGGTAT TGGTGTAATT ATTATTGCAT TACAAATTGG ACCCCTACTT GGAATTAGTA CAAGAGGAGG CGTCGTTGAA TCTTTAACAA CAGTTGCATC AAATTTTGAA CCAAATGGTG CAGCAATAGG GGTTGCAATA ATGACTCTTG GAATAGTGTT TTTAACCCCC CGTAAAGTTA GTCAATGGGT CCCATCCCCT CTTATGGCTT TACTAATAGT CACTCCTATT TCTATTATTC TTTTTGGAGA TAGTGGTTTA GATCGTATAG GGGAAATACC AAGAGGGGTA CCATCTCTTA GCCTTCCAAG TTTTAATCAA TACCTTCCAA TAATCTTAAA AGCAGGACTG GTACTTGCTG TATTAGGTGC AATTGATTCT CTACTAACAT CGCTCGTAGC GGACAATATT TCACAAACCA GACATAACTC TGATAGGGAA CTAATTGGTC AAGGAATAGG TAATGCAGTA GCTGGATTGT TTTCTGGTCT GCCAGGAGCA GGCGCAACGA TGCGAACAGT CATAAATGTC AAGTCCGGCG GGTCTACTCC TTTATCTGGA ATGGTCCATT CAATAGTCCT ATTAATAGTT TTAGTTGGTG CTGGGCCGCT TGCTGAGCAA ATCCCTACTG CACTTCTTGC AGGCATTCTT ATAAAGGTTG GCCTAGATAT TATCGATTGG GGATTTTTAC GCAGAGCTCA TCGTCTTTCA CTTAAGACAG CCACTGTTAT GTATGGCGTC TTATTAATGA CTGTCTTTTG GGATCTAATA TGGGCTGTTT TAGTTGGTGT TTTTATAGCA AATATGCTCA CAATTGACTC AATTACTCAA ACACAATTAG AGGGAATGGA AGCAGATAAC CCTCTTCAAG GAAGTGATGA CGATCTACCA TCATTGCCAG CCGATGAGCA ATCATTACTA GAGAGTTGCT CAGGCGAGGT AATGCTTTTT AGGCTCAAAG GTCCTTTGAG TTTTGGTGCT GCAAAAGGAA TTACTGAGCG AATGATGCTT GTCAGAAACT ATAAAGTCTT AATCTTAGAT ATCACGGATG TGCCTCGACT TGGTGTGACT GCAACCTTAG CTATAGAGGA TATGATTCAA GAAGCAAAAA TTAATTCTAG AAAAGCTTAT GTAGCTGGAG CAAGTGGCAA AGTACAAGAA AGACTATCCA AATTTGGAGT CGAAGGAGTA GTCTCAACGA GAAAAGAAGC TTTAGAAGCT GCTGTCAGTT TAATCAAAAA TTAA
|
Protein sequence | MEGTPYESTV NDLIWFNLDP KWSFSTVTLI NGFHLKNIRG DILGGLTAAV VALPLALAFG NAALGPGGAI YGLYGAVVVG FLAALFGGTP AQVSGPTGPM SVTVAGVVAS LAAVGVPRDL SAQQILPLVM AAVVIGGLFQ VLFGVLKLGK YITLVPYSVV SGFMSGIGVI IIALQIGPLL GISTRGGVVE SLTTVASNFE PNGAAIGVAI MTLGIVFLTP RKVSQWVPSP LMALLIVTPI SIILFGDSGL DRIGEIPRGV PSLSLPSFNQ YLPIILKAGL VLAVLGAIDS LLTSLVADNI SQTRHNSDRE LIGQGIGNAV AGLFSGLPGA GATMRTVINV KSGGSTPLSG MVHSIVLLIV LVGAGPLAEQ IPTALLAGIL IKVGLDIIDW GFLRRAHRLS LKTATVMYGV LLMTVFWDLI WAVLVGVFIA NMLTIDSITQ TQLEGMEADN PLQGSDDDLP SLPADEQSLL ESCSGEVMLF RLKGPLSFGA AKGITERMML VRNYKVLILD ITDVPRLGVT ATLAIEDMIQ EAKINSRKAY VAGASGKVQE RLSKFGVEGV VSTRKEALEA AVSLIKN
|
| |
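The coordinate, length, and GC content fields in this record are related by simple arithmetic, and the sequences are displayed in space-separated blocks of 10 characters. The sketch below is a minimal consistency check, not part of the database itself. The function names (`gene_length`, `protein_length`, `strip_blocks`, `gc_fraction`) are hypothetical. It assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon, which matches the values listed above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = strip_blocks(seq).upper()
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    length = gene_length(224706, 226439)
    print(length)                  # 1734
    print(protein_length(length))  # 577
```

Passing the full "Gene sequence" field to `gc_fraction` gives a value that can be compared against the "GC content" field.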