Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07551 |
Symbol | |
ID | 5730244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 659147 |
End bp | 660724 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641285118 |
Product | putative sulfate transporter |
Protein accession | YP_001550640 |
Protein GI | 159903296 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0159934 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACAA AAAACAACGC GACTTTTTTT AATCAGTGGT TTAGCAGCCC TAAAGAAGAT CTACTTTCTG GTTTAGTAGT TGCCTTTGCA ATGATTCCAG AGGCTATTGC TTTTTCTGGG ATAGCAGGAG TTGATCCACA AGTAGGACTA TTTGGGGCAT TTTGCCTATC CATAACAATT GCTCTAGTCG GAGGCCGGCT GGGGATGATC ACTTCAGCCA CTGGATCAAC GGCGCTATTG ATGACTGGCG TAGTTGCTAC TGGTAATGCA CAAGGAGAAG GGCTTGGGCT CTCTTATCTA ATAGCAGCAG GATTATTAAC AGGAGTTTTC CAAATTCTTT GGGGATATTT ACGCTTGGCA TATCAAATGA GATTTGTGCC AACTGGTGTA TTAAGCGGTT TTGTAAATGC CCTTGCATTA TTAATTTTCC AAGCCCAACT TCCACAATTA GGATTGAATT TGCAATATGG CGAGCATATA CATAACGAAC AAATGAGCCA AATGCTTCCT ACTGGAATCC AAATACCTAT TGTATGGGGA TTAGTAATAC TAGGCCTGAT TATTATTTAT GGTCTTCCAA AAATAACCAA AGCAATCCCC TCGCAACTTA TCGCAATTTT AGTTTTAACA CTAATATCTA TAGGTTTAGA TCTTCAAATA CCTACTGTCC AAGATCTTGG TCAACTTCCC GTTGGTTTAC CAAGTATTTC TTTGCCTTTT GGATCGATCG ATAATGGGAA AATACCTTTC AACATGGAGA CATTCGGAAT TATTCTTCCT ACTGCACTTG CTATTTCACT TGTAGGGTTA ATGGAAACAT TTCTTACTCA GGACATACTT GATGAGATAA CAGATTCAAA CTCAAACAAA AATGTTGAAG CTAGAGGTCA AGGAATAGCA AACATAGTTT CATCTCTGTT TGGAGGAATG GCTGGCTGCG CACTAGTTGG ACAATCAGTT ATGAATGTAG AGAATGGAGG TAGGTCAAGA CTTTCAACCT TCTCATCTGG TATTAGCCTT CTTATATTGA TATTACTTTG CAAACCATGG CTAAAAGAGA TACCTATGGC GGCACTTGTA TCAGTAATGA TAACTATTGC TATTAGTACT GCTGATACAA ATGGACTAAA AAACATCTCC AGAATACCTA GAAGTGATAC CGCAGTAATG TTAATGACTT TCTCAGTAAC GATGCTAACA ACACCACATA ATCTTGCTCT AGGTGTAATT GCTGGAGTTG CTCTAGCCGG AATATTATTT AGTCGTAAAG TTGCAAAAGT AATTAAAGTT ACTTCAAGCA AAGTAAATGA AGAAGAGATT ATTTATAAAG TATCTGGACA ACTATTTTTT GTAAGTAAAA TATATTTTGC ACAGGGTTTT GATACACATG ATCACGTGAA GAAAATAACC ATTGACATGA CGAATGCTCA CATATGGGAT CAGAGTGGAG TGGCTGCTCT TGATCAAATA ATAAGGAAAC TTACTTTGGG CGGATCAAAT GTTAATGTTA TCGGTCTCAA CAAAGAAAGT CTTGATCTAT TTGATCGACT GGGTGGTCAA GAGCCATCCC ATGGATAA
|
Protein sequence | MPTKNNATFF NQWFSSPKED LLSGLVVAFA MIPEAIAFSG IAGVDPQVGL FGAFCLSITI ALVGGRLGMI TSATGSTALL MTGVVATGNA QGEGLGLSYL IAAGLLTGVF QILWGYLRLA YQMRFVPTGV LSGFVNALAL LIFQAQLPQL GLNLQYGEHI HNEQMSQMLP TGIQIPIVWG LVILGLIIIY GLPKITKAIP SQLIAILVLT LISIGLDLQI PTVQDLGQLP VGLPSISLPF GSIDNGKIPF NMETFGIILP TALAISLVGL METFLTQDIL DEITDSNSNK NVEARGQGIA NIVSSLFGGM AGCALVGQSV MNVENGGRSR LSTFSSGISL LILILLCKPW LKEIPMAALV SVMITIAIST ADTNGLKNIS RIPRSDTAVM LMTFSVTMLT TPHNLALGVI AGVALAGILF SRKVAKVIKV TSSKVNEEEI IYKVSGQLFF VSKIYFAQGF DTHDHVKKIT IDMTNAHIWD QSGVAALDQI IRKLTLGGSN VNVIGLNKES LDLFDRLGGQ EPSHG
|
| |