Gene Information |
Locus tag | P9303_12181 |
Symbol | |
ID | 4778503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1060793 |
End bp | 1062373 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640086727 |
Product | putative sulfate transporter |
Protein accession | YP_001017232 |
Protein GI | 124022925 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
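The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that `Start bp` and `End bp` are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1060793, 1062373)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Under these assumptions, both results agree with the `Gene Length` and `Protein Length` fields of the record.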
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.97846 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAACC GGTCAGAACC ATCGTTGATC AAACAATGGT TAGGCAACCC TCCAAAAGAT CTCCTCTCTG GGCTGGTGGT TGCCTTCGCA ATGATCCCAG AGGCGATTGC CTTTTCAGGC ATCGCTGGTG TTGATCCGCA AGTGGGCCTC TTTGGAGCGT TTTGCCTTTC AGTCACCATC GCCATAGTCG GCGGACGCAT GGGGATGATC ACCTCAGCGA CTGGCTCGAC AGCACTGTTG ATGACAGGGA TTGTGGCGAC TGGTAATGCC GTTGGTGAAG GCCTAGGCCT TTCCTATTTG ATGGCAGCAG GCCTATTGAC TGGGCTATTG CAAATCCTTT GGGGTTATTT GAGACTTGCC TATCAAATGC GTTTTGTTCC TCAAGGGGTG CTGAGTGGAT TTGTGAATGC ACTTGCCTTA TTGATTTTTC AAGCACAATT CCCTCAATTA GGCCTGAATC TGCATTACGG CGAAGACGTT GTTGTCGATC ATGCAACCCA GGTTTTGCCT ACTGCTGGCC AGATCCCTTT GGTTTGGGGG TTGGTGATCC TTGGCTTGGT GATTATTTAT GGCTTACCGC GCATAACTCG CCTGCTGCCT TCTCAGCTGG TGGCCATCAT TGTCTTAACG ATGATTAGCA TCGGTTTCAA CCTCGATATT CCTACTGTTG AGAGCCTAGG AAGCCTTCCA GATGGTCTCC CTAGTTTCAC ACTTCCATTT GGATCATTAG CAGATGGCAA GGTGCCATTC AACCTTGAAA CATTTGGATT GGTTTTACCC ACAGCTCTCG CAGTTTCCTT GGTTGGGTTG ATTGAGACGT TCCTCACCCA AGACATCGTT GATGATTTAA CTGATACCAC CTCCAACAAG AATGTTGAGG CTAGAGGCCA GGGAATAGCA AATGTTGTCT CCTCTCTTTT TGGAGGAATG GCAGGCTGTG CACTTGTGGG CCAATCTGTG ATGAATACAG AGAATGGTGG GCGTAGCAGA CTTTCAACAT TGTTTTCTGG TGTCAGTTTA TTGTTAATGA TCTTGTTGGG TCAAGGTTGG TTAAAACAAA TCCCTATGGC GGCATTAGTG GCTGTGATGA TTGCGATTGC TGTCAGTACT GCTGATATCC GCGGACTCCG ACAGCTCAAA AAGATTCCTC GCAGTGATAC AGCTGTAATG CTGATGACTT TTGCCGTCAC CATGCTCACC ACACCCCATA ATTTAGCGCT TGGAGTGTTA GCGGGTGTTG CACTGGCAGG GGTACTCTTC AGCCGTAAAG TTGCCAAAGT GATCCGTGTC AGTGTTATAC AAGTCAACCC TGATGAGCTT CGCTATGAGG TAAGTGGACA ATTATTCTTC GTGAGCAAGG TGTATTTTTT ACAGGGCTTT GATATTCATG AGCACCCTGC CAAGGTGACA GTCGACATGT CGCGAGCACA CATCTGGGAC CAAAGCGGAG TTGCTGCTTT GGATCAGGTG ATTCGCAAGC TTCGCCTTGG AGGATCAGAG GTGGAGGTGG TTGGCCTTAA TAAGGAGAGT CTTGATCTGT TTGAACGTAT AGGCGGCAAT CAGGAGCCTG CTCATATCTA G
|
Protein sequence | MINRSEPSLI KQWLGNPPKD LLSGLVVAFA MIPEAIAFSG IAGVDPQVGL FGAFCLSVTI AIVGGRMGMI TSATGSTALL MTGIVATGNA VGEGLGLSYL MAAGLLTGLL QILWGYLRLA YQMRFVPQGV LSGFVNALAL LIFQAQFPQL GLNLHYGEDV VVDHATQVLP TAGQIPLVWG LVILGLVIIY GLPRITRLLP SQLVAIIVLT MISIGFNLDI PTVESLGSLP DGLPSFTLPF GSLADGKVPF NLETFGLVLP TALAVSLVGL IETFLTQDIV DDLTDTTSNK NVEARGQGIA NVVSSLFGGM AGCALVGQSV MNTENGGRSR LSTLFSGVSL LLMILLGQGW LKQIPMAALV AVMIAIAVST ADIRGLRQLK KIPRSDTAVM LMTFAVTMLT TPHNLALGVL AGVALAGVLF SRKVAKVIRV SVIQVNPDEL RYEVSGQLFF VSKVYFLQGF DIHEHPAKVT VDMSRAHIWD QSGVAALDQV IRKLRLGGSE VEVVGLNKES LDLFERIGGN QEPAHI
|
| |
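The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate a coding sequence. It is a rough illustration rather than the database's own method: the helper names are hypothetical, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed. Alternative start codons such as GTG or TTG are not handled specially, which does not matter here because this gene begins with ATG.

```python
# Standard genetic code in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid assignments; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped):
    """Remove the whitespace between 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in an ungrouped DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three display blocks of the gene sequence above.
fragment = clean_sequence("ATGATTAACC GGTCAGAACC ATCGTTGATC")
print(translate(fragment))  # MINRSEPSLI
```

The translated fragment matches the first ten residues of the protein sequence. Applying the same functions to the full gene sequence should, under the stated assumptions, reproduce the listed protein and allow the reported GC content to be checked.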