Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_02331 |
Symbol | |
ID | 4716917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 217052 |
End bp | 218704 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640077932 |
Product | putative sulfate transporter |
Protein accession | YP_001008628 |
Protein GI | 123967770 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.868992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAATAA TTAATGGATT TCATCTAAAG AATTTAAGAG GAGATATTCT TGGAGGGATC ACTGCTGCTG TAGTAGCTTT ACCTCTCGCT CTTGCTTTTG GTAATGCTGC ATTAGGACCT GGCGGGGCAA TTTATGGCCT ATATGGGGCA GTAGTAGTTG GTTTTTTAGC AGCATTATTT GGAGGAACAC CTGCTCAAGT TAGTGGACCT ACCGGTCCAA TGAGTGTAAC TGTTGCTGGC GTAGTAGCAG GCTTAGCAGC AGTGGGGGTT CCAAGAGATC TTTCTGCAGG ACAAATTTTA CCTTTAGTGA TGGCAGCGGT AGTAATTGGC GGCTTACTGC AAATATTATT TGGAATTCTC AAACTAGGTA AATACATTAC TTTAGTTCCA TATTCTGTTG TGTCAGGATT CATGTCTGGT ATTGGAGTAA TAATAATTGC ACTTCAGATT GGTCCATTAC TAGGAATCAG TACCAGAGGT GGAGTAGTTG AATCTTTATC AACTGTATTT TCAAATTTCC AGCCAAACGG TGCTGCTATT GGAGTAGCAA TAATGACACT AGGTATAGTA TTTCTTACTC CTAGAAAAAT AAGTCAATGG GTTCCTTCTC CCCTCTTAGC CTTATTGATA GTAACCCCAA TATCAATATT AATTTTTGGA GAAGGAGCTA TTGATAGAAT TGGTGAAATT CCCAGGGGAG TTCCATCTTT AAATTTCCCA AGTTTTAATC AATATTTTCC AATCATTTTT AAGGCAGGAT TAGTCCTCGC AGTACTTGGC GCAATTGACT CTTTACTAAC ATCACTAGTA GCAGACAATA TATCTCAAAC AAAACATAAT TCTGATAGAG AACTTATTGG TCAAGGAATA GGAAATGCTG TTGCCGGTCT GTTTTCAGGC TTACCTGGAG CCGGAGCAAC AATGAGAACA GTTATAAATG TTAAATCTGG AGGATCCACT CCCATTTCTG GTATGGTTCA CTCAGTTGTC TTGTTGATAG TTTTAGTTGG CGCAGGTCCT TTAGCCGAGC AAATACCAAC TGCGTTATTA GCAGGAATTC TTATAAAAGT TGGTCTAGAT ATTATTGATT GGGGGTTCTT AAGGAGGGCC CACAAATTAT CTTTAAAAAC TTCAGTTGTT ATGTACGGCG TACTCCTCAT GACTGTTTTT TGGGATTTAA TTTGGGCAGT TTTAGTCGGT GTATTCATAG CAAATATGCT CACTATTGAT TCAATAACGG AAACTCAACT AGAAGGTATG GATGAAGATA ATCCTTTATC AAAAGATGAT CAAGCTAAAA ATGCATTACC TGCTGATGAA AAAGCACTAC TTGATAGATG TTCAGGAGAA GTAATGTTAT TTAGACTTAA AGGACCACTT AGTTTTGGAG CAGCTAAAGG TATATCTGAG AGAATGATGC TAGTAAGAAA CTATAAGGTT TTGATATTAG ATATCACTGA TGTACCAAGA CTTGGAGTGA CCGCGACTCT GGCAATAGAA GATATGATGC AAGAAGCTAA AAATAATTCC AGAAAAGCAT TTGTTGCTGG GGCTAATGAA AAAGTAAAGG ATAGATTAGC TAAGTTTGGA GTTGAAGGCA TCATTGAGAC AAGAAAAGAA GCTTTAGAAA CCGCTCTAAA TGAAATAGCC TAA
|
Protein sequence | MKIINGFHLK NLRGDILGGI TAAVVALPLA LAFGNAALGP GGAIYGLYGA VVVGFLAALF GGTPAQVSGP TGPMSVTVAG VVAGLAAVGV PRDLSAGQIL PLVMAAVVIG GLLQILFGIL KLGKYITLVP YSVVSGFMSG IGVIIIALQI GPLLGISTRG GVVESLSTVF SNFQPNGAAI GVAIMTLGIV FLTPRKISQW VPSPLLALLI VTPISILIFG EGAIDRIGEI PRGVPSLNFP SFNQYFPIIF KAGLVLAVLG AIDSLLTSLV ADNISQTKHN SDRELIGQGI GNAVAGLFSG LPGAGATMRT VINVKSGGST PISGMVHSVV LLIVLVGAGP LAEQIPTALL AGILIKVGLD IIDWGFLRRA HKLSLKTSVV MYGVLLMTVF WDLIWAVLVG VFIANMLTID SITETQLEGM DEDNPLSKDD QAKNALPADE KALLDRCSGE VMLFRLKGPL SFGAAKGISE RMMLVRNYKV LILDITDVPR LGVTATLAIE DMMQEAKNNS RKAFVAGANE KVKDRLAKFG VEGIIETRKE ALETALNEIA
|
| |