Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32829 |
Symbol | SUT1 |
ID | 5002843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 709341 |
End bp | 711137 |
Gene Length | 1797 bp |
Protein Length | 509 aa |
Translation table | |
GC content | 65% |
IMG OID | 640418264 |
Product | SulP family transporter: sulfate; sulfate transporter of the Major Facilitator Superfamily (MFS) |
Protein accession | XP_001418781 |
Protein GI | 145348697 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGACGACGGC GACGACGACG ACGGCGACGG CGGCGGCGAC GGCGACGCGT CGCGTCGCGC GCGCGCGAAC GGCGACGCGC GTCGTCGTCG GCGGGCGCGA CCGACGGTTG GGCGCGACGC GCGCGACGGC GCCGGCGCGC GCGACGGCGA CGGCGCGCGC GACGGCGCGC GACGGACGCG AGATCGCGGA CGTCGCGCGC GAGGCGGCGA CGCAGGCGCT GAGCGGGATC ACGGTGTCGC TGGCGATGGT GCCGGAATCG CTGGCGTTCA CGTTCGTCGC GGGGACGACG CCGATCGTCG GCTTGCACGC GGCGGCGCTG ATGGGGCTGT GCACGGCGGC GCTGGGCGCG CAGCCGGGGG TGATCTCGGG CGCGGCGGGA GCGACGGCGG TGGTGATCGC GCCGCTCGTG GCGTCGCACG GGGTGGAGTA CTTGTTCGCG TGCGTGGCGC TCGCGGGCGC GGCGCAGGCG GCGTGCGGCG CGCTGCGGTT GGGGAAATTC ATCAGGCTCG TGCCGCAGCC GTGCATGATC GGATTCGTCA ATGGATTGGC GATCGTGATC GGGAAGTCGC AGCTGGAGAC GTTCGTCGGG CTGACGGGGA CGACGCTCGC GACGACGATC GGGCTCACGG CGTTTACGAT GGCGCTCATC AAACTGCTGC CGAGGGCGTC GTTCGCGCCC AAGGGTGTTC CGGCGCCGTT GTTGGCCATC GCGTCGTGCG CGACGCTGAC GAATGTGATG AAAATCGCGA CGAAGACGGT GGGCGACGTC GCGCCCGTGA GCGGGGCGTT GCCGTCGTTT CACATTCCAA ACGTTCCGGC GAGCTTGGAA ACGCTCGCCG TCATCGCGCC TTACGCGTTA TCCGTCGCCG CGGTGGGCTT GATCGAGACC TTGCTCACCC AACAACTCGT AGATGACATC ACCGAGCGAC GGACGGCGAC GCACACCGAG TGCATCGCGC AAGGCGTTGG AAATATGGTG AACGGCGCGT TCGGGGCGAT GGGCGGGTGC GCCATGATCG GGCAGTCGAT GATCAACGTC AATTCCGGTG GTCGCACGCG CGTGGCCGGG ATCACGTGCG CGCTCGCGAT TTTAAGTTAC GTCACCTTTG GCGCTTCGTT CATCGAACGC ATTCCCATGG CCGCTTTAAC GGGGACGATG TTATGCTTGG TTTTCGATAT TTTCGATTGG ACGTCGTTTT CGCGAGTGAA AAAGATTCCC AAGACCGACG CCGTCGTTTT GCTGCTCGTC ACCGGCGTCA CAGTGGTGAC AAACTTGGCG GTCGCCGTGT TCGCTGGCGT CGTCTTGTCC GCGCTCGGTT TCGCTTGGAA ATCTTCTCAG CGCATCGACG TCGTGCGCTC GCGAACGAGT TCCTCCGAGA CGCTGTGCGA GTTGTACGGA CCGCTCTTTT TCGGTTCGGT GCAAAGCTTC GCGGACAAGC TCGATCCGCG CGACGAGGAA CTCGATCGAG TGGTGCTAGA CTTCGCCCAT AGCAAAGTGT GGGATTCTTC TGCGCTCGTC GCCATCGATG AGTTGGCGGA AAAGTACCGA AACTGCGGTA AGACGCTCAC GCTCCGTCAC TTGTCGCCGG ATTGCGCAAA ATTGCTGAAG AAGGGAGGCG ACTTGGTCGA AGTCGACGTC GACACGGATC CCGTGTACGC GGTCGCGGCG AATTTGGACG CAGAGACGCT GTCCGCCGTC ACGAAGACGT TGGGTGGACG GAAAGGACTT TCGCTGGCGG AGGAAAATGC GTTGAAGCGA CAGTATTTGC GTTGATGATC TTGCCTATCT ATTGTAC
|
Protein sequence | MVPESLAFTF VAGTTPIVGL HAAALMGLCT AALGAQPGVI SGAAGATAVV IAPLVASHGV EYLFACVALA GAAQAACGAL RLGKFIRLVP QPCMIGFVNG LAIVIGKSQL ETFVGLTGTT LATTIGLTAF TMALIKLLPR ASFAPKGVPA PLLAIASCAT LTNVMKIATK TVGDVAPVSG ALPSFHIPNV PASLETLAVI APYALSVAAV GLIETLLTQQ LVDDITERRT ATHTECIAQG VGNMVNGAFG AMGGCAMIGQ SMINVNSGGR TRVAGITCAL AILSYVTFGA SFIERIPMAA LTGTMLCLVF DIFDWTSFSR VKKIPKTDAV VLLLVTGVTV VTNLAVAVFA GVVLSALGFA WKSSQRIDVV RSRTSSSETL CELYGPLFFG SVQSFADKLD PRDEELDRVV LDFAHSKVWD SSALVAIDEL AEKYRNCGKT LTLRHLSPDC AKLLKKGGDL VEVDVDTDPV YAVAANLDAE TLSAVTKTLG GRKGLSLAEE NALKRQYLR
|
| |