Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_1343 |
Symbol | |
ID | 5006147 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | - |
Start bp | 61588 |
End bp | 64695 |
Gene Length | 3108 bp |
Protein Length | 583 aa |
Translation table | |
GC content | 59% |
IMG OID | 640421568 |
Product | DASS family transporter: sodium ion/sulfate |
Protein accession | XP_001422192 |
Protein GI | 145355916 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.351628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.572139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCCTGCGAG ACTACGTCGC CGCGGACTGG ACGATGATGC TCGCCGTCGC GACGCTCAAC CTGTGCGGCG TCATCACGCT CGCCGAATCC CTCGCCGGGT TCGCGAACGA GGGATTGCTC ACGGTCGGCG CGCTGTTCGT CGTCGCGGCG GGGATCAGCG CGACGGGGGG GCTGGACTGG TACATGGGGA AGGTGCTGGG GAAGCCGCGA ACCCCGGCGG GGGCGCAGTT GCGATTGATG CTGCCGATCG CGTGCGTGAG TGGATTTTTG AATAACACGC CCGTGGTGGC GGTGATGATA CCGATCGTGC TGCGATGGGC CGAGGCCACG GGAATGGCGA AGGAGCAGTT GATGATTCCG TTGTCGTTCG CGAGCGTGCT CGGCGGCACG TGCACGCTCA TCGGAACGTC GACGAATTTA GTGGTGCAGG GAATGGTGGA GACGTGGACG CGCGAGCACG CGGGGCGAGG GGGGGAGGTG AAGATCGGTC TGTTTGATCT CGGGCTGTAC GGCGTGCCCG TGGCGTTGGC GGGGATAGCG TACGTGTTGC TGGCGTCGCC GTTTTTGTTG CCAAAAGGCG CGCGGCGAAT CGGAAGCGGA CCGAGGGATC AACGGCGCGG AGAGGACGAA GAGGATTTAC TCGTCGGTGC GCGCGTCGAG GGGTGGTCGC CCGCGGTCGG GCACACGGTC GCCGCGAGCG GATTGCGAGG TTTGCCGGGA TTGTACCTGG TGAGCGTGCG CAGAAACCAA GCGTTGTTGC GCGCCATCGG ACCAGAGTTC ATCCTGAACC AGGGCGATAT TTTGTATTTT ACCGGGATGA TCGAGTCGCT CGGGAAGGTG TGCGCGGAGT ATGGGTTGAT GGCGATCACG CAAGAGTACG ACGAAGACGA AGACGAAGGC GCGAGAGAGA CGTCGTTTGA CGGAGGTGAC GAGAAGGACG TCGTCGCCAT GCACTCGGCG AGCAGCATTG CGGATTTACA AAAGCTCGCG CGCACGCACG CCATTTACAA AGAATCGGAA GGTGGAATTT ATGCCTTGCG GAAGCGTCGA CAAAAGGCTC GACCTCGCAG ATCGCACGGA TCGAACCAAG ACTCGAGCGA CGCTGGGAGC GCGCTCCCGC TGAGCCCGGG GTATTCATCC TCCGACGGCA ACGCGTATAC TTCTGGTGGC GCCAGTGATT CTGACGACGG CACCGCACGA GCGATCAAAG CTCAGGCGTT GCGGATGTAC GGCGAAAATC TGAAGAATGC TCTCGGTGAA GCTCGCGGCG ACGGTGACGC GATGATTAGC GCCAAGGAGA GCGAAACCGC CCTGGGGCCG CCGTTGGTGA CCGTGGATGT CGATCCCGAT GCCGATCAAA ATAACCCCGA ACACGTTGGA CGTATGGTGC TCGGAATTAG TGCGAACGAT CGACCCGGGT TGTTGCACGA CATCTCGCAG GCGTTGAATC GCTTGCGTGT GCAGTTGTTA CATTGCGAAG CTTCGGCGGT GGGTTCGCGA TCGGTTTCTA TTTGGCGCTT GCAAGTGTTG CAAAACACCA CGACGAAAGA GGAGATCACG ACGGTGATCA AAACTTTACT CGAACCCGTG ACTGGGATTG AAGCCGTGAA AAGGCGCGGG CAATCCGTCG TTCGATGCAG TGTGAAACCG ACGTCATCGC TCGTGGGAAA GACTCCAGGA GAGGCGGATT TGCGCGTCAC GTACGGCGGC GCCATCATCG CTGTGCAACG TGGCGGTCGC GCGCCACCGG GTAAGCTAAA CGCGCTGATT TTTCAAGTGG GTGATACTCT CGTATTACAG GCGCAGGATA GTTCACCTTT GATCAAGCTC TTGGAGGAAT CTGAAGACGT CGTCGTTGAG GAGGACGAGC GCTTGGCGAA GACTCGCGAG GATTTAGAAA TCATCGGTCA TGGATCGGGA ACGATGAACA CAGGGAAAGA GTTCCTCATC GCCGTGCGCA TTGAAGCTAC GGCGAAGAAT TTCATAGGCA AGACGGCGGT CGAGAGCGGC TTGCGGTCGT TGCCGGGTTT GTTTTTGGTA TCAATCGAGC GCACGCGATC CATCGTGAGC GTCGGCGCAG CGGCTGTCGT CGCCATGGTT GAATCTCCCG CGGACAGAAC CACTGTGATC GATCCGAGTG AGCCTTTAGA AGCAAATGAC GTGATGTGGT ACGCCGGTGG CGCCAACGCC ATCGCCAGTC TTCGTCGGGT TCCGGGGCTG GCGCCGTACT CGAGCGACCA AGTGGACAAG CTCGAAATCT CCAGTCACGA CCGTCGATTG ATTCAAGCCG TGGTCGCGAA AACTGGAGAT TTGGTCGGTA AGTCTATTCG AGACATCAAA TTTCGCACGC GATTCAACGC AGTCGTCATC GCCGTCCATC GCGAGGGCGC GCGCGTGCAC TCAAGAATTG GCGACGTCGT CTTGCACCCC GGCGACGTGC TTTTACTCGA CGCCGGTGAA GATTTCAAAC AAAGCGCCCG GGCGCAGGGC GCGTTCGCTC TGATCAGTGT TTTGGACGAT TCTACGCCCC CTCGTTTGCG ATTACTCATC CCCTCGCTCT TGTGCGCGCT GGCAATGATT AGTTTGTACA CCGCCGGCGT CATGGAGCTC TTCACCGCCG CGGTGCTCGC GGCGGCTGTG ATGATCGCCT CTGGCACTCT GACGCAGCAA GAGGCTCGAA ACGCCATAAA ATGGGACGTC ATAGTCACCA TCGCCGGTGC TTTTGGAATT TCGCGCGCCA TGCAAAACAG TGGCGTCGCC GAAGCCGTGG CGAAGAAACT CGTCGCGTTG GGTCGCGTCA CGAACACGGG TGAAATCGGC TTGCTCGTCG CCGTCTACCT CGCCACGTTC TTGATCTCCA ACATCGTGAC GAACAACGCC GCCGCGGCTT TGATGCTTCC CATCGCCGCC AGCGCCGCGG AATCTGAAAA CATCGCGCTC GAAAAGATGG CCTTCTTACT CATGCTCGCC GCCTCGGCGT CCTTCATGTC GCCCTTCGGA TACCAAACCA ACCTCATGGT GTACGGCCCG GGCGGCTACG TGTTCGCCGA CTTTATCAAA TTTGGTTTCC CCATGCAGAT CACGCTCTTG ATCGTCAGTA TCGTCGTC
|
Protein sequence | LLRDYVAADW TMMLAVATLN LCGVITLAES LAGFANEGLL TVGALFVVAA GISATGGLDW YMGKVLGKPR TPAGAQLRLM LPIACVSGFL NNTPVVAVMI PIVLRWAEAT GMAKEQLMIP LSFASVLGGT CTLIGTSTNL VVQGMVETWT REHAGRGGEV KIGLFDLGLY GVPVALAGIA YVLLASPFLL PKGARRIGSG PRDQRRGEDE EDLLVGARVE GWSPAVGHTV AASGLRGLPG LYLVSVRRNQ ALLRAIGPEF ILNQGDILYF TGMIESLGKV LRRVPGLAPY SSDQVDKLEI SSHDRRLIQA VVAKTGDLVG KSIRDIKFRT RFNAVVIAVH REGARVHSRI GDVVLHPGDV LLLDAGEDFK QSARAQGAFA LISVLDDSTP PRLRLLIPSL LCALAMISLY TAGVMELFTA AVLAAAVMIA SGTLTQQEAR NAIKWDVIVT IAGAFGISRA MQNSGVAEAV AKKLVALGRV TNTGEIGLLV AVYLATFLIS NIVTNNAAAA LMLPIAASAA ESENIALEKM AFLLMLAASA SFMSPFGYQT NLMVYGPGGY VFADFIKFGF PMQITLLIVS IVV
|
| |