Gene Information |
Locus tag | OSTLU_39202 |
Symbol | |
ID | 5004856 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 495262 |
End bp | 498219 |
Gene Length | 2958 bp |
Protein Length | 634 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420277 |
Product | DASS family transporter: sodium ion/sulfate |
Protein accession | XP_001420710 |
Protein GI | 145352770 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | |
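The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the bases not accounted for by the 634-residue protein plus one stop codon are intronic, which fits the "Is gene spliced | Yes" field. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 495262
end_bp = 498219
protein_length_aa = 634

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Assumption: the remainder is intronic, since the record marks the gene as spliced.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # prints: 2958 1905 1053
```

Under these assumptions the computed span matches the listed gene length of 2958 bp, and about 1053 bp of the gene does not appear in the translated product.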
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.21287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTCATCG CGCTGCTGAA AAATGTCGCG GGGCCGGACG TGTTGATGCT CGGCGCGCTC GCGCTCGAGC TCGCGGCTTC GATCGTGGAC CTGGGCGAGG GATTGAAGGG GTTCAGCAAT AAAGGCTTGC TCACCGTCGC GTGCTTGTTC GTCGTGGCGG CGGGGATATC GAACACGGGC GCGCTGGACT ATTACATGAG CAAAGCCTTG GGGACGCCGA AGAGCGCGGC GGACGCGCAG CTGCGGTTGA TGGTGCCGAT CGCGGTGGTG AGCGCGTTTT TGAATAATAC GCCGGTGGTG GCGATCATGA TACCGATCGT GCAGAAGTGG TCGCGGAAGT GTAAGATTTC GTCGGCGCAG TTGTTCATAC CGCTGTCGTT TAGCTCGATT TTAGGCGGCA CGTGCACGCT GATCGGGACG TCGACGAATT TGGTCGTCGA TGGCATGCGA CGGGAGCGAT ATCCCAATGA ACGCGCGCTC GGGTTGTTTG AGCTGTCGAA GTTTGGCGTG CCGGTGTTGC TGAGCGGGTT GGCGTACATG CTCATCGCGG CGCCGGCATT GCTGCCGGGC GGCGCGCGAG ACGGCAATCG CGCGGGGGGG CAAGACATGG ATATGGACGA TTTAACCGTA GGTGCAGTCG TTCCGCGTGG TTCACCCGTC GTGGACGCCG AAGTCGCTGT CTTGCGCGGG TTGAATGGGT TGTATCTCGT CAGCGTGCAG CGAGGCGACA TGCTGATGCG CGCAGTACAA CCAGATTTCA TTCTCGAGGT TGGTGACATC TTGATCTTCA CCGGTCTCGT CGATCGCATA GGGGAAGTGT GTAAAACACA CGGCTTGGTG CCGCTCACGC ACGAAGTGGA GGAGAAAATG TTAGAAGAGA AGAGGAATTC GTTCGACGAT TATCGCAAGA CCAATGAACG CTCAGGTGAA AACGCCGAAA GCACGGTGAC GTTTCCTCCG CGCGCGAGGA CGCCCTCGCT CACTGGTCAG TCGCAGGACG ACAGTGGTAT TCGAGGCGTT GCGTGGATGC ATCGGGAGTC GAGAAAAGAT GGGCGAGCGA GTATAGAGTT GCACACGTAC GAAAAGATGC TTCGCAAGTT CAAGTCGAGC GGCAACGACG AAGACGCCGA CGGGAGCGCG AGCGATAAAG CCGCCACATC ACCGCCGAGC GAGCAGAGAA CGCAAAGTGG ACGCGCAAGT TTAGATTACA ACTTCAAGCA CGGTGAGCGT CGACGACAAT TAAGCGACGA TATGCGTGAG CATGACTATT TCACTCAGTC GGTTGAGGTC ATTCGCGCTC GCATACTCGA AAACTCTCCG CTCGTCGGCA CAACGCCATC GGAGTGCAAG TTTAGGCACA AATTTAAAGC GTCCATAATG GCGGTGCAGA GTATGGGCGA GAAATCAGGG ACAGGGAACG ACGTCCGCCG AGGGCATCTC GGGACGACGA TTTTCCGAGC CAACGATATT CTTATGCTTC ATGTCTCTCA AGAGAGCCCA CTTCTGGTGC AAAAGCAAGA ACGTGAAGCT GCGTCAGCGG AAAACTCGCG ACGTGGACTT CCTTCGCCAG GTTCGCGACG GAATTTGTCC TTGGATGCCA TCGAGGCGTC GACGTACACG TGCGGGAACG ATTTGGAGGT TTTGCTCACC GACGCGAAGA CCGAACTTAA CTTTAACGAG TTTGCCATTC CCATGCGCGT CGTGAGTGGA GGGGCGTTGG ATGGCAAAAC CATCAGAGGC GTTGATCTGG CGAACGTGCC GGGTTTATTT TGCACGGCAA TTCAATCGAA CGCGAGCAGC 
GAAATGCGCA TACCGGACGC CGAGACAATT CTTAATGGTG AAGATATTTT GTGGTTTGCG GGCGATATGA ACGGTATGCA GACGCTTCGA CGCATTCCTG GGTTAGTTTC GCAAGAAAAT CAAGTGGAAA AGCTCAAGCA CGTCCGTAAA ACTGATCGAA GATTAGTGCA AGCCGTGATA TCACAAGGGA GCTCGCTCGT GGGGAGATGC GTCCGTGACA CGCACTTCAG AACGCGTTAC GACGCCGTCA TCATCGCTGT CCAACGCAGT GGAGGGCGTA TACAGGCGAG AATCGGCGAT ATCGTACTCG AAGCGGGCGA CGTTTTGCTC CTGGACACGT CGACGAGTTT TTTGCTTGAT CATAAGTCGG ATCCGACGTT TGCGCTCATC TCTGAAATCG AAGACTCCGC TCCGCCGCAA TTCGACAAGC TTCTTCCTTC TCTCGGCACA GCAGTAGTGA TGATAGCTGT TTTTGTCGCC GGTGCGCTCG ATCTCTTCGT CGCCGCGTTA CTGGCTTCGG GCGTCATGCT CGCCACGGGA TGTCTGACAC AGGAGCAGGC GCGCGCGGCG GTCAAATGGG ACGTCATTGT CACGATAGCT GCCGCGTTCG GAATTTCCGC GGCGATGGAG CAAAGTGGTG TCGCTGCCGG GATTGCATCG ACGCTCGTGA GCGCCGGCAA CGGCATGGGT ACTGGGAAGC CGGGAATTCT CGTCGCCGTC TACCTCGCCA CAGTTGTGTT GTGCAACATC GTCGGTAACA ACGCCGCCGC CGCGCTCATG TACCCCATCG CCGCTGGCGC TGCGGAAAAG CAAGGGATTG ATAACGTTCA GATGAGCTAC TTGCTCATGC TTGCCGCTTC GGCATCGTTC ATGTCTCCAT TTGGTTACCA AACCAACTTG ATGGTGTACG GTCCAGGCGG CTACGTTTTC GCCAATTTCC TCAAGTTCGG CGCACCGATG CAAGTAGTTC AGCTGATCGT CTCTGTTTCT GTTGTTCTAC TCGACAGCAA GTGGTGGATC GGGTGGATCG TGGGTTTCGG CGCCATCCTC ACGATTTACC TAACGCGTCT CGTCGTGCCA ATTATACGCA AACGGCGAGA CGCCGCGCGC GGCGGCTCGA CCGATTCGCG GCTCGACCCC ATTCAGGAAG TTCTGTGA
|
Protein sequence | MFIALLKNVA GPDVLMLGAL ALELAASIVD LGEGLKGFSN KGLLTVACLF VVAAGISNTG ALDYYMSKAL GTPKSAADAQ LRLMVPIAVV SAFLNNTPVV AIMIPIVQKW SRKCKISSAQ LFIPLSFSSI LGGTCTLIGT STNLVVDGMR RERYPNERAL GLFELSKFGV PVLLSGLAYM LIAAPALLPG GARDGNRAGG QDMDMDDLTF AIPMRVVSGG ALDGKTIRGV DLANVPGLFC TAIQSNASSE MRIPDAETIL NGEDILWFAG DMNGMQTLRR IPGLVSQENQ VEKLKHVRKT DRRLVQAVIS QGSSLVGRCV RDTHFRTRYD AVIIAVQRSG GRIQARIGDI VLEAGDVLLL DTSTSFLLDH KSDPTFALIS EIEDSAPPQF DKLLPSLGTA VVMIAVFVAG ALDLFVAALL ASGVMLATGC LTQEQARAAV KWDVIVTIAA AFGISAAMEQ SGVAAGIAST LVSAGNGMGT GKPGILVAVY LATVVLCNIV GNNAAAALMY PIAAGAAEKQ GIDNVQMSYL LMLAASASFM SPFGYQTNLM VYGPGGYVFA NFLKFGAPMQ VVQLIVSVSV VLLDSKWWIG WIVGFGAILT IYLTRLVVPI IRKRRDAARG GSTDSRLDPI QEVL
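The sequences above are printed in space-separated blocks of ten letters, so they need light cleanup before any analysis. The following is a minimal sketch, not the database's own method, of stripping the block separators and computing a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that separate the 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases among all bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above, used as a small sample.
sample = clean_sequence("ATGTTCATCG CGCTGCTGAA AAATGTCGCG")
print(len(sample), gc_fraction(sample))  # prints: 30 0.5
```

Passing the full gene sequence through `clean_sequence` first would let `gc_fraction` be compared against the 57% listed in the record.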