Gene OSTLU_39202 details

Gene Information

Locus tag: OSTLU_39202
Symbol:
ID: 5004856
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 495262
End bp: 498219
Gene Length: 2958 bp
Protein Length: 634 aa
Translation table:
GC content: 57%
IMG OID: 640420277
Product: DASS family transporter: sodium ion/sulfate
Protein accession: XP_001420710
Protein GI: 145352770
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0471] Di- and tricarboxylate transporters
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.21287
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCATCG CGCTGCTGAA AAATGTCGCG GGGCCGGACG TGTTGATGCT CGGCGCGCTC 
GCGCTCGAGC TCGCGGCTTC GATCGTGGAC CTGGGCGAGG GATTGAAGGG GTTCAGCAAT
AAAGGCTTGC TCACCGTCGC GTGCTTGTTC GTCGTGGCGG CGGGGATATC GAACACGGGC
GCGCTGGACT ATTACATGAG CAAAGCCTTG GGGACGCCGA AGAGCGCGGC GGACGCGCAG
CTGCGGTTGA TGGTGCCGAT CGCGGTGGTG AGCGCGTTTT TGAATAATAC GCCGGTGGTG
GCGATCATGA TACCGATCGT GCAGAAGTGG TCGCGGAAGT GTAAGATTTC GTCGGCGCAG
TTGTTCATAC CGCTGTCGTT TAGCTCGATT TTAGGCGGCA CGTGCACGCT GATCGGGACG
TCGACGAATT TGGTCGTCGA TGGCATGCGA CGGGAGCGAT ATCCCAATGA ACGCGCGCTC
GGGTTGTTTG AGCTGTCGAA GTTTGGCGTG CCGGTGTTGC TGAGCGGGTT GGCGTACATG
CTCATCGCGG CGCCGGCATT GCTGCCGGGC GGCGCGCGAG ACGGCAATCG CGCGGGGGGG
CAAGACATGG ATATGGACGA TTTAACCGTA GGTGCAGTCG TTCCGCGTGG TTCACCCGTC
GTGGACGCCG AAGTCGCTGT CTTGCGCGGG TTGAATGGGT TGTATCTCGT CAGCGTGCAG
CGAGGCGACA TGCTGATGCG CGCAGTACAA CCAGATTTCA TTCTCGAGGT TGGTGACATC
TTGATCTTCA CCGGTCTCGT CGATCGCATA GGGGAAGTGT GTAAAACACA CGGCTTGGTG
CCGCTCACGC ACGAAGTGGA GGAGAAAATG TTAGAAGAGA AGAGGAATTC GTTCGACGAT
TATCGCAAGA CCAATGAACG CTCAGGTGAA AACGCCGAAA GCACGGTGAC GTTTCCTCCG
CGCGCGAGGA CGCCCTCGCT CACTGGTCAG TCGCAGGACG ACAGTGGTAT TCGAGGCGTT
GCGTGGATGC ATCGGGAGTC GAGAAAAGAT GGGCGAGCGA GTATAGAGTT GCACACGTAC
GAAAAGATGC TTCGCAAGTT CAAGTCGAGC GGCAACGACG AAGACGCCGA CGGGAGCGCG
AGCGATAAAG CCGCCACATC ACCGCCGAGC GAGCAGAGAA CGCAAAGTGG ACGCGCAAGT
TTAGATTACA ACTTCAAGCA CGGTGAGCGT CGACGACAAT TAAGCGACGA TATGCGTGAG
CATGACTATT TCACTCAGTC GGTTGAGGTC ATTCGCGCTC GCATACTCGA AAACTCTCCG
CTCGTCGGCA CAACGCCATC GGAGTGCAAG TTTAGGCACA AATTTAAAGC GTCCATAATG
GCGGTGCAGA GTATGGGCGA GAAATCAGGG ACAGGGAACG ACGTCCGCCG AGGGCATCTC
GGGACGACGA TTTTCCGAGC CAACGATATT CTTATGCTTC ATGTCTCTCA AGAGAGCCCA
CTTCTGGTGC AAAAGCAAGA ACGTGAAGCT GCGTCAGCGG AAAACTCGCG ACGTGGACTT
CCTTCGCCAG GTTCGCGACG GAATTTGTCC TTGGATGCCA TCGAGGCGTC GACGTACACG
TGCGGGAACG ATTTGGAGGT TTTGCTCACC GACGCGAAGA CCGAACTTAA CTTTAACGAG
TTTGCCATTC CCATGCGCGT CGTGAGTGGA GGGGCGTTGG ATGGCAAAAC CATCAGAGGC
GTTGATCTGG CGAACGTGCC GGGTTTATTT TGCACGGCAA TTCAATCGAA CGCGAGCAGC
GAAATGCGCA TACCGGACGC CGAGACAATT CTTAATGGTG AAGATATTTT GTGGTTTGCG
GGCGATATGA ACGGTATGCA GACGCTTCGA CGCATTCCTG GGTTAGTTTC GCAAGAAAAT
CAAGTGGAAA AGCTCAAGCA CGTCCGTAAA ACTGATCGAA GATTAGTGCA AGCCGTGATA
TCACAAGGGA GCTCGCTCGT GGGGAGATGC GTCCGTGACA CGCACTTCAG AACGCGTTAC
GACGCCGTCA TCATCGCTGT CCAACGCAGT GGAGGGCGTA TACAGGCGAG AATCGGCGAT
ATCGTACTCG AAGCGGGCGA CGTTTTGCTC CTGGACACGT CGACGAGTTT TTTGCTTGAT
CATAAGTCGG ATCCGACGTT TGCGCTCATC TCTGAAATCG AAGACTCCGC TCCGCCGCAA
TTCGACAAGC TTCTTCCTTC TCTCGGCACA GCAGTAGTGA TGATAGCTGT TTTTGTCGCC
GGTGCGCTCG ATCTCTTCGT CGCCGCGTTA CTGGCTTCGG GCGTCATGCT CGCCACGGGA
TGTCTGACAC AGGAGCAGGC GCGCGCGGCG GTCAAATGGG ACGTCATTGT CACGATAGCT
GCCGCGTTCG GAATTTCCGC GGCGATGGAG CAAAGTGGTG TCGCTGCCGG GATTGCATCG
ACGCTCGTGA GCGCCGGCAA CGGCATGGGT ACTGGGAAGC CGGGAATTCT CGTCGCCGTC
TACCTCGCCA CAGTTGTGTT GTGCAACATC GTCGGTAACA ACGCCGCCGC CGCGCTCATG
TACCCCATCG CCGCTGGCGC TGCGGAAAAG CAAGGGATTG ATAACGTTCA GATGAGCTAC
TTGCTCATGC TTGCCGCTTC GGCATCGTTC ATGTCTCCAT TTGGTTACCA AACCAACTTG
ATGGTGTACG GTCCAGGCGG CTACGTTTTC GCCAATTTCC TCAAGTTCGG CGCACCGATG
CAAGTAGTTC AGCTGATCGT CTCTGTTTCT GTTGTTCTAC TCGACAGCAA GTGGTGGATC
GGGTGGATCG TGGGTTTCGG CGCCATCCTC ACGATTTACC TAACGCGTCT CGTCGTGCCA
ATTATACGCA AACGGCGAGA CGCCGCGCGC GGCGGCTCGA CCGATTCGCG GCTCGACCCC
ATTCAGGAAG TTCTGTGA
 
Protein sequence
MFIALLKNVA GPDVLMLGAL ALELAASIVD LGEGLKGFSN KGLLTVACLF VVAAGISNTG 
ALDYYMSKAL GTPKSAADAQ LRLMVPIAVV SAFLNNTPVV AIMIPIVQKW SRKCKISSAQ
LFIPLSFSSI LGGTCTLIGT STNLVVDGMR RERYPNERAL GLFELSKFGV PVLLSGLAYM
LIAAPALLPG GARDGNRAGG QDMDMDDLTF AIPMRVVSGG ALDGKTIRGV DLANVPGLFC
TAIQSNASSE MRIPDAETIL NGEDILWFAG DMNGMQTLRR IPGLVSQENQ VEKLKHVRKT
DRRLVQAVIS QGSSLVGRCV RDTHFRTRYD AVIIAVQRSG GRIQARIGDI VLEAGDVLLL
DTSTSFLLDH KSDPTFALIS EIEDSAPPQF DKLLPSLGTA VVMIAVFVAG ALDLFVAALL
ASGVMLATGC LTQEQARAAV KWDVIVTIAA AFGISAAMEQ SGVAAGIAST LVSAGNGMGT
GKPGILVAVY LATVVLCNIV GNNAAAALMY PIAAGAAEKQ GIDNVQMSYL LMLAASASFM
SPFGYQTNLM VYGPGGYVFA NFLKFGAPMQ VVQLIVSVSV VLLDSKWWIG WIVGFGAILT
IYLTRLVVPI IRKRRDAARG GSTDSRLDPI QEVL