Gene OSTLU_1343 details

Gene Information

Locus tag: OSTLU_1343
Symbol:
ID: 5006147
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009371
Strand:
Start bp: 61588
End bp: 64695
Gene Length: 3108 bp
Protein Length: 583 aa
Translation table:
GC content: 59%
IMG OID: 640421568
Product: DASS family transporter: sodium ion/sulfate
Protein accession: XP_001422192
Protein GI: 145355916
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0471] Di- and tricarboxylate transporters
TIGRFAM ID:
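
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, which agrees with the listed Gene Length; the function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields.
gene_length = span_length(61588, 64695)
print(gene_length)  # 3108, matching the listed Gene Length

# A 583 aa protein corresponds to 583 * 3 coding bases (stop codon not counted).
# This is fewer than the 3108 bp span, consistent with "Is gene spliced: Yes".
coding_bases = 583 * 3
print(coding_bases)  # 1749
```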


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.351628
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.572139
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTCCTGCGAG ACTACGTCGC CGCGGACTGG ACGATGATGC TCGCCGTCGC GACGCTCAAC 
CTGTGCGGCG TCATCACGCT CGCCGAATCC CTCGCCGGGT TCGCGAACGA GGGATTGCTC
ACGGTCGGCG CGCTGTTCGT CGTCGCGGCG GGGATCAGCG CGACGGGGGG GCTGGACTGG
TACATGGGGA AGGTGCTGGG GAAGCCGCGA ACCCCGGCGG GGGCGCAGTT GCGATTGATG
CTGCCGATCG CGTGCGTGAG TGGATTTTTG AATAACACGC CCGTGGTGGC GGTGATGATA
CCGATCGTGC TGCGATGGGC CGAGGCCACG GGAATGGCGA AGGAGCAGTT GATGATTCCG
TTGTCGTTCG CGAGCGTGCT CGGCGGCACG TGCACGCTCA TCGGAACGTC GACGAATTTA
GTGGTGCAGG GAATGGTGGA GACGTGGACG CGCGAGCACG CGGGGCGAGG GGGGGAGGTG
AAGATCGGTC TGTTTGATCT CGGGCTGTAC GGCGTGCCCG TGGCGTTGGC GGGGATAGCG
TACGTGTTGC TGGCGTCGCC GTTTTTGTTG CCAAAAGGCG CGCGGCGAAT CGGAAGCGGA
CCGAGGGATC AACGGCGCGG AGAGGACGAA GAGGATTTAC TCGTCGGTGC GCGCGTCGAG
GGGTGGTCGC CCGCGGTCGG GCACACGGTC GCCGCGAGCG GATTGCGAGG TTTGCCGGGA
TTGTACCTGG TGAGCGTGCG CAGAAACCAA GCGTTGTTGC GCGCCATCGG ACCAGAGTTC
ATCCTGAACC AGGGCGATAT TTTGTATTTT ACCGGGATGA TCGAGTCGCT CGGGAAGGTG
TGCGCGGAGT ATGGGTTGAT GGCGATCACG CAAGAGTACG ACGAAGACGA AGACGAAGGC
GCGAGAGAGA CGTCGTTTGA CGGAGGTGAC GAGAAGGACG TCGTCGCCAT GCACTCGGCG
AGCAGCATTG CGGATTTACA AAAGCTCGCG CGCACGCACG CCATTTACAA AGAATCGGAA
GGTGGAATTT ATGCCTTGCG GAAGCGTCGA CAAAAGGCTC GACCTCGCAG ATCGCACGGA
TCGAACCAAG ACTCGAGCGA CGCTGGGAGC GCGCTCCCGC TGAGCCCGGG GTATTCATCC
TCCGACGGCA ACGCGTATAC TTCTGGTGGC GCCAGTGATT CTGACGACGG CACCGCACGA
GCGATCAAAG CTCAGGCGTT GCGGATGTAC GGCGAAAATC TGAAGAATGC TCTCGGTGAA
GCTCGCGGCG ACGGTGACGC GATGATTAGC GCCAAGGAGA GCGAAACCGC CCTGGGGCCG
CCGTTGGTGA CCGTGGATGT CGATCCCGAT GCCGATCAAA ATAACCCCGA ACACGTTGGA
CGTATGGTGC TCGGAATTAG TGCGAACGAT CGACCCGGGT TGTTGCACGA CATCTCGCAG
GCGTTGAATC GCTTGCGTGT GCAGTTGTTA CATTGCGAAG CTTCGGCGGT GGGTTCGCGA
TCGGTTTCTA TTTGGCGCTT GCAAGTGTTG CAAAACACCA CGACGAAAGA GGAGATCACG
ACGGTGATCA AAACTTTACT CGAACCCGTG ACTGGGATTG AAGCCGTGAA AAGGCGCGGG
CAATCCGTCG TTCGATGCAG TGTGAAACCG ACGTCATCGC TCGTGGGAAA GACTCCAGGA
GAGGCGGATT TGCGCGTCAC GTACGGCGGC GCCATCATCG CTGTGCAACG TGGCGGTCGC
GCGCCACCGG GTAAGCTAAA CGCGCTGATT TTTCAAGTGG GTGATACTCT CGTATTACAG
GCGCAGGATA GTTCACCTTT GATCAAGCTC TTGGAGGAAT CTGAAGACGT CGTCGTTGAG
GAGGACGAGC GCTTGGCGAA GACTCGCGAG GATTTAGAAA TCATCGGTCA TGGATCGGGA
ACGATGAACA CAGGGAAAGA GTTCCTCATC GCCGTGCGCA TTGAAGCTAC GGCGAAGAAT
TTCATAGGCA AGACGGCGGT CGAGAGCGGC TTGCGGTCGT TGCCGGGTTT GTTTTTGGTA
TCAATCGAGC GCACGCGATC CATCGTGAGC GTCGGCGCAG CGGCTGTCGT CGCCATGGTT
GAATCTCCCG CGGACAGAAC CACTGTGATC GATCCGAGTG AGCCTTTAGA AGCAAATGAC
GTGATGTGGT ACGCCGGTGG CGCCAACGCC ATCGCCAGTC TTCGTCGGGT TCCGGGGCTG
GCGCCGTACT CGAGCGACCA AGTGGACAAG CTCGAAATCT CCAGTCACGA CCGTCGATTG
ATTCAAGCCG TGGTCGCGAA AACTGGAGAT TTGGTCGGTA AGTCTATTCG AGACATCAAA
TTTCGCACGC GATTCAACGC AGTCGTCATC GCCGTCCATC GCGAGGGCGC GCGCGTGCAC
TCAAGAATTG GCGACGTCGT CTTGCACCCC GGCGACGTGC TTTTACTCGA CGCCGGTGAA
GATTTCAAAC AAAGCGCCCG GGCGCAGGGC GCGTTCGCTC TGATCAGTGT TTTGGACGAT
TCTACGCCCC CTCGTTTGCG ATTACTCATC CCCTCGCTCT TGTGCGCGCT GGCAATGATT
AGTTTGTACA CCGCCGGCGT CATGGAGCTC TTCACCGCCG CGGTGCTCGC GGCGGCTGTG
ATGATCGCCT CTGGCACTCT GACGCAGCAA GAGGCTCGAA ACGCCATAAA ATGGGACGTC
ATAGTCACCA TCGCCGGTGC TTTTGGAATT TCGCGCGCCA TGCAAAACAG TGGCGTCGCC
GAAGCCGTGG CGAAGAAACT CGTCGCGTTG GGTCGCGTCA CGAACACGGG TGAAATCGGC
TTGCTCGTCG CCGTCTACCT CGCCACGTTC TTGATCTCCA ACATCGTGAC GAACAACGCC
GCCGCGGCTT TGATGCTTCC CATCGCCGCC AGCGCCGCGG AATCTGAAAA CATCGCGCTC
GAAAAGATGG CCTTCTTACT CATGCTCGCC GCCTCGGCGT CCTTCATGTC GCCCTTCGGA
TACCAAACCA ACCTCATGGT GTACGGCCCG GGCGGCTACG TGTTCGCCGA CTTTATCAAA
TTTGGTTTCC CCATGCAGAT CACGCTCTTG ATCGTCAGTA TCGTCGTC
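
The gene sequence above is displayed in space-separated blocks of ten bases, so it needs to be joined before any length or composition check. The following is a minimal sketch of that step, not the database's own method; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied as shown on the page.
first_line = "CTCCTGCGAG ACTACGTCGC CGCGGACTGG ACGATGATGC TCGCCGTCGC GACGCTCAAC"
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases: six blocks of ten
print(gc_fraction(seq))  # GC fraction of this one line only
```

Applying the same two helpers to the full block would allow the listed Gene Length and GC content to be compared against the displayed sequence.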
 
Protein sequence
LLRDYVAADW TMMLAVATLN LCGVITLAES LAGFANEGLL TVGALFVVAA GISATGGLDW 
YMGKVLGKPR TPAGAQLRLM LPIACVSGFL NNTPVVAVMI PIVLRWAEAT GMAKEQLMIP
LSFASVLGGT CTLIGTSTNL VVQGMVETWT REHAGRGGEV KIGLFDLGLY GVPVALAGIA
YVLLASPFLL PKGARRIGSG PRDQRRGEDE EDLLVGARVE GWSPAVGHTV AASGLRGLPG
LYLVSVRRNQ ALLRAIGPEF ILNQGDILYF TGMIESLGKV LRRVPGLAPY SSDQVDKLEI
SSHDRRLIQA VVAKTGDLVG KSIRDIKFRT RFNAVVIAVH REGARVHSRI GDVVLHPGDV
LLLDAGEDFK QSARAQGAFA LISVLDDSTP PRLRLLIPSL LCALAMISLY TAGVMELFTA
AVLAAAVMIA SGTLTQQEAR NAIKWDVIVT IAGAFGISRA MQNSGVAEAV AKKLVALGRV
TNTGEIGLLV AVYLATFLIS NIVTNNAAAA LMLPIAASAA ESENIALEKM AFLLMLAASA
SFMSPFGYQT NLMVYGPGGY VFADFIKFGF PMQITLLIVS IVV