Gene OSTLU_33387 details

Gene Information

Locus tag: OSTLU_33387
Symbol:
ID: 5003593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 135099
End bp: 136256
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table:
GC content: 61%
IMG OID: 640419014
Product: BASS family transporter: sodium ion/bile acid
Protein accession: XP_001419499
Protein GI: 145350192
COG category: [R] General function prediction only
COG ID: [COG0385] Predicted Na+-dependent transporter
TIGRFAM ID:
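
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (the record does not state its convention) and an unspliced CDS whose last codon is a stop codon. The helper names `span_length` and `protein_length` are hypothetical and exist only for this illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp for 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the terminal stop codon encodes no residue


gene_len = span_length(135099, 136256)
print(gene_len)                  # 1158
print(protein_length(gene_len))  # 385
```

Both results agree with the Gene Length and Protein Length fields of this record.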


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.307134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCGCG GGCGTTCGGT GTCGACACCG CGCGAAGGCC GACACCGAAG ACGCGTCGAC 
GCGCGCTCGA GGGCGAGCGA GGACGATCCG GACGGCGAGG ACGATACCTC GGACGCGCGC
GGGAGCGACG ACGGCGTTCG GGCGGTGAAG GCGTGGTTGC GAGAGAACTT TTTCTTCGCG
GGCGTGGGGG CGGCGGCGTG CGCGAGCGCG TCGCCAGAGT TTCGCGACGT CGTGGAGTCG
TTCGGTGGTG CGGCGGGAGG GTCGTGGGGG TCGGGAGGGT TGGAAAAGTA TGCCATCGCA
GCCTTATTCT TCATCGCGGG CGTTGGATTA CCGGTGAGAG CGTTGAAGGA AGCCGCGAGC
GACGTGTCGC TCAACGCGTT TACGCAGGCG TTCATCTTCG TCTTTCCGAC GATCGTCATA
GCCGCAGCCG CGCCGGTTTT GATTGAATCG GGGTGGTTGA GCGAGAACGT GGTCGATGGT
TTATTCGTGT TGGCGTGCTT GCCCACGACA GTAGGTTCCG GCGTGGCGTT CACGCGGTCG
GCGAACGGGA ACGTCGAAGC CGCCTTGCTG AATTCCATGG CGGCGAACCT CGCGGGTATT
TTCTTGACCC CTGCGTTGAT ACATTTCTAT CTCGGCGCCG ACAGCTCGGT GGATCCGATC
GCATCGAGTT CGAAGCTGCT CGTTCAAGCA TTTTTACCCG TCGCTCTCGG TATGAGCTTG
CGTTTGATCC CGGGCGTGGC GTCCGCCGCC GAGGGCGGCT TGAAGGAGCC GAGCAAACTG
CTCGGCGATG CCATTTTGCT CGCCATCATC GCCAAAACCT TCGTCACAGC GGAACAAAGC
GAGGCGGGGA TGTTAGATTT CAACTCAAGC GCACACTTAG TGAGCGTCTT GTTGGTGTTC
ATGCTCTTGC ACAAGGGATC GATTTTTTTG GCGGCGTCTC GCGTCGGCGC CTTCTCGCGC
GAGGACGTCG TCTGCGCCCT ATACATGGGT TCGCACAAAA CCTTAGCGTT CGGCTTGCCT
TTGATATCGA CCACGTTCGA GGGCGATCCC AATCTCGCGT CTTATGTGCT TCCCTTGGTG
ATTTACCACC CCCTTCAAAT ATTCGCGAGC TCGCTCCTCG CGCGCCCTCT GGCGCGGTAC
GAGAAGCGGC GCGAATGA
 
Protein sequence
MRRGRSVSTP REGRHRRRVD ARSRASEDDP DGEDDTSDAR GSDDGVRAVK AWLRENFFFA 
GVGAAACASA SPEFRDVVES FGGAAGGSWG SGGLEKYAIA ALFFIAGVGL PVRALKEAAS
DVSLNAFTQA FIFVFPTIVI AAAAPVLIES GWLSENVVDG LFVLACLPTT VGSGVAFTRS
ANGNVEAALL NSMAANLAGI FLTPALIHFY LGADSSVDPI ASSSKLLVQA FLPVALGMSL
RLIPGVASAA EGGLKEPSKL LGDAILLAII AKTFVTAEQS EAGMLDFNSS AHLVSVLLVF
MLLHKGSIFL AASRVGAFSR EDVVCALYMG SHKTLAFGLP LISTTFEGDP NLASYVLPLV
IYHPLQIFAS SLLARPLARY EKRRE
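
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The sketch below shows one way to do that in Python, together with a GC-fraction helper and a simple translation. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. The function names are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1). This is an assumption:
# the record leaves its "Translation table" field blank.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last display lines of the gene sequence in this record.
first_line = "ATGCGGCGCG GGCGTTCGGT GTCGACACCG CGCGAAGGCC GACACCGAAG ACGCGTCGAC"
last_line = "GAGAAGCGGC GCGAATGA"

print(translate(first_line))  # MRRGRSVSTPREGRHRRRVD
print(translate(last_line))   # EKRRE*
```

The last line can be translated on its own only because the 19 preceding display lines hold 60 bases each, a multiple of three, so it starts on a codon boundary. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.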