Gene OSTLU_17829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17829 
Symbol 
ID5005148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp231899 
End bp232954 
Gene Length1056 bp 
Protein Length351 aa 
Translation table 
GC content56% 
IMG OID640420569 
ProductBASS family transporter: sodium ion/bile acid 
Protein accessionXP_001421092 
Protein GI145353590 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.00336169 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.02899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTGA GCGGTGGGCT CGGGTCGTGG GCCGCGATCG TGGGTAACGT GCTGTTATTC 
GCGCTGGTGC TCGGATTAGC GGTGACGGTG GAATTCGAGG CGTTTAAAAA GTCGTTGAGG
AGCCGAGGTT TGGTCATAGG CGTGTTTTCG CAGTTCGTCT TCTTACCGCT GTTTGGGTTT
ACGATGGCGC GGACGCTGCT GTGGGATCAA CCGGCGCTGG GAATACCGCT CATTATCACC
ACGAGCTCGC CTGGGGGTTC GTACAGTAAT TGGTGGACGT CACTGTTCAA CGCGGACCTG
TCGCTGAGCG TGGCGATGAC GACGGTGAGT TCGATTTTAT CGGTGGGATT TTTACCGCTG
AATCTGTACA TGTACACCGA GGGAGCGTAT CCTAACGGAA CGAATGTGCG GATGAAATGG
GGGGGACTGT TTCTGTCGAT CTCGATGGTG ATTAGCGGGA TCGTGTTGGG GTTGTTTGTC
GGTCGGCGAG CGCCGAAAGC GCGCGCGCCT CTCAACTTTG TGGGCAACGT GTGCGGCGTC
TTGCTTGTGC TCTTGGGGTT CTTCTTTTCG TCAAATTCTT CGGCGCCGAT TTGGGATCGC
GAATGGAAGT TCTACGTCGC CCTCATCGTG CCCTGCGTCG CTGGATTAGT GGTTAGTTTG
GGATTCGCCA AATTGACTCG GTTGGATATG CCTCAAGCCC TGGCGGTGGC CATCGAGGTG
TGCTATCAGA ACACCGCCAT CTCGTTGGCG GTGATTTTAA GCTCATTTGA GGACGATCCG
TCGTGCGCAT CGAGCGCAGG ATCGGCGTGT AACGTCGTCG GCGTCGCGAG CGGCGTGCCG
ACGTTTTATC AGTTCGTGCA GGTGTTTTCG CTTGGGATTT TGTGCTTGTT GGCGTGGAAG
AGCGGTTGGA CGTACGCGCC GAAAGGCACG CCTCTTCTCA CCGCGATAAG CAAATCGTTC
CAGCCGACGC ATCGCGATCC AGGAGCCTTT AGGGAAGTAG ATTTAGATAG AAGCGAGGAA
ATGAGCGAGA TGCCGCGCGA AGTAGAGAAT GTCTAG
 
Protein sequence
MAVSGGLGSW AAIVGNVLLF ALVLGLAVTV EFEAFKKSLR SRGLVIGVFS QFVFLPLFGF 
TMARTLLWDQ PALGIPLIIT TSSPGGSYSN WWTSLFNADL SLSVAMTTVS SILSVGFLPL
NLYMYTEGAY PNGTNVRMKW GGLFLSISMV ISGIVLGLFV GRRAPKARAP LNFVGNVCGV
LLVLLGFFFS SNSSAPIWDR EWKFYVALIV PCVAGLVVSL GFAKLTRLDM PQALAVAIEV
CYQNTAISLA VILSSFEDDP SCASSAGSAC NVVGVASGVP TFYQFVQVFS LGILCLLAWK
SGWTYAPKGT PLLTAISKSF QPTHRDPGAF REVDLDRSEE MSEMPREVEN V