Gene OSTLU_18258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18258 
Symbol 
ID5005660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp37574 
End bp39127 
Gene Length1554 bp 
Protein Length517 aa 
Translation table 
GC content58% 
IMG OID640421081 
Productpredicted protein 
Protein accessionXP_001421556 
Protein GI145354574 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCGG CGCTCGCCGC TCGCGACCGA GCGCGAAGAG GGCTCACGAC ATCATCGGAC 
ACCGCTTCGG GTGTCGCGGT GGTGCAACGG ACCGATGAAG CGACGTCCAC GCTCGCGAAA
TCACGCCAGC CTCTGAACGA AAACAACACC ACCAAGGCGA AGCTTTCCAA GCTTTCCACG
GATATGGCGC CGCAGACGCC GTTCGCCTCG CCCCTCAGTC GCCCTCTCCC TTCGACACCG
TGCGGAAACG GTCGGGAACT CATCCTTCAA GGCTTCAATT GGGAGTCGTG CAACGAAAAA
GCGAACAACG ATCGATCTTG GTATCAACTG TTGAATGAAA AGGTTCCCGA AATCGCCGCC
GCGGGCTTCA CCTCCGTTTG GATGCCCCCG CCGACGAAAT CCGTGAGCAA GCAAGGGTAC
CTTCCCACCG ACTTGTACAA CTTGAACTCC TTCTACGGCT CAGAAGATGA GCTGAGATCA
TGCGTCGCTC GCATGCGCGA GTACAACATC ACGCCGGTGG CGGATATCGT CATCAATCAC
CGCTGCGCCG AGGCGCAGGA CGACGCGGGG CGATGGAACA AGTACACGGG GAAGATTGAT
TGGGATGCGC GCGCGATCAC GTGCGAGAAC CCGCAATTCG GGGGACAGGG ATCGCAAAGC
ACGGGCGAGG ACTATCTTCC CGCGCCGAAC ATCGATCACA CTCAACAGTT CGTCCGCAAG
GATTTGAAGG AGTGGCTTTC GTGGATGCGC GACGACGTCG GATTCCGCGG CTGGCGATTT
GATTTCGTCA AAGGCTACAG CGGCGTGTTC ACTGGAGAGT ACGTCGAAGA AACGCGTCCT
TTCTTATCGT TCGGCGAATT TTGGGACGAA TGCTCATACC GTGACGGGGT TTTGGAATAC
AATCAAGACG CGCACCGTCA ACGAACGTGC GACTGGGTGG ATTCCACCGG CGGTAACACG
GCGGCGTTTG ATTTCACCAC CAAGGGTATT CTGCAAGAAG CAGTCGCACG CACCGAGTAC
TGGCGATTGA TCGACACGAA AGGGCGCCCA CCGGGTTTTT GCGGCATGTG GCCCTCTCGC
GCCGTTACGT TCATCGAAAA TCACGACACG GGCTCGACGC TCCAGCACTG GCCGTTTCCG
AGGGATAAAA TATTGCAAGG ATACTGCTAC ATCCTCACCC ATCCGGGAAC ACCCACGGTG
TTCTACGACC ACTGGGTCGA CCCGAAGTGG TCGGAGGCCA TCGGCGTGAT GTTGGATATT
CGCAAGCGCA CCGGGCTTTC GTCCAACGCC GCCGCCGTGC ACATCGAGCG CGCCACCGCC
GGTCTTTACG CCGCGCACAT CGGGCACGCG CAAGAGATGT ACACCGAAGG GTTGAGCATG
GACGTCGATG TCAAACGTCC GTCGATCTGC ATGAAGCTCG GCGTCGAAGA CTGGTCGCCA
AACGCGGCTA AAGTTGGAAA CTTGTCGTGG AAGTGCACGG CGAGCGGCGA TGGATGGGCG
ATTTGGGAAG ACAAGAACCA TCTTGAAGAC GAAGAAATCA GCAAAGCGAG GTGA
 
Protein sequence
MEAALAARDR ARRGLTTSSD TASGVAVVQR TDEATSTLAK SRQPLNENNT TKAKLSKLST 
DMAPQTPFAS PLSRPLPSTP CGNGRELILQ GFNWESCNEK ANNDRSWYQL LNEKVPEIAA
AGFTSVWMPP PTKSVSKQGY LPTDLYNLNS FYGSEDELRS CVARMREYNI TPVADIVINH
RCAEAQDDAG RWNKYTGKID WDARAITCEN PQFGGQGSQS TGEDYLPAPN IDHTQQFVRK
DLKEWLSWMR DDVGFRGWRF DFVKGYSGVF TGEYVEETRP FLSFGEFWDE CSYRDGVLEY
NQDAHRQRTC DWVDSTGGNT AAFDFTTKGI LQEAVARTEY WRLIDTKGRP PGFCGMWPSR
AVTFIENHDT GSTLQHWPFP RDKILQGYCY ILTHPGTPTV FYDHWVDPKW SEAIGVMLDI
RKRTGLSSNA AAVHIERATA GLYAAHIGHA QEMYTEGLSM DVDVKRPSIC MKLGVEDWSP
NAAKVGNLSW KCTASGDGWA IWEDKNHLED EEISKAR