Gene OSTLU_50249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50249 
Symbol 
ID5003434 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp208071 
End bp209755 
Gene Length1685 bp 
Protein Length495 aa 
Translation table 
GC content54% 
IMG OID640418855 
Productpredicted protein 
Protein accessionXP_001419309 
Protein GI145349786 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.430177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.311045 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGATG AATTGTCGCA CTATTTTGTG AATTCTTCGC ATAATACGTA CTTGGACGGC 
GGACAGTTGT TCTCGAGGTC GACGAGTTCG GCCATCGCGT GGGCGCTCGA ACGCGGGTGC
AGAGTCGTCG AGCTGGACTG TTACGACGGA GGGTCGAAGG GGCCGATCAT CACGCACGGA
GGGACGGCGG TGCGCCCGAT GTTGTTTAAG GACGCGATCG CGGTTATCAA CGACAGGGCA
CACGTAGCTA GCGAGTATCC GGTGATTGTG ACTTTGGAGA ATCACGCGAG TCGGGAGACG
CGCGCGGTGA TGGCGAAAAT CATGCGACAC ACGTTCGGCG ATAAACTCTG GACGCCGCCG
TCGAAAGGCG AAGGCGAAGG CGAAGAAGAA TCATACGTGT TGGACCGCTG GCCATCGCCG
GCCGAGTTGA AGGGTAAAGT GATAATTCGG GACAAGGTGA AGCACAAGCA AGACGAAGTC
AAGAACGCAT CGTTCGGCGT CGGAAAAGTG TTCGAGGCGA TATCCTCGAG GGGCAAGTCT
AAGTCTACTC GGAAGCTTCT ACAGGAGAGC ACCAAAGCTA CACTTTCAAC TGGGAAAAAC
AAGCTCTCGG TGGCGAACTC CGTGGTCAAC CTCTCCAAAA GTGCGCCTGC AGGCGTTCCT
GAAGACTCCG ATGGGACGAG CGAGGACGAC GGCGGCGAGG ACGACGAGGA CATCAAGGCA
CTCGTCTCAT TGCGAAATTT GAAGTTTCAT GGTTTCAAGG AAGCGAAAGA TCTCGGTACA
AAGTTTTCTT GCAGTTGGAG CGAGAACAAG GCCAAGAAAT TAGTCGAAAA GTCGAGCCAA
AAGGATTTGC TCGAATTCAC CAAGGCGCAT TTGCTTCGCA CGTACCCGGG CGGTCAACGC
ATCATGAGTA ACAATTACGA TCCCTCCGAC GCGTGGTCCA TAGGCGCATC GCTCGTCGCG
CTCAACTTTC AGGCGCAAGA CAGATATATG TGGGTGAACC AAGCCAAGTT TGCGGTCAAC
GGTGGGTGCG GTTACGTGAA AAAGCCCGAC TATTTAATCA ACCCGTCGGT TCAAAGACCG
ACCAAGCCTA GAATTCTGCG CATACACGTC TTCTGCGGAC TAGGTTGGGA AAATTTCAAG
GATGCCGATT TCATGTCGGC ACCGGATACG TTCATGAAGA TTTCCCTCTT CGGTTGCGTC
GCCGATCGCT TGTCCGCGAC TTCCAGAGGG AATTCAATGA GAACGTCGGT GTACTCCAAG
GCACGAGTCG GTCCGTGCGC TCAACCCATT TGGAACGAGC ACTTTGACTT GGAAATTCGC
GAGCCCGAAC TCACGGTGCT ACAAATCCAA GCCATGGACA AAGATGGCGC GCGCGATGAG
TTCCTCGCAC ACTACGACGT CGCCGTTAGC GCTTTGCGCG AAGGCGTCCG CATTGTACCG
TTGCTCGCGC GCGACGACGA ATACGTACAC GACAGCAAGT CCTGCGCTGG CGTTTTGTGC
AAGTTTGAGT GGCTCGATGA GAAAAGCTCG TCCAACGATG CTCTGCCAGC TCGAACGAAC
GAGTCAAAAG AGACGACCAA CATCTCCGAA GATGCGTAAC AGACTAATAT GTGGCTGTGG
ACGTCGCCGA CGTCTCAGCG TCTCACACTT CTTACTTTTC TATGTGTAGA ACATTCGCGT
TGTAA
 
Protein sequence
MTDELSHYFV NSSHNTYLDG GQLFSRSTSS AIAWALERGC RVVELDCYDG GSKGPIITHG 
GTAVRPMLFK DAIAVINDRA HVASEYPVIV TLENHASRET RAVMAKIMRH TFGDKLWTPP
SKGEGEGEEE SYVLDRWPSP AELKGKVIIR DKSTKATLST GKNKLSVANS VVNLSKSAPA
GVPEDSDGTS EDDGGEDDED IKALVSLRNL KFHGFKEAKD LGTKFSCSWS ENKAKKLVEK
SSQKDLLEFT KAHLLRTYPG GQRIMSNNYD PSDAWSIGAS LVALNFQAQD RYMWVNQAKF
AVNGGCGYVK KPDYLINPSV QRPTKPRILR IHVFCGLGWE NFKDADFMSA PDTFMKISLF
GCVADRLSAT SRGNSMRTSV YSKARVGPCA QPIWNEHFDL EIREPELTVL QIQAMDKDGA
RDEFLAHYDV AVSALREGVR IVPLLARDDE YVHDSKSCAG VLCKFEWLDE KSSSNDALPA
RTNESKETTN ISEDA