Gene OSTLU_24498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_24498 
Symbol 
ID5002065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp43946 
End bp45462 
Gene Length1517 bp 
Protein Length490 aa 
Translation table 
GC content58% 
IMG OID640417486 
Productpredicted protein 
Protein accessionXP_001417652 
Protein GI145346350 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0174559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTCG CACCGTCGCT CGGTAACACC CAACGCGATT CGCGCTTCGG CGCATCGTCC 
ATCGACGCTA CCGAGCGAAC GGTGCGAGAG TATAAGGACG AATTGTCAAT CCTCTCGTGG
TCGCACGATC GCGGATTCGA GACGCACAGA GGATCGTGGA GGCAGGAGGC GCGAAGCGCC
GACGCGAACG AGACGCCGGA CGCGCCGGCG AAGGACGAGC GCGACGTGTC GAATCGTCAA
CGAGGGGCGG CAACGCTGAT GGCGCAAACG ATCGCAAAGT ACATGCCTGG TCGATTATCG
GAGACGTCGG CGCCGTTCGA GTTGTTGTAC TCGACGTGGG ACATGCCGTC GACGCCGTGC
TTGGACGCCG AGTACGCGCA GAAGTTGTGC GAGTTTGATC GATGGGTACC TATATTTAAC
TTTGGGAGCT CGTTCAAGGA TCAAACCGTG CTGCCGACAA TGGTGCCAGC CACGCTCGGC
GCGCTTCGTA GTTGTTTCAT CGAGGGTTTG AATCCGCATT TAGGCTTTAG CGCGGACGAC
GAGCCACAGT GCGACTTTTT ACGCTTCCCG ACGACAAAGT ATTCCGAGCA AGGAAAGTGC
GACGCCAAGC TCGCTCAGTG TCGGTATCAC GGATTGTTTT CGCTCAATGC CGTGGATGAT
AAGTCGATGT ACGAGTGGGA TAATCTCAAA CCGCAGGTGG AATGGCGCGG GAGCGACTAT
TCGTTTCTCG CGCCCCGGGT ACCGGGGCAC AAGCCTGACG CGAATGAGTT TTTGAGCGAA
ATCGCGTCTT CAGCCAATGT TTCGCAGGCC TTACACGATA TGGCGTTTTC CACCGATATT
GGACCACGTC TCAGGGCGGT ACTGTTTTCA AAGTTGTACC CCGAGCTCAT CGACGCCAAG
TTTTTCAACT GGAAAAATCA GTCGAGCGCG CGCGACAAGA TGGCCGCCGA ACTCGGTATT
GACGCCACGG AACGCTTAAC TGAAGAGGCG CTGGGCAAAT ACAAGTATCA TCTCGACCTC
GGCGGTGGTG GAGGGACGAC GTGGAGCGGA CTGATTCCCA AACTCACCAT GCCGGGCGTG
CTGTTGCACC ACGAGACGTC GATGAAGGAT TCGTACTTTG ACACTCTGAA GCCGTGGGTG
CACTACGTGC CCGTCGCAGA AGACCTACAC GACGTTTTTG AGAAGATTTC TTGGTGCGAA
ACGCACCCCG AGGAAGCACG AAATATCAGC GCCAACGCCA ACGACTGGGT TCGCGACTTC
CGAGGCTTGA AATCGCTGCT TCGTCACAAC TATCAAGCCC TGGCGATTCC CTTGGCGAAA
ACGCTCGATC CCTCTGGCGA GACGCTGCTA GACTTTGAAG CCGCGCACGT GGCCGCGCGC
GCCGAGCGCC TGGCCGCGCG CGCCGCAAAA CTCGCCATCA AAGCGTCCAG AGACGCCGCC
AAAGCCGCGA AGGCCACCAA CGTTTCCGAT TAACCCCCGT TCCGCGCTGT ATCAACAGTA
TCAACACCAT CCGTCCA
 
Protein sequence
MMFAPSLGNT QRDSRFGASS IDATERTVRE YKDELSILSW SHDRGFETHR GSWRQEARSA 
DANETPDAPA KDERDVSNRQ RGAATLMAQT IAKYMPGRLS ETSAPFELLY STWDMPSTPC
LDAEYAQKLC EFDRWVPIFN FGSSFKDQTV LPTMVPATLG ALRSCFIEGL NPHLGFSADD
EPQCDFLRFP TTKYSEQGKC DAKLAQCRYH GLFSLNAVDD KSMYEWDNLK PQVEWRGSDY
SFLAPRVPGH KPDANEFLSE IASSANVSQA LHDMAFSTDI GPRLRAVLFS KLYPELIDAK
FFNWKNQSSA RDKMAAELGI DATERLTEEA LGKYKYHLDL GGGGGTTWSG LIPKLTMPGV
LLHHETSMKD SYFDTLKPWV HYVPVAEDLH DVFEKISWCE THPEEARNIS ANANDWVRDF
RGLKSLLRHN YQALAIPLAK TLDPSGETLL DFEAAHVAAR AERLAARAAK LAIKASRDAA
KAAKATNVSD