Gene OSTLU_93753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_93753 
Symbol 
ID5005788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp18662 
End bp20149 
Gene Length1488 bp 
Protein Length495 aa 
Translation table 
GC content62% 
IMG OID640421209 
Productpredicted protein 
Protein accessionXP_001421682 
Protein GI145354839 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGACG CGATGGGCGA CGACGCGGTC GTCGTGCTCG TGGCGACGAG CAGGGACGAT 
CGCGAATCGT ATCTAGACCG CGCGGCGCAC GACGCGCTCG CGACGACGAC GACGAAGGCG
CGCGATGACG ACGACGACGA CGACGAAGGC GCGATGCGAA GGATGAACGC GACGGACGAC
GCGAAGAGGA TCTTTATGGC GACGCGAAAG CGCGAACGAC AGCGAGAGTG CGCGCAAGTG
GGTGGTTTAT TGACGCGCGC GTGCCTGAAC GCGCTCGAAC GGCGAAACGG TGCGACGACG
GCGTCGTGGG GACAATTCCT GAGGAGCAAA CAGCGAGCGA TGCAACGACT GCCGCCGGGA
ACGTCGAGCG GGATACGAGT GCGCGCGACG CGGTGCATCG CGATCGAACA CGAAAGGATG
ACGTTCTCGA AAACGATGCG AAACGTGGGT AGGCGAAGAG GGGTGTTTGT GGGGATAAAT
TACGAGGAGT GCGGCGTCGA GCGGTGGCGG TTGCGAAGAC GCGGGGGCGA CGCGTTGCGA
ATGCGCGAAT ATTTAAAGAC GCACTGTGGG TACGACGAGG ACGACGAAGT GCTGGTGCTG
TTGGAAGATA GCGAAGCGAG TACGAACGAT GGATCCATCA ATCGCTCGTG CTCGAAGAAG
GCGATTTTGA AAGCGTGTCG ATGGCTCGTC GACGGCGCGC GCGCGGGGGA TTCGCTGTTC
TTTTACTTTA GCGGACGCGG GCAAGAAGTG AGCGAGACGG CGACGACGAC GACGACCAAG
GATAGCGTCG CCGGAGGTGG CGCGTACAAG GGTTTGAATA AAACCGCGCT GTGCGCGTCG
GACACGCCGG GCGACCCCAC GGCCAGGATC ACGCGCCAAG AGTTTCGAGA GGCGTTGCGC
GTCGACGCGG TCCCGTCAAA CGTCCACCTC ACCGTGTTTT TGGACATTTA CGGCGGCGGT
GGCGAGAACG CGCTGCATGA CATGCCGTAC ACGTGTGTGA ACGTAACGCT TCCGGACGAG
CGCGAAATCA AAGACGCAAA GAATGGTAAA CGGGTGTCCC CGGTGACGCC GCTCTGGATG
GTGCCGAACG GTGAAAAAGC CGTGAAGGAG TTCCTGGCGC TCGCCGAAAC CGCGGCGGAA
GCTTACGCGG AGTGCGCTGA GACGAACAAG GCGTATGACG CCATAAAGCG CGATGCGCCG
CAAGAAAAGC TACCGCAAGA AAAGCCTGCG GTCGAGGCGA ACGCCGAACC AAAGCCAAAG
CCAAAGAAAC CGTCGCCGCC GATGGGAGAG CCGAATCCGG AACGCGCCGT TGCGGAAGCG
GAAGAGGCCC CTGTGCCGGA GACGCACGCG AACGAGAAGT CGAGCGTCGT CCCGGCCCCG
GCGGCGGTGA TCGAGCGAAA CGCGCGAGAG GAACGCGCGG AAATGCAACC GCGCTCGCCG
CAGTCAATCG ACGAGTCAAA ACAGCCGGGC TGTTGCGTGA TAAGTTGA
 
Protein sequence
MGDAMGDDAV VVLVATSRDD RESYLDRAAH DALATTTTKA RDDDDDDDEG AMRRMNATDD 
AKRIFMATRK RERQRECAQV GGLLTRACLN ALERRNGATT ASWGQFLRSK QRAMQRLPPG
TSSGIRVRAT RCIAIEHERM TFSKTMRNVG RRRGVFVGIN YEECGVERWR LRRRGGDALR
MREYLKTHCG YDEDDEVLVL LEDSEASTND GSINRSCSKK AILKACRWLV DGARAGDSLF
FYFSGRGQEV SETATTTTTK DSVAGGGAYK GLNKTALCAS DTPGDPTARI TRQEFREALR
VDAVPSNVHL TVFLDIYGGG GENALHDMPY TCVNVTLPDE REIKDAKNGK RVSPVTPLWM
VPNGEKAVKE FLALAETAAE AYAECAETNK AYDAIKRDAP QEKLPQEKPA VEANAEPKPK
PKKPSPPMGE PNPERAVAEA EEAPVPETHA NEKSSVVPAP AAVIERNARE ERAEMQPRSP
QSIDESKQPG CCVIS