Gene OSTLU_15788 details


Gene Information

Locus tag: OSTLU_15788
Symbol:
ID: 5002489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 399461
End bp: 400894
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table:
GC content: 63%
IMG OID: 640417910
Product: predicted protein
Protein accession: XP_001418470
Protein GI: 145348049
COG category: [S] Function unknown
COG ID: [COG2433] Uncharacterized conserved protein
TIGRFAM ID:
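
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 399461
END_BP = 400894
GENE_LENGTH_BP = 1434
PROTEIN_LENGTH_AA = 477


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)              # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```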


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00010721
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.705917
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACGAC CGGCGTCGGA ATCGCTGACG ACGATCGCGC CGCTGCAGCT GCGGTTCGAG 
GACATATCCG TGCACTCGCC GGGGGTGTTC GCGAAGTCGG ACGCGATGAA GACGCTGCGG
GAGGCGCCGA GCGGGGTTGG ATTTCGCGCG GCGCTCAAGG AGTTGAATCA GACGGAGGAT
TCGTTCGCGA AGATTTTGTC GCAGGCGCGG GAGAAGGCGC TCGGGAACGG GGACGGGGAT
TTGGAGCGAA TCAAGGCGCA CATTAAGCGG TTGAAGTTCG GGACGATCGA GTTGTCGACG
GAACAGGCGT TTTTGAAGGC GGTTTTGGAA TCGCGACCGT TGCCCAAGGA GACGCCGACG
GAGAGCGAAG GGTTCAAAGC TTTGCAACAG CGGTTGAAGA ATTTGACGAA CGAGAACGAA
CGACGGGCGG AGGAGATGGA ACGCGAGATT GCGTCGCTCG GGGACGAGTA CGAGACGTTT
CGCGTCGAGC ATAGGGCGTT GACGGAGGTT GTGGAAGAAT TAGAGCAGTT GGAGGCGGAA
GCCGCGGCGG CGGCGTGGGA AGCGCAGGCG GGAGACGCCT CGGTGAGCGC CGCGGACCGG
GCGAAGTCCA AGGAGGAGTT ACAGCGCGAG CTCGAAGCGC TCGATGCGGA ACTGCGCGAG
GTGAATGTGA AGTTGAGCGA GAATCAAAGC GGCGCGAGCG AACTCAAGGC GGAGTTGGCG
CCGGACGAGA GCGCGATTGA TCAAATCAAC GCCCAGGCCA AGTACTTGAT CGGTATCTCC
AAGGCTGCGG CGCGAGAAGC GGAAATGTCG GCGCAAATCG CCGAAGCGCA AGAGGCGGTG
CTCGAGCAAA CGAAATTGCT AGTCACTTTA CAAGGGGTGG AAATCATGGC CATCGAAGAA
AACGCCCTCG TGCTCAAGAT TCACACGCAC TTGCCCGCGA CGCCCGAGTT CGCCATGGAA
AGCGCCAAGC CGCGTGGCCC GAGCTCGACG ACGCACACGG TCACGCTGCA CTTGATGAAG
GATAGCGCTC GTCTCGCCGG GGCGACGCTC GAACCGTCGG ACACGCCCAT CCTCGACATC
ATCGAAGAGT CGGCGGGCAT GCCCGTCCTC GCTCGCGCGT TGGCGGAAAT CCGCCTTCGC
ATCGCCGCCA CGGCGCAACG TGCCGAGGCC CTCGCGGCGG CGGCGGCGCG AACGGCGCTC
AGATGGAGTT CCGGCGAGTC ATTGGTGCGC GCGGCGCTGC CAAACGGCGC CGTCGCCGTC
TTGGACGTGC CTTTCGAATG GCCGACGCGC GGCGCGAACA TTTCGTTGAT CAATGTCGAG
ATGGTGCCGC CGCAAGTGGC ACAACTCGCG GTGACCCGCC TGATGACGAA GAATCTGTTG
TGCATCGGCG ACGCGCTCGA AGCCACGAGC GAGGCGCTGA AGAACCAGGC GTAA
 
Protein sequence
MERPASESLT TIAPLQLRFE DISVHSPGVF AKSDAMKTLR EAPSGVGFRA ALKELNQTED 
SFAKILSQAR EKALGNGDGD LERIKAHIKR LKFGTIELST EQAFLKAVLE SRPLPKETPT
ESEGFKALQQ RLKNLTNENE RRAEEMEREI ASLGDEYETF RVEHRALTEV VEELEQLEAE
AAAAAWEAQA GDASVSAADR AKSKEELQRE LEALDAELRE VNVKLSENQS GASELKAELA
PDESAIDQIN AQAKYLIGIS KAAAREAEMS AQIAEAQEAV LEQTKLLVTL QGVEIMAIEE
NALVLKIHTH LPATPEFAME SAKPRGPSST THTVTLHLMK DSARLAGATL EPSDTPILDI
IEESAGMPVL ARALAEIRLR IAATAQRAEA LAAAAARTAL RWSSGESLVR AALPNGAVAV
LDVPFEWPTR GANISLINVE MVPPQVAQLA VTRLMTKNLL CIGDALEATS EALKNQA
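
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, assuming the standard genetic code (the Translation table field of the record is empty, so this is an assumption, not a statement from the record). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The sequences are printed in space-separated blocks of ten, so whitespace is stripped before processing.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGGAACGAC CGGCGTCGGA ATCGCTGACG"
print(translate(first_blocks))  # MERPASESLT
```

Pasting the full gene sequence into `translate` and `gc_fraction` gives values that can be compared against the Protein sequence block and the GC content field of the record.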