Gene OSTLU_1531 details

Gene Information

Locus tag: OSTLU_1531
Symbol:
ID: 5004503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 583157
End bp: 584497
Gene Length: 1341 bp
Protein Length: 447 aa
Translation table:
GC content: 58%
IMG OID: 640419924
Product: predicted protein
Protein accession: XP_001420388
Protein GI: 145352083
COG category:
COG ID:
TIGRFAM ID:
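
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 583157
end_bp = 584497

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Split the span into complete codons and any leftover bases.
full_codons, leftover_bases = divmod(gene_length_bp, 3)

print(gene_length_bp, full_codons, leftover_bases)  # 1341 447 0
```

Under that assumption the span is 1341 bp, matching the Gene Length field, and it divides into exactly 447 codons, matching the Protein Length field.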


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.486556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TACGTGCACG AGCTGCGCTA CGTCGATTTG AGCGATAATT TGTTCACGGG AGATTTGCCG 
CGAGACTTGT TTAAGATGAC GCAGCTTCAG AGCCTCGTGT TGTCTGGAAA TCGCATCACG
GGCGCGCTTC CCGAGGACGT CGGCGCGCTG ACGAATCTGC GGCACATCGA CTTGTCCGCG
AACGCCATGC GCGGCGCGCT TCCCGAGTCG CTCGGCGCGT TGAGTGAGCT CAAAGTGCTG
TATCTGGGCG AGTCTGGGCT CGAGAACAAA AACGACTTCG CGGGCCCGAT TCCCGAGTCG
TGGCGACGTT TGAAATCGCT GAAATCGTTT TCGTTAGCCG GCAACTCGAA CATCGGTGGA
ACGTTGCCCG ATTGGTTGCT CAACAATCTG GACTCGCTCG AAGAGTTGAC GCTGTCGAGA
TGCGGTTTGA CTGGGGAAAT ACCGCCGAAC GTCGATCAAA TGAAGTCGCT TCGCGTGTTG
GATCTCGGCG AAAACTCATT CAGCGGCGTC GTGCCCGTGG AATCGCTGTC GAGATTGCGA
CGTTTGAAGC ACCTGCGTTT GGCCGGAAAC GCGCTCATCG GGTCGCTCGG CCCATCCGTC
GCCCATTTGC GAGAGATTGA AACCTTTGAC GTGAGCTCGA ACCGTTTGAC GGGGGATTTG
CCAAAGGAAC TCTTCTCGCT GCGATTGCTA GAAATTTTGG ACGTTTCAAA CAACGCGTTT
ACCGGGACGT TGGCTCCTCC CGACGGCGCG GAGACGTCGA ATTTGCGCGT CGTCGACGCC
GAAAGCAACC GTCTCGTCGG CGTGCTCTTA GACGGCGAGT TCTTCAAGCG CGCGCCGCAT
TTGAGGTATT TGAGACTGTC GAATAACAGA ATTTCCGGCG CGTTCACCGA CGGCGCGTTC
GACGACGCGG GCGAACTGGT GGAGCTACAC GCGTCGAATA ACGATTTGCT CGGCCCGTTG
CCGGATTCTG TCCGTCATTT GACAAAGTTG AAATCGTTGC GACTGAGCGG CAACGCGCGT
CTGGGCGCCG GTCGTGGAAT GCCCGACGCG CTGTCGGAGT GTTGGAATCT CAGAGTCGTC
GAGCTCGCGC GCGCGGGCTT CGAGGGCGAC ATCGCGGACG ATGCGTTCGC GCGCATGCGT
CGATTATCTT CGCTGAATTT GGCCGAAAAC AAGTTTTCGG GCAACGTGCC CGCGTCGTTG
AAATCGGCTG AATTTCTGCG GAAATTGGAG ATTCAAAACA ACGCATTCGT CGGGGAAATT
CCGTCGTGGC TCGTCGAGCT TCCGCACCTG GAACTCGCCG ATTTCACGGG CAACAAGTTC
ACGGGCGCGA TCCCGGATTC G
 
Protein sequence
YVHELRYVDL SDNLFTGDLP RDLFKMTQLQ SLVLSGNRIT GALPEDVGAL TNLRHIDLSA 
NAMRGALPES LGALSELKVL YLGESGLENK NDFAGPIPES WRRLKSLKSF SLAGNSNIGG
TLPDWLLNNL DSLEELTLSR CGLTGEIPPN VDQMKSLRVL DLGENSFSGV VPVESLSRLR
RLKHLRLAGN ALIGSLGPSV AHLREIETFD VSSNRLTGDL PKELFSLRLL EILDVSNNAF
TGTLAPPDGA ETSNLRVVDA ESNRLVGVLL DGEFFKRAPH LRYLRLSNNR ISGAFTDGAF
DDAGELVELH ASNNDLLGPL PDSVRHLTKL KSLRLSGNAR LGAGRGMPDA LSECWNLRVV
ELARAGFEGD IADDAFARMR RLSSLNLAEN KFSGNVPASL KSAEFLRKLE IQNNAFVGEI
PSWLVELPHL ELADFTGNKF TGAIPDS
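
The correspondence between the gene sequence and the protein sequence can be illustrated by translating codons in the reading frame that starts at the first listed base. This is a sketch only: the Translation table field of this record is empty, so the standard genetic code is assumed here, and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Assumption: standard genetic code, written in the conventional
# T, C, A, G codon order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First three 10-base blocks of the gene sequence listed above.
first_blocks = "TACGTGCACG AGCTGCGCTA CGTCGATTTG"
print(translate(first_blocks))  # YVHELRYVDL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would allow the remaining residues to be compared.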