Gene OSTLU_27017 details

Gene Information

Locus tag: OSTLU_27017
Symbol:
ID: 5005103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 73356
End bp: 74555
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table:
GC content: 62%
IMG OID: 640420524
Product: predicted protein
Protein accession: XP_001420897
Protein GI: 145353173
COG category:
COG ID:
TIGRFAM ID:
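
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS ends in a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 73356, End bp 74555.
length = gene_length(73356, 74555)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the listed 1200 bp and 399 aa.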


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value: 0.233001
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0292982
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCA TCGTCATCGA CGACAAGTAC GCGTTGAGTG ACTCGGACTC TGTCGACGCC 
GACGCCCCGC GATGCGCGAA CTGCGGCGTC GCGAGCGACC GACTCAAAAA GTGCGCCAAG
TGCCGGCGCG CGCACTTTTG CAACGCCGCG TGCCAGCGCG CGGCGTGGGA CGCGCACGCG
CGCGAGTGCG TCGCGGATGC GAACGCGAAA CCGGCGTACA AACCGCCCGA ACCGCCGCGA
ATGCCGACGA AGGCGGAAAA GGAAGAGGCG AAGGAGAGCG AGACGCGGCG AATTCGCGAG
ACGACGTTGC CGCGAGCGCG CGCGGCGTTG CGAAGAGATG GCGTCTCGAC AACGGTAGAT
TTAGACGAGT TGATCGAGGG ATTAGAAGAC GCGATCGTGT TCGCGATCGG GGAGGAGGAT
CAGGGGTTGA CTCGCGAGGT GCGGTTGGTG CTGGCGAGGG CGTATTTAGA GGCGAAGCGC
GCGGATGAAT GTTTGCACTA TTTGGCGCCG GCGCTGGAGG AGGCGCGAAA GGAGGGCGGG
GCGGCGAGCG CGGACGCGCA CACGCTCGCG GCGAAGGCGC ATTGCGCGAA GGGCGAGAAA
GAACAGTGTC GCAAGGAATT GACGGCGGCG TTGGATTGCG CGAGCGAATC GACGAGCGAC
GAAGCGCAGT GCGATACGTT GCTCGACGCG GGGATTATTT TACACGATCT CGGTGACTGG
GAGCGATGTG CGCCGTTGCT CAGCACCGCG GGCGAGGCGG CAGAAAAACT CGGTCGCTTG
CGCGAGGCGG CGCGCGCGTA TAATCGCGCG GGTTCGGCAC TTTTGCGTTC GGGTCGGCCC
GACTACGCCG GGCGATGCTG GACTCGAGAG CTGCGAGTGC TAGAGGCGGA CGATTCCACC
GATCCAGGGA CGTTGGCGCA GGCTTTCGCG AACTGTGCGA GCGCTTTTTT ACTCACTCGC
GGCGAAGACG ATGATGCGTT CAACTTACAC AAGAAGTCCG CGCTCACGAA GGCCCGCGAG
TCTGGAAATG ACGCTGAGGC TCGCGTTTAC TTGCAATTAG GCAACGCTTA CAAACTCGCC
GGAGACGCGA TAGATGATAG TTTAGCGCGC GCGAAAGATT GTTTCGAGAA AGCAAAGTCG
TTATCGGCGA CCGATGCTGG CGAGATCGCT TCGCGGGCTT TAGAAATGCT TAGTCTGTAA
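
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before it can be measured. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names are hypothetical, and the example runs only on the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence shown above.
seq = clean_sequence("ATGTCCACCA TCGTCATCGA")
print(len(seq), gc_fraction(seq))
```

Passing the complete gene sequence to `clean_sequence` and then `gc_fraction` would give the value to compare against the reported 62%.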
 
Protein sequence
MSTIVIDDKY ALSDSDSVDA DAPRCANCGV ASDRLKKCAK CRRAHFCNAA CQRAAWDAHA 
RECVADANAK PAYKPPEPPR MPTKAEKEEA KESETRRIRE TTLPRARAAL RRDGVSTTVD
LDELIEGLED AIVFAIGEED QGLTREVRLV LARAYLEAKR ADECLHYLAP ALEEARKEGG
AASADAHTLA AKAHCAKGEK EQCRKELTAA LDCASESTSD EAQCDTLLDA GIILHDLGDW
ERCAPLLSTA GEAAEKLGRL REAARAYNRA GSALLRSGRP DYAGRCWTRE LRVLEADDST
DPGTLAQAFA NCASAFLLTR GEDDDAFNLH KKSALTKARE SGNDAEARVY LQLGNAYKLA
GDAIDDSLAR AKDCFEKAKS LSATDAGEIA SRALEMLSL