Gene OSTLU_45212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_45212 
Symbol 
ID5000703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp616398 
End bp617831 
Gene Length1434 bp 
Protein Length427 aa 
Translation table 
GC content57% 
IMG OID640416124 
Productpredicted protein 
Protein accessionXP_001416712 
Protein GI145344381 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.249426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGACGACGCG CGCGCGCGAC GAGACGCTCG AAAGCGCGCG CGAACGCGTC GAAATATATC 
GCCGCCATTC GCTCGACGCC ACCGAATCGC GCGCGAGAGA CGGCGACGAC GGCGATAAGA
TAAAGCGTCA ACGATGGCGA CCGTCCCGAG TAAACGAAAG TTGGTCGGGT GCGCGAACTT
CGTGCGATCG AACCCGCTGA GCGACGCGTT CGAGTGTGAA AAGTTTGACC ACATCGAGTT
TTGGTGCGGG GATGCGACGA ACGCGGCGGC GAGGTTCGGG GTTGGCTTAG GCATGGGGCT
GCGATGCAAG AGCGACGCGA CCACGGGGAA CGGGACGTAC GCGTCGTACG CGATGAAGTC
GAACGATCTG ACGTTCGTGT TCACCGCACC GTACGGAGTC GAGAGCGGAG GTAGTCGAGG
GGAAGCGCCG CATCCGGGAC ACGAGGGACG GGCGATGATG CGATTTTTTG AGAAGCACGG
GCTGGCGGCG CGCGCGGTGG GCGTGCGAGT CAAAGACGCG CGCGCGGCGT ATGAGGAGGC
AGTGAAACGT GGTGCGCGTG GCGTGCTGGC GCCGACGGTT TTGACACACA CAGTAGACGA
CGGATGTGCG AAGGGTGGAC AAGTCATCGC GGAGATTGAG CTATATGGCG ATGTCGTCTT
GCGCTTCGTC AACGCGACGG ATGGATTTGA CGGAGACTTT CTGTGCAATT ATTCGGCGAC
GCGCGATGCG CCAGATGTGT CGTATGGGTT GCAGCGCCTC GATCACGCCG TCGGTAACGT
GCACGATTTG ATCGAAACCG TGGATTATAT CACCAAAGTC ACGGGCTTTC ACGAGTTTGC
TGAGTTCACG GCGGAGGACA TCGGAACGAT CGATAGCGGG TTGAATAGCA TGGTGTTGGC
AAACAATAAC GAGTACGTGT TATTGCCTGT GAACGAGCCG ACGTTCGGGA CGAAGCGGAA
GAGTCAAATC CAAACATATC TTGAGCAAAA CAATGGCCCT GGGTTGCAGC ACTTGGCGTT
GAAAACGGAT GACATCTTTG CGACGGTGCG AGAAATGCGC AAGTACTCGC ACTTGCGAGG
CGGATTCGAC TTTCAAGCGC CGGCAAGCGA TGACTATTAC AAGCAACTCA AGGCGAAAAT
CGGCGATGCT TTGAACGATG AGCAGTACGC GCTTGTCGAA GAGTTGGGTT TGCTCGTCGA
TAAGGACGAC CAGGGCGTAT TGATTCAGGT CTTCACGAAG CCCGTGGGCG ACCGGCCGAC
GTTATTTTTA GAAATCATCC AGCGCATAGG CTGCATGCGT AGAAAAGCGG ACTCGGAATC
ATTTGAGCAA GCAGCCGGAT GCGGTGGGTT CGGCAAGGGT AATTTCTCCG AACTGTTCAA
ATCTATTGAA GCGTACGAAG CGACGCTTCA AATTTAGCCA GGCATTGTTA TCAA
 
Protein sequence
MATVPSKRKL VGCANFVRSN PLSDAFECEK FDHIEFWCGD ATNAAARFGV GLGMGLRCKS 
DATTGNGTYA SYAMKSNDLT FVFTAPYGVE SGGSRGEAPH PGHEGRAMMR FFEKHGLAAR
AVGVRVKDAR AAYEEAVKRG ARGVLAPTVL THTVDDGCAK GGQVIAEIEL YGDVVLRFVN
ATDGFDGDFL CNYSATRDAP DVSYGLQRLD HAVGNVHDLI ETVDYITKVT GFHEFAEFTA
EDIGTIDSGL NSMVLANNNE YVLLPVNEPT FGTKRKSQIQ TYLEQNNGPG LQHLALKTDD
IFATVREMRK YSHLRGGFDF QAPASDDYYK QLKAKIGDAL NDEQYALVEE LGLLVDKDDQ
GVLIQVFTKP VGDRPTLFLE IIQRIGCMRR KADSESFEQA AGCGGFGKGN FSELFKSIEA
YEATLQI