Gene OSTLU_119509 details

Gene Information

Locus tag: OSTLU_119509
Symbol: Ogd
ID: 5000410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 461185
End bp: 462744
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table:
GC content: 46%
IMG OID: 640415831
Product: hypothetical protein
Protein accession: XP_001416153
Protein GI: 145342144
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3751] Predicted proline hydroxylase
TIGRFAM ID:
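
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption here)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(461185, 462744)
print(length, protein_length(length))  # 1560 519
```

Both values match the Gene Length and Protein Length fields listed above.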


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.557592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGATG ACATCTGTTC TCGAAGTTTC CTTGAATTCT CGTCACGAAT TACTTCGCCC 
ATTGTGAACG CTGTCCCGTA TCCTCACGTC AACATCCATA ACGTTTTTAA CGACCGCTTC
CTTCGCGAGT GTTTGGCAGA ACTCAAAGAC CAGCTGACAG CAAATTTCAA AGAGACTGAC
CTGTTCAAAG TATATCAAAC AACAGATCTA GCTAACTTAC AGGACTGTGT GCCACGAGCA
CGCGTGACCG TACCGCATTT ATTCAGGCTT CGACAGTATT TGTATTCAGA GGCCTTCCGT
GATTTCATCG TTCAGGCGAC AGGATGTGGT TCACTTGACG GCGCAGTGGA CTGCTCTTGT
AATATTTACA CAGCTGGATG TCATCTGCTG TGCCATGATG ATGTGATCGG TACTCGGAGA
ATTTCATATA TAATCTACCT TTCTGAACCA GACGAGGTTT GGACGGGTAC AGACGGAGGA
CAATTGGAGC TGTATCCCAT CGGCCCAGAT GGAAAAAATC CAACCGACAG TCCCGTGGTG
TCGATGATGC CGGAATGGAA TTCCATGGTT CTATTCGAGG TCTTGCCTGG ACACAGTTTT
CATGCTGTTC GGGAGGTGAG TTCGATTACA AAAACCAGGG TCAGCATATC AGGGTGGTTC
CACGCAAAGC AGATAAAACA AGCTGAAAAA CGCTATGGTG CTCCATCAAC ACTTCAACAG
CTTCAGGCAG CTGGCGATAT CTACTATCCA CCGTACACTT CAGTTTCAAT TGCAGCTCGG
TCTAGCCGAC AAATAGCAGA ACACGAACTG TCAAGTTTTT TGCACAAATG GGTCAAACCA
GAATATCTGA TACACGAAAA CATTATACGA ATTCGAGAGC ACTTTCAAAA CGAAGGTTCA
ATTCAGCTTC ACGACTTTCT CTTGCCAAGT ATTGCAGATT CGTTGCGTGA AAAGTTAAAA
CGCGAAGACA GCCGCAACCG TCGAAAATGT GGCCAGTACG ACTACGCATG TGGGCATGGT
TGGAGTGTGA AAGGTCCGCC TCATATTCGT AGGTATTTAA GTTATCAGCC TGATAGTGCG
CGTACGAATA GAAATGATGT AGGTGATCGC TTGGAGGAAG TTCTGATTAA TGTCACCAGC
ACAGTTGGAT TCCAGAACTG GTTGAGCGCG GTCACCGGTT TTCACTGCAC ACACGCATTT
TCTGAAATAC GCCGGTTCCG AGCGGGACTG GACTACACGC TCGCGCACTC TGACATTTTC
AAGGATTCTC AGATAGATGT CGATCTATGC TTTACGAGTG GTCCTTCTCA GTGGCTCTCT
GGTGACTTAG GGGGCTATCA ATGCTTTACT TCCACTGAAG CAACGGACGG TGCTGCAGAC
GTATACTCTG GGGACATTGC AGAGAACGAG TCTCTGCGAT CTATCGCACC TACATTCAAC
AGTTTAACAC TCGTAAAAGT GGATAAAGGC ATTACTAATT TTGTCAAATT CATTTCAACA
CATGCGAAGG GAAGTCGTTG GGACATAACT TCTCGTTTTG TAGTGGCCAA GTCAACCTAG
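
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined into one string before any computation. The following is a minimal sketch of that cleanup, followed by a GC percentage calculation of the kind that could be used to recompute the GC content field; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCACGATG ACATCTGTTC TCGAAGTTTC CTTGAATTCT CGTCACGAAT TACTTCGCCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running the same two functions over the full block would give the figure to compare against the reported GC content.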
 
Protein sequence
MHDDICSRSF LEFSSRITSP IVNAVPYPHV NIHNVFNDRF LRECLAELKD QLTANFKETD 
LFKVYQTTDL ANLQDCVPRA RVTVPHLFRL RQYLYSEAFR DFIVQATGCG SLDGAVDCSC
NIYTAGCHLL CHDDVIGTRR ISYIIYLSEP DEVWTGTDGG QLELYPIGPD GKNPTDSPVV
SMMPEWNSMV LFEVLPGHSF HAVREVSSIT KTRVSISGWF HAKQIKQAEK RYGAPSTLQQ
LQAAGDIYYP PYTSVSIAAR SSRQIAEHEL SSFLHKWVKP EYLIHENIIR IREHFQNEGS
IQLHDFLLPS IADSLREKLK REDSRNRRKC GQYDYACGHG WSVKGPPHIR RYLSYQPDSA
RTNRNDVGDR LEEVLINVTS TVGFQNWLSA VTGFHCTHAF SEIRRFRAGL DYTLAHSDIF
KDSQIDVDLC FTSGPSQWLS GDLGGYQCFT STEATDGAAD VYSGDIAENE SLRSIAPTFN
SLTLVKVDKG ITNFVKFIST HAKGSRWDIT SRFVVAKST
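
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The record leaves the Translation table field empty, so this sketch assumes the standard genetic code (NCBI table 1); `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCACGATG ACATCTGTTC TCGAAGTTTC"))  # MHDDICSRSF
```

The output matches the first ten residues of the protein sequence shown above.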