Gene OSTLU_119486 details

Gene Information

Locus tag: OSTLU_119486
Symbol: SmkH
ID: 5000128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 423777
End bp: 425633
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table:
GC content: 43%
IMG OID: 640415549
Product: Protein Lysyl hydroxylase fusion protein, putative
Protein accession: XP_001416384
Protein GI: 145343554
COG category:
COG ID:
TIGRFAM ID:
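
The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon; the record does not state either convention, but both are consistent with the numbers given. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information table of this record.
start_bp = 423777
end_bp = 425633
gene_length_bp = 1857
protein_length_aa = 618


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 425633 - 423777 + 1 gives 1857 bp, and 1857 / 3 - 1 gives 618 aa, matching the table.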


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0222202
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAAAG GTGACTTTCA CAAAGTTGAC TCGTCTACGA ATTTTCTCCT TCTACAAAAA 
AACAGTTCGC GGCAAACTTG CGGAAACTGT TTGCGTCTTT TCGGTAACTG CGACGATGAT
AAGGCAAGTT TCATTAGGTG TGACTGTGGA GATATCTTCT GTAGACGAGA TTGTGCACCG
CATGCGATGC GGGAATTAGG CTGGCATTCA ATTTTATGCT CAAATAGAAG TGAACCAATG
TTGGAATTCT ACGAGTATTG CCGAAGGTCG AGCGAAAGTT TCACTTTTGC TGCTAAGGTG
CTTGCTCAAC TTTTATCAGC AATGTCAGAG TGTGAAACGA GCGTATCAAA AATACAATTG
AGTTGGTCGG AGCAATTCGA TCCGCGTCAT AAGTGGTCCG AACTTGCTGC ACGCGCCAAC
TGGGAGTGTA CGGACTTCTA TGAGTTTGTG CGACGAGCGT GTATTTTAGA ACATCAAATA
GCAGATGCAT GGGACTTATT GAAAGCACAT TTGAATAATA GATTTGATCT AGAGACAAAA
ACACGATTTG CCGAGCTTTT AGACTTTGAG ATGTACGCAG ATATAGTGGC TTCCATTCAG
CGGTTTGCTG TAGCTATTGG TACAGAGAAG ACAACAATTC CACCGACACG CCAATCTCCT
TCCCAAAATG GTGATACTCA TCCTGCATTA GCTCATTACG GCTCAAAGTT TGATATTCAT
CCGAAAGTTT CATACCATTG TGCAATTATG AATAAATTGC CAGATTACTG CGGTTTTTTG
TTGGCACCAA TTCTGACCAA ATTCCAACAC AGCTGCAATC CAAGTGTGCA AGTTGAAGCT
AATTCAACCC TTTCACAGGT GAAATATGGA TATATAGCGC TCAGAGAAAT TGATCAGATG
GAGCCGATCT CTTTGACAAC TATACCTCTT CATTCTAGTG TAAGTGCCAG AGATGCCTCA
CTGTTGAGGC AGCTTGGTAA AGTTTGCGTT TGCTCGCGTT GTTCGTGGGA ACGAGAAGAT
TTTGGTAGAG TTTCGCCCGA ACAAATGAAG CAATTGGCTC TTCAAGCGCA GGAAGAGGGG
CGATACTCAG AGGCAGAGCT CTTATTTCAC TGCACACTTC TTCACAATCC ACACGACACA
GATGTGCTTC ATGCGTATGG TGTGACGCTC CTTAGTCAAG GCAAATGGAA ACTTGCCCAC
AGTATCTGGC AACTCGCTTT TAAGGTTGAC AAAGCTCATC TATGGCTATC GAAACAATTC
AACAAAGATG TTTCATACGC AGCGTATTTA TGTAGAAAAT GTCCCGAATA CTATGTGTCT
TTTCAGACTA TTTCAATTGA TGAAATTTAT TTGACAACAA ATAGTGCAAT ATCACCATCA
GCTTGTTCTA GCTGGATTAA AACCGCCGAA GCTCATGCAA CAAATAGAGG AGGCTGGGAC
ACAGATCGTC ACAAATCTGT AGCCACAACT GATTTGCCTA TACACGAAAT TCCCTCTGTC
TTGCGAGAGT GGAATTTGAT CTTCGGGCAA ATCATAGGTC CCTTCATTCA AGAACGTTTC
AGAGTCGATG GTGACACGAA TCTTAGGGTT CACGATGCAT TCATAGTAAA ATACGATGCC
AGTGATGGAC AGTGTCAGCT ACCAGTACAC ACCGATCAAG GTCACTTCTC CATAACTCTT
TCTCTAAATG ATCCTATACA ATACAAAGGT GGCGGTACGA TTTTCCCGGA GCATGAGTTC
ATTGTTCGAC CAAAGTGTGG AGATTTCGTC GCTTTCAGAA GTTATCTGAC GCACGGTGGC
GTGCCAATCA CATCTGGAGT CAGGTACATA GTCGTGGCTT TTCTCTACTT GAGTTGA
 
Protein sequence
MSKGDFHKVD SSTNFLLLQK NSSRQTCGNC LRLFGNCDDD KASFIRCDCG DIFCRRDCAP 
HAMRELGWHS ILCSNRSEPM LEFYEYCRRS SESFTFAAKV LAQLLSAMSE CETSVSKIQL
SWSEQFDPRH KWSELAARAN WECTDFYEFV RRACILEHQI ADAWDLLKAH LNNRFDLETK
TRFAELLDFE MYADIVASIQ RFAVAIGTEK TTIPPTRQSP SQNGDTHPAL AHYGSKFDIH
PKVSYHCAIM NKLPDYCGFL LAPILTKFQH SCNPSVQVEA NSTLSQVKYG YIALREIDQM
EPISLTTIPL HSSVSARDAS LLRQLGKVCV CSRCSWERED FGRVSPEQMK QLALQAQEEG
RYSEAELLFH CTLLHNPHDT DVLHAYGVTL LSQGKWKLAH SIWQLAFKVD KAHLWLSKQF
NKDVSYAAYL CRKCPEYYVS FQTISIDEIY LTTNSAISPS ACSSWIKTAE AHATNRGGWD
TDRHKSVATT DLPIHEIPSV LREWNLIFGQ IIGPFIQERF RVDGDTNLRV HDAFIVKYDA
SDGQCQLPVH TDQGHFSITL SLNDPIQYKG GGTIFPEHEF IVRPKCGDFV AFRSYLTHGG
VPITSGVRYI VVAFLYLS
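
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. The Translation table field of this record is blank, so the sketch assumes the standard genetic code (NCBI translation table 1); that is an assumption, not something the record states. The helper names (clean, translate, gc_fraction) are hypothetical. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as printed in this record.
first_block = "ATGAGCAAAG GTGACTTTCA CAAAGTTGAC"
print(translate(first_block))  # MSKGDFHKVD

# Last line of the gene sequence; it starts at base 1801, so it is in frame.
last_line = "GTGCCAATCA CATCTGGAGT CAGGTACATA GTCGTGGCTT TTCTCTACTT GAGTTGA"
print(translate(last_line))  # VPITSGVRYIVVAFLYLS
```

Passing the complete gene sequence to translate should reproduce the protein sequence above if the assumed genetic code is the one the database used, and gc_fraction on the same input can be compared with the GC content reported in the Gene Information table.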