Gene OSTLU_36349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_36349 
Symbol 
ID5000294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp295784 
End bp299167 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table 
GC content51% 
IMG OID640415715 
Productpredicted protein 
Protein accessionXP_001416361 
Protein GI145343504 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5077] Ubiquitin carboxyl-terminal hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0304624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAGCG GTTCGGAAGA TAGCGAGTTG GCGTTCTCGA CGGCGGAGCC GACGACGTTC 
TCGTGGCGCG CCGAGTTCTC GCGATGGAAG AAACGAGACG CGAAGGTGGT GAGTCAAACG
TTTGAGTGCG GAGATACACT CTTTCGGTTG GCGATGTATC CGTTCGGGAG TAATTTGAAC
TCGAAATCGG AGACGCCGGC GCAGGTGAGC TTGTTCTTGG ACACGGGGGC GACAAAGCCG
CGCCGAATCG AGGACGACAT GAGTAGAGAG TGGAGAAGGC ACGCGAAGTT CGAATTGCAG
CTGCTTCATC CGACGGATGC GTCGGTGGTA GAATCGAAGG AAACGTCGCA CACGTTCGAC
AGACGCGAAG CGGATTGGGG GTTCGCGTCG TTCATCACGC GCGAAGACGT TTTTGAAAAG
GGCTACGTTG ACGCCGAAGG CTGTGTGAAT TTCCGAGTGC ATGTGACGCC GATTGAGGAG
CACGAAGTCG ACCGACCGAT GCAGAGTGCG TTTTATAGCG AATACGACTC GCGCAAAGAG
ACAGGGCTGA TCGGTCTGAA GAATCAAGGG GCGACGTGTT ACATGAACTC GCTCTTGCAG
ACGCTCTATC ACATTCCCTC CTTTCGTCGC GCGGTCTATC ACATGCCTAC GAACGAAACG
GAAGAGGCGC ACACATCGAT GCCATTAGCG TTACAATCGG TGTTCTACTG TCTTCAGTAC
GCCAAAGAAG GCGACGTGAG CACGGAGGAT TTGACGCGAT CGTTTGGATG GGACTCTTAC
GATTCCTTCA TGCAACACGA CGTACAGGAA CTCAACCGTG TACTTCAGGA TAAGCTTGAA
GAGGCCATGA AACAAACGTG CGTCGAGGGC ACGATTCAGA AGCTCTTCGA AGGGCACACG
ACGAACTTCA TCGAGTGCAT CAACGTTGAT TACAAGAGCG AACGCAAGGA GGAGTTTCTC
GATCTTCAGT TAGACGTCAA GGGATGCAAA GATATTTATG CGTCCTTTGA TCGTTACACT
GAGATTGAAA AACTTGATGG CGAGAACAAG TATCGCGCCG AGGGACACGG ATTGCAAGAC
GCTCGCAAAG GCACGCTGTT CCACGACTTT CCTCCCGTGT TGCAGATTCA GCTGAAGCGT
TTTGAGTACG ATTACCAACG AGACACCATG GTGAAGATCC ATGATAGATA TGAGTTCCCC
GAAGAGCTCG ATCTCGACGT GGGTGATCGT AAGTACCTCG TTCCCGAGTC CGACAAGAGT
GTTCGCAACA AGTATAAGCT TCATAGCGTC TTAGTACACA GCGGGGGGAT AAATGGTGGA
CATTATTACG CGTTTGTCAA GCCCAATTTG CAGGCGGAAG ATGCGCAGTG GTTCAAGTTT
GATGACGAAC ACGTGACGAA AGAAACCGCA GAAAAGTCGG TGGTGGAACA GTACGGAAGC
GGCGGCGCAG CCGCGGTCGA TAGCGATATG GATGCAGACG ATGACTCGAC AAACGTCCGC
GTGGCGCCGA ACTTGCGGTT CCAAAAAGTG AGTAGCGCCT ACATGCTCGT GTACATTCGA
GAGGATGACA TGGATCAAAT CATGTGCGTC GCCAATAAGT CTCATCTCAC GGAATACCTC
CAGGCCCGCT TCGCTGAGGA GCAGAAGGCA AAGGAGAAAG AGGCGCAGGA AAAGAAGGAG
GCGCATCTGT ACACCATCAT CAAGGTTTTG ACCAGGCAAG ATTTAGAGCG GCAGATCGCT
TCGGAGAAGT TTTTTGATCT GGGCAATTTT GAGAGCGCGC AAAGGTTCCG ACTGCATAAA
AAGTCGACAT TTACAAAGTT TAGGGAGCTC GTTTCCGAGA AGCTCGGAAT ACCAGCTGAG
AGGCAGCGGT ACTGGACTTA CTCGCCACGA CGCAACAAAA CATCTCGGCC AGCCACCGCG
CTTCCAGACC ACGCAAATAC CCCACCGACC TGGACGGTTG AAAAAACACG GCTGAAATAC
ACGGTGCCTA ACTCACAGCA TGCTTCATCA GGCGAATTCA GACTCTACCT GGAAGAGCTT
GACGATGATG CGTTCGCGAA CAGTGACCCA GAAAGAGACA TTTGGTTGCA CGTCAAGCTT
TACAATCCGC ATGAGGCGCG GTTGAGTTAC TGTGGCACGC TTTACGCGAG TCCCGAGGAG
ACGCTCAGTA CCTACATGCC GAAGATCAAA TCCATGGCAG GGTTTGCCAG TACCGCGTCA
ACGCTCATGT TTGAGGAAAT TGCGTTCGTT CGCGAAAGTA AGATTCAAAT TGATCAATTA
TCGGACAAAC AAGTGAAAAC ATATCCGTTG AGCGACCCCG ATGACGGTAG CAAGACGTTG
CAGCTCGGTA ACGGTGATAT CTTGCTTATT CAACCAGAAA TCACTGAAGA CATGGAAGAT
TCACTAAAGT TCCCCAACGT GGTTCAATAT GCGGACTTCA GACACAATCA TCAAATCGTT
CACTTCAGAG AGCTGGAGGC CCCTAAGGTG GACAAAGTGA CTCTGGAGTT GACGAAAATG
ATGAGTTACG ATCAAGTCGC TGATGTTTTG GCTTCGGCGA TCGGTTTGGA CGATCCTCTT
CGATTGCGCT TCACCGCGCA TCACGTGTAC ACGAACGGTC CAAAGAGTGC AAGCTTCCAA
TTTAGAGGCG CGGATACTTT GATAAAAATG CTGGAAAACC AGCAAAGCGA CGTGCTCTAT
TACGAAGTCT TAGACATGCC GCTGCCCGAA CTCCAGGAAT TGAAGACGCT AAAAGTTTTC
TTCCATGGAC TGAACACAAA GCTCGTTGAA GAATTCCAAC TGCGTCTGTC GAAGAGCGCA
GCGGTGAAGG ATGTCCTCGA AGAGGTCAGA TCTAGGCTCG GTACTCGAGT CGGCGGTCGC
AAATTACGTC TGCTTGAACT TTTCTACTCG CAAATCTACA AAGTTTTCGA GGAGGAAAAG
GATATCGCAG ATATCAACGA CCAATATTGG ACGCTTCGTG CGGAGGAAGT TCCCGATGAC
GAGTCAGAGG AAGACAGACT CTTGCGTGTG TACAACATTT CCAAAGACTT GTCCAATCCT
AACCAGTTCT ATGCCTACGA TGAACCGATG TTACTTCGCA CGTGTGAAGG TGAGACGCTC
GGGGAAGTGA AGGCTCGGAT CAAGACGAGG CTCGAAGCGA CGGATGAGGA CTTCGCAAAA
TGGAAGTTCT ACATCGGCCA CCCGCCGCGG TATGAAATCT TGGACGACGA CGAGTTGGTT
ATATCGAGTA AATTGGTTCG CATCGCCAAA GAGGGCTTTT GCGAATCGAC GCTGGGGATC
GAGCGCGAAG TTAGAGGTCC GCGAAGGCCG GCTAGCCGTC AGGGGAAGCC GGCTGGATTT
GAGCGAGCGA TCAAAATCAT GTGA
 
Protein sequence
MSSGSEDSEL AFSTAEPTTF SWRAEFSRWK KRDAKVVSQT FECGDTLFRL AMYPFGSNLN 
SKSETPAQVS LFLDTGATKP RRIEDDMSRE WRRHAKFELQ LLHPTDASVV ESKETSHTFD
RREADWGFAS FITREDVFEK GYVDAEGCVN FRVHVTPIEE HEVDRPMQSA FYSEYDSRKE
TGLIGLKNQG ATCYMNSLLQ TLYHIPSFRR AVYHMPTNET EEAHTSMPLA LQSVFYCLQY
AKEGDVSTED LTRSFGWDSY DSFMQHDVQE LNRVLQDKLE EAMKQTCVEG TIQKLFEGHT
TNFIECINVD YKSERKEEFL DLQLDVKGCK DIYASFDRYT EIEKLDGENK YRAEGHGLQD
ARKGTLFHDF PPVLQIQLKR FEYDYQRDTM VKIHDRYEFP EELDLDVGDR KYLVPESDKS
VRNKYKLHSV LVHSGGINGG HYYAFVKPNL QAEDAQWFKF DDEHVTKETA EKSVVEQYGS
GGAAAVDSDM DADDDSTNVR VAPNLRFQKV SSAYMLVYIR EDDMDQIMCV ANKSHLTEYL
QARFAEEQKA KEKEAQEKKE AHLYTIIKVL TRQDLERQIA SEKFFDLGNF ESAQRFRLHK
KSTFTKFREL VSEKLGIPAE RQRYWTYSPR RNKTSRPATA LPDHANTPPT WTVEKTRLKY
TVPNSQHASS GEFRLYLEEL DDDAFANSDP ERDIWLHVKL YNPHEARLSY CGTLYASPEE
TLSTYMPKIK SMAGFASTAS TLMFEEIAFV RESKIQIDQL SDKQVKTYPL SDPDDGSKTL
QLGNGDILLI QPEITEDMED SLKFPNVVQY ADFRHNHQIV HFRELEAPKV DKVTLELTKM
MSYDQVADVL ASAIGLDDPL RLRFTAHHVY TNGPKSASFQ FRGADTLIKM LENQQSDVLY
YEVLDMPLPE LQELKTLKVF FHGLNTKLVE EFQLRLSKSA AVKDVLEEVR SRLGTRVGGR
KLRLLELFYS QIYKVFEEEK DIADINDQYW TLRAEEVPDD ESEEDRLLRV YNISKDLSNP
NQFYAYDEPM LLRTCEGETL GEVKARIKTR LEATDEDFAK WKFYIGHPPR YEILDDDELV
ISSKLVRIAK EGFCESTLGI EREVRGPRRP ASRQGKPAGF ERAIKIM