Gene OSTLU_31122 details


Gene Information

Locus tag: OSTLU_31122
Symbol:
ID: 5001174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 339457
End bp: 343302
Gene Length: 3846 bp
Protein Length: 1197 aa
Translation table:
GC content: 55%
IMG OID: 640416595
Product: predicted protein
Protein accession: XP_001417480
Protein GI: 145345989
COG category: [R] General function prediction only
COG ID: [COG5141] PHD zinc finger-containing protein
TIGRFAM ID:
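
The Start bp, End bp and Gene Length fields above are linked by simple arithmetic. The sketch below is a minimal illustration in Python, assuming the coordinates are 1-based and inclusive. The record does not state this convention, but it is consistent with the reported length. The function name feature_length is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates.

    Assumption: both coordinates are on the same replicon and
    start_bp <= end_bp (the record lists no strand, so no
    strand-specific handling is attempted here).
    """
    if start_bp < 1 or end_bp < start_bp:
        raise ValueError("expected 1 <= start_bp <= end_bp")
    # Inclusive coordinates: both endpoints count, hence the +1.
    return end_bp - start_bp + 1


# Values taken from the record above.
print(feature_length(339457, 343302))  # prints 3846
```

The result matches the Gene Length field (3846 bp), which supports the inclusive-coordinate assumption.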


Plasmid Coverage information

Num covering plasmid clones: 147
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGCGACGCG CGCGACATCC GCGGCGACCG CGCGCGCACC GCACGCGGAA ATCGCGGTCA 
CGGTCGCGGT CGACGCGCCG CGGCGGCCCG CGAGCGCGCG AATCGACGGG AATTCGTCGC
GATCTTTCGG AATCGGCGGA ATCTTCGATT GCGTTCGTCG GAAGCGCGCG ACGTCGACGC
GCGATATGCA CGGCGCGAAC GCGGTCGAGA GCGATGCGGA TATGAGCGCA CAGGACGCGA
TCGCGGCGCT CGTGGCGCGC GTGGCGGCGG GGAGCGGGAA TGAAAGAGGC GGCTCGCGAG
GCAACGGCGA CGGCGAGACG ACGTCGGCGA ACGGCGAGGA AGACGATGGT GGAGACGGAG
AGCCGGCGTG GATGGCGGAT ATAGACGCAC GCATAGCGAC GGTGGAGGAT ATGGAGGTAG
AGACGGTGAA TGGGACGTAT GGGATGAACG ACGAGAGCGA CGCGGAGGGA GCGAGAGTTG
ACGCGCCGGG GCAGTTTGAT GATTTGTTTG ACGAACGCGC GGAGACGATT GTGGATTTGG
ATGAATCTAG TATGAAGACG ACGTCGATAA TGGCGTTTTT GGAGACGTTC GCACTCGGGC
CGGAAAGTTT GAATGGTGGA CGCGTGACGC CAACGATGGG CGAAGTGCTC GAGCGAGTCA
AGGCACGCAA GCCAGAATCT TTACCAGAGT GGACGATGAC GGCGGCGGCG GCGGCGAGCG
CGTATCAGTT TGCCGTCAGT AAAAAGCGCG CCGCTGATCG AGCCGTGGCA TACGCCAAGG
AGATGAGAAA GAAACACTCG AAGGTGCAAG CAAAAGCTGA CGCACTGGCG AGTCAGTTGC
ATGTGTTGCA ATACGGCAAG ACACCGGACG GAAAGCCCGT GAAACCGGGT CAGCTTAAAA
ACCCTCCGCA TCAAACTCGA CCACGTGGGA AGATGCCACC GCCGCCTCCC GAGCATATTT
CTAGGCTGTG GAGGACGTCT CTCGAGGATT ACGATGTGTG TCACGTCTGC GGCCATCAAG
CGCCAGATTG GTGGCATGCG AAGGATGAGA TTGTCTTCTG CGACGGATGC GAAATTCAGG
TTCACATGTC TTGCTACGGA ATCAAGAACC TTCCTGAGGG TGATTGGTAC TGCACTGGTT
GCAAAGAGGG CGTCACCAAA GGCCCGCTCG TGAGCTGTGG AACTCCTAGG GGTATCTGTG
CATTGTGCCC ACATCCGGGT GGCGCACTCG TGCGTGTGGT CCCAGCGTCG AAATTCGAAA
CACCATGGCT TTCGCCGGGT CACCATGCGC ATCTTGCGTG CGCTTTGCAC TTACCCGAAG
TAGTCATGCG TCATCAAAAG ACTGGGGAAT CTCCCATTGT CGACATGTTG CTCGTTCAGT
CGAAACGCAT GAAATTGAAG TGTTCGGTTT GTGAAGAAAT AGGCGCGTGC ACGCAGTGCG
CGATGCACAA GTGTTATACC GCGTTTCATC CACTATGTGC GCGTGCCGAT AAACAAATTA
TGTTGAGGCA AGCGGCATCA GGACAGCCCA TGGTGTTTTG CAAGGCGCAC AGCGCGCCCG
AATTCGAGCA ACAGCGTTTC CTCACGTGTG GATACCGCGA AGATGGCACG AGCGATATGT
TACGTAATAA GAATGTGTTT TGGGATGTGA AGGCGCAATC GCACGAGGAA GACGCTGCAT
TTGTGGATCC GAATAGAAGC ACTGCGAGCG CCCAAAAGTC GCATCCGAGC GCGCAGGAAG
AGAAGCAAAA AGATGTGACG CTGGAAGTTT TCCCGCGGGC GACGAGCGCT GAGGCAAAGC
GAATTGCGAC GATCACATGC GTGCGCCATT ACTTGGATTC GACGCACAAG GGAGGAGCCC
CGAGCAAGCT CTTACGGGAT ATGCAAGACG CCTGTAAACA TCTCGAAGCG GAAGAGGCGA
GTCGCTTGAA TTCGCTCAGT GACGAACTCG GGGAAGCAGA GATTAAAACG GCGTTGGATC
ATTTGACCAG ATTGGCGCCG AAGATTCCGC AAGGCAAAGC TCTTTACGCG GATCTCGCGC
CTTGGAAATA TTTGAAGTCA CATCAAATCG ACGCGATTCA TTGGATGCGA AACCTGCATT
CGCTGGGCGT GAACGGGCTG CTTGCTTTTG AAACGGGCCT GGGCAAGCGG CTCACGTCGC
TCGCTTTTAT TCAATGGGTT CGAGACGGCT TACGCGAACC GGGTCAGCAC TTGATTCTTT
GCCCGAAGGA AACAGCACAC TTATGGATCG CTGATCTTCA TCGTTGGTGT ACTTCCCTGC
GTAGTGTCAC GTTGATGTCT GAAGAGGATG AAAAGAGTGC GAACAATGTT CAAATGATAA
AAGCACTGTC GTATGATTAC CTCGTCGTTT CGTACGAGCG CATGAGTCAA TGCGAAGCGT
TGAAAACTAC GATGTTCAAG AGCGTGATTT GTGACGACTG GCGAAATGAG GCGAGTGTCG
GCGCGCTTTC CTCCGCTGTC CAAGAGAAAG TTCTTCAAGC AGGATCCACG TTCTTGTTGG
GCAAAGCCGA TGACTTTGAC GGCCCTCACG CCGTCGCCTT GGGACGCGTC ATTTTCCCAG
GACTGGGCCA AAGTACGAAG TCTCTACCGG TAAAGTGGAG AGATCTGATC GTACATAGCA
CGAGTGAGCG CGTGGTTGAA CCACCCGTCG TTCCCGCGCC ACTGGCCAAG TTTTTACAGT
TTGAGATTGA TGCGAAGACG AATCCCGTCG ACGCTCTCAT GCATATCCTT CAGACGATGC
GCAAGCAGGC GCAGAAAGTG CTAGTCATCG CGGAGGATGA AAATGCGTTG AATAGCGCGG
CGGAAGGGAT GAGTATGGCC AAGTTAGAAT ACGCTCGCTT GGACGAAACC GATCAGCCTT
TGGGCGTTGC GCTGTACTCT GCGGCGAGAT TCAACACCAT GGCGCCAGAG AACAACATCT
CCTTTCTGTT GTGCAGCTTC CACAATCTTG GACGTCAGCA AGGTTTGCTT TCGGGCGTTA
CTTCTATTTT GCAACTCGAT GGCGAATTCT CGAGTAAGAT CGACGCCTCT GGTGCGCCGA
ATTTGATGAA CATGTGGCGC ATTGCGAACA GAGTTTGGGG ATTGGATCCG GTGAAGCGCA
CGACGTACTT CATGAAAGTC GTCGGCACCA ACGGGCAAGA TATAGCGTGG AAGCGTGGTG
CTGCGTTGAA TAAGTTATCT GCAAACCCGC CGGAATCGCT CGTGTCTTCG CTCGAGAAAA
TTCCAGCGAA CGGGAACGTG CCTCCGGAAT TCGCCTCAAA AGATGCCGAA CTCTGGGACG
CCGCCGCTGA ACGAAAGCAA AAGAGGGCAG AAGTCGAAGA CCCCACGTGG TGGACTTTGC
ACGAACACAA CTGCGTGTAC TGTGGTGGAG TGCCTGGACA GTGCAAAAAC ATTCCGCCCG
CCTTTCTTCC ACCGGAAAAG GCTGGTTTGA AGATTCGCTG CATTTCGTGT CCGCGCATCA
CTTCGATGGG GTGCGCGTAC ATTCGCACCG TTCCGCAAAA GGGTTGGAAG TGTCCACAAC
ATAGTTGCTT GGTGTGCCTC AAAGAAGCGT CACCGAATCA CATCACATTC CGATGCATCT
CTTGTTCGCG GGCTTTCTGC GATTCGTGTT CTTCCGGGGC CTCATTCGAC GCCATCGCCG
ACCATCCAGT GTGGGCGCCG TCTGGCTTTC AGCTCCCGAA ATTCTACGAG TACGTGCGAT
GTGCGATTTG CGTCGATTCC ATCCTCGCGG AAACGAAACG TGCGAGACGA GGGAAATAAA
TAACGAGTGT TTACACGCAT AGCTCCATAG ATTAAAATCC TTTAGTAGTC ATCTAAGAAT
CCGATC
 
Protein sequence
MHGANAVESD ADMSAQDAIA ALVARVAAGS GNERGGSRGN GDGETTSANG EEDDGGDGEP 
AWMADIDARI ATVEDMEVET VNGTYGMNDE SDAEGARVDA PGQFDDLFDE RAETIVDLDE
SSMKTTSIMA FLETFALGPE SLNGGRVTPT MGEVLERVKA RKPESLPEWT MTAAAAASAY
QFAVSKKRAA DRAVAYAKEM RKKHSKVQAK ADALASQLHV LQYGKTPDGK PVKPGQLKNP
PHQTRPRGKM PPPPPEHISR LWRTSLEDYD VCHVCGHQAP DWWHAKDEIV FCDGCEIQVH
MSCYGIKNLP EGDWYCTGCK EGVTKGPLVS CGTPRGICAL CPHPGGALVR VVPASKFETP
WLSPGHHAHL ACALHLPEVV MRHQKTGESP IVDMLLVQSK RMKLKCSVCE EIGACTQCAM
HKCYTAFHPL CARADKQIML RQAASGQPMV FCKAHSAPEF EQQRFLTCGY REDGTSDMLR
NKNVFWDVKA QSHEEDAAFV DPNRSTASAQ KSHPSAQEEK QKDVTLEVFP RATSAEAKRI
ATITCVRHYL DSTHKGGAPS KLLRDMQDAC KHLEAEEASR LNSLSDELGE AEIKTALDHL
TRLAPKIPQG KALYADLAPW KYLKSHQIDA IHWMRNLHSL GVNGLLAFET GLGKRLTSLA
FIQWVRDGLR EPGQHLILCP KETAHLWIAD LHRWCTSLRS VTLMSEEDEK SANNVQMIKA
LSYDYLVVSY ERMSQCEALK TTMFKSVICD DWRNEASVGA LSSAVQEKVL QAGSTFLLGK
ADDFDGPHAV ALGRVIFPGL GQSTKSLPVK WRDLIVHSTS ERVVEPPVVP APLAKFLQFE
IDAKTNPVDA LMHILQTMRK QAQKVLVIAE DENALNSAAE GMSMAKLEYA RLDETDQPLG
VALYSAARFN TMAPENNISF LLCSFHNLGR QQGLLSGVTS ILQLDGEFSS KIDASGAPNL
MNMWRIANRV WGLDPVKRTT YFMKVVGTNG QDIAWKRGAA LNKLSANPPE SLVSSLEKIP
ANGNVPPEFA SKDAELWDAA AERKQKRAEV EDPTWWTLHE HNCVYCGGVP GQCKNIPPAF
LPPEKAGLKI RCISCPRITS MGCAYIRTVP QKGWKCPQHS CLVCLKEASP NHITFRCISC
SRAFCDSCSS GASFDAIADH PVWAPSGFQL PKFYEYVRCA ICVDSILAET KRARRGK