Gene OSTLU_32976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32976 
SymbolJMJ3501 
ID5003377 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp178438 
End bp182181 
Gene Length3744 bp 
Protein Length1194 aa 
Translation table 
GC content61% 
IMG OID640418798 
Productpredicted protein 
Protein accessionXP_001419094 
Protein GI145349340 
COG category 
COG ID 
TIGRFAM ID[TIGR01557] myb-like DNA-binding domain, SHAQKYF class 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.878376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.932901 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGCCGCGAC CTCCATGAGC GCGCGCCGCG ACGTCGACGC GCGTCACGCG TCGACGTCCG 
AAAAGGTGCG CCACGCGATG CCGCGCGAGT CCGCGCCGCG CCGCGCCGCG CCGCGCGCGC
GAACCGTCGA CTGACGCGCG ATACCGACGA CGGCAGGATT CCGAAAGAAA ACCGTCGCGA
ACGACGACGG CGGACATCGC GAGCGCGCCG ACGTTCCGTC CGACGCTCGA GGAGTTCGCG
GACCCGATCG CGTACCTGTC GTCGATCGAG GCGCGCGCGC GCGAAGCGGG GATATGCAAG
GTGATACCGC CGCGAGGCGC GGCGCCGAGG TGGAACGGCG AGGCGTGGAG GCGAGACGAC
GCGCGATTCG AGACGAAATT GCAGAACGTA CACTCGCTGA GCGAGGGAAG GACGTTTCAG
TTCGGGAAGG AGTACGCGAA AGGGGAGTAC GAGGCGATGG CGAAGGCGTA TGAGGAACGG
TGGGCGAAGG AACGTCCGGA CGTCGACGCG AACGACGCGA ACGCGCTGGA GCGAGCGTTT
TGGGATATGG TGGAGACGCG GAGCGAGCAG GCGCGAGTCG AGTACGGGAA TGATTTAGAT
ACCAAGATTT TCGGTACCGG GTTCGGGGTG GACGAAAACG GGGAGAAGCA TCCGTGGGAT
TTCGAGCATT TGTACTCGCA TCCGCTTAAT TTATTGCGCG TCGTCGAGCA CGACATTCCG
GGACTCACCA AGCCTTGGTT GTATCTTGGC ATGCTTTTCG CCACGTTTTG CTGGCACGTT
GAGGATCATT TCTTGTGTTC GCTTAACTAT TTGCATCGCG GGGCGGCGAA GACGTGGTAC
GGTGTGCCAG GAAGCGACGC GGAGGCGTTC GAGAATTGCG CTCGGGCGAC GGTGCCGCGC
CTATTCGAGC AAGCGCCAGA TATTTTACAT CAAATCGTCA CGATCGTCCC ACCTGGAGTA
TTGGTAGATC ATGGCGTCAA GGTCGTGCAC ACGGTGCAAC AGCCTGGGGA GTTTGTCGTG
ACGTTCCCTC GTGCTTACCA CGCCGGGTTT TCCCACGGTT TCAACGTCGC CGAAGCGGTG
AACTTCGGTC ATGTCAACTG GCTCGATTTC GGCCGTCGAG CCATCGACGT GTACAGCACC
GGATCGTTCA AACGCAACGC CGTGTTTGCA CATCATCGCC TCGTTTCGCG CGCCGCCGAA
ACCTTCGTCG AAGTTCTGGG TAAGAACGCT CGACTGGTGA AGAGTAAAGC CATGGGCGCC
ATCGTATCGA CGCTTCGCAA GGAGCTCGAA ACGATTTTGA GCGATGAAGA AATTTATCGT
GCCTCCCTCG TGCGTCGTGG ATTGAACATA GAAATCGTTC AAGCACCTAA CGAGGACGAC
GATGCGTGCT GTATTCGCTG CAAAGCGATG CCGTTTCTCT CCGTCGTGCG ATGCAAGTGT
CTACCGACGG CGGTGCGATG CCTTCGACAC GCCATGGACG CTTGTGATTG CGCGGCGGGG
GAGAGAACCT TAGAGATTCG CGTGGTTGAT TCACGACTTC GCGAGCTCAT TAAAGCACTG
TTCTTCGGTG ACGGCATACA AACCAAGAAC GACGCCGCGA AAGCGCGCGT AGATTTCTCG
GCCAATGTGA ACAGAGTCGC CGTCAATCGA GCGCCGCCGC CCAAGCCGAA AGTCGTCCTT
CCAAAGCCGA AAACCGTAAA ACCGCCGCCG ACGCGAGCGG TACTCGCGTC TCCACCCCCC
ACGCGCATCG TCGCATCGAA AGCCGACGAT GCTTTCACCG CGCGCGGTCT TCCGCGCAAG
CGAGCCAAGT GCGAAACCCG GCGGCGCTGG ACCGCCGAGA TGGTCGCCGA CTTCGAAGTC
GCCGTCGAGC GCCTGGGCGG CGTCGACGCG GCGACGGGCA AAAAGCTCGC CGAAGCGTTA
TCCGCGCACG ACGTCACGCG AGACCAATGC GCGAGTCGCT TGCAAAAGCA CCGCGAGAAA
ATCAAATCAA ACGCGGACGC GCGCGCAACC TTGTAATGTT ATTCCCTCGC CCCGCGCAGC
GCGCGTTACA GCATGGCGTC CAAACGACCC CGCGGCGACG CCACCGACGC GTCCACGCCT
CGAGCGCGCG TCTCCGAGGA AGACGCGCGC GCCCCAGTCT CCCTCAAGTC CCTGCTCGAG
CGATGGAACC TCGGCGACGT CGTAAACACC GCCGGCGTGT CGCGTCGAAT AAAGTGTCGA
CTCGTCCCGG TGTGTCGAAC GAAGGACGAC GAGCGCGCGC GCATCGCTCG AGGCGAGCCG
GTGATATGCG CGAGCGCAGA GTACGCGTCC GAAGTGTTCG ACGCGATCGG GACGGCGCGC
GACGGCCTCG CGTGGGCGAC GCGGTCGAGT TTCTCAAACT TTGACGCCGA GCGCGGCAAG
GTGGCCATGC GAAGCGGTTG GGGCGCGCCG GGGACGCACG TGCTGAGCAA TCGAGACATG
ACGATCGTGA CGTCGTTCGC CGAGGTCGTC GACCCGGCGA ATGAAAAGCC GCTGAATTTA
CAAATGTTTC ATCGCGAGGA CACCGCGGTG CCGGTGTTGT CGAGGAAATT GAAGTGGCCG
AGCGAGGAGG AATTTTTTGG AATCGAAGGC GAAGACGCGC CGGGGAGGCT TTTAGACGAC
GCGACGCGCG TGAGCGCGAG AGGGGCGATG ACGTGGTGGC ACTTGGATGA CTGTGGGGAG
TTTGTGTGTC AAGTCGGGTT GCCCGAGGCG GGGGAGGCGG CGGAGGACGT GTTGCTCGGG
CCGACGGGGA AACCCGTGGT GAAGTTGTTC ATTTTCGCCC AGAGGAAAGA CTACGCGTGG
GTGGCGCAAG ACGCAGAGAT GAATAAATCT TACAAAAATT GCGCACTGGA TCTTTTCGAT
ACGCCGGATC ATTATTATCC CACGGCGAGC GAGATGTGCC ACCCATCGAG CGCGCCGCTT
GACGTTTCGT CGCCAAAGGC GTTCGACGGC GCCGCGACGA GCGACGACGC CGAAGATCCA
TGTCCAACGT TTTGGGTCGC TCCGCTCGAG GCTGGAGGGC CACCTTTATT ATCACCTCCC
AATATCATAC ACTGTGTGCT CACCGTACGC GACTGCGTGA TGTGTGAAGA GCGCCGGCTT
TCGCTGGCGT ACATGGATGA AGTGTTGTAC TTTCAGCGAC GCGCGGCGAG ATGGTGCGAA
CCACCCATCT TCTACGCTTT CGTTCGTGAA GATTTGAGCG ACACGGAGAA GGCTAGGTCG
AACGCGATGC GGCCACTCGT GAAGATGCTG AATGACTTGA AGCGCGTTGG CGCGACAGAC
GGCGACGCGT ATCGCTTTGC GCGGTGTTTA ACGTCGTTGC GAGTTTTGGC GAATCATTCG
CCCGAATTCT ACGCACTCGA CGCAGATGGC GTCGCCGAGG CTCGTAAAAG CATCGACAAG
CTCGAGTCTT GGTTGGCTGA CGACTCAAAC TGTGAGTTCG TCGAGAAAAT TCAAGCCGCG
GTAAAGGCGG ATCCGCGAGC GGTGGAGGAC GCAGAACTCG CAGAGTCTAT GATGAGCGAA
ACACTTGGCG TGCTCAATCT CGCCGACGGA CGATCGTGCG CCGTCGTTCA CGAGCGTGGT
CGACCTCGCT GGGGCCCGGT GCGCAACTCC AAGTCGCTCG TAGACAAGGA CCGAAAAGAT
ATGAAGAATG CGATTCGTTC GGGAACGCTC GACGCGCTTC TCCTCGCGTA TCGCCGCGAC
ATCATCTAAT CTAGAAACTA GCGA
 
Protein sequence
MSARRDVDAR HASTSEKDSE RKPSRTTTAD IASAPTFRPT LEEFADPIAY LSSIEARARE 
AGICKVIPPR GAAPRWNGEA WRRDDARFET KLQNVHSLSE GRTFQFGKEY AKGEYEAMAK
AYEERWAKER PDVDANDANA LERAFWDMVE TRSEQARVEY GNDLDTKIFG TGFGVDENGE
KHPWDFEHLY SHPLNLLRVV EHDIPGLTKP WLYLGMLFAT FCWHVEDHFL CSLNYLHRGA
AKTWYGVPGS DAEAFENCAR ATVPRLFEQA PDILHQIVTI VPPGVLVDHG VKVVHTVQQP
GEFVVTFPRA YHAGFSHGFN VAEAVNFGHV NWLDFGRRAI DVYSTGSFKR NAVFAHHRLV
SRAAETFVEV LGKNARLVKS KAMGAIVSTL RKELETILSD EEIYRASLVR RGLNIEIVQA
PNEDDDACCI RCKAMPFLSV VRCKCLPTAV RCLRHAMDAC DCAAGERTLE IRVVDSRLRE
LIKALFFGDG IQTKNDAAKA RVDFSANVNR VAVNRAPPPK PKVVLPKPKT VKPPPTRAVL
ASPPPTRIVA SKADDAFTAR GLPRKRAKCE TRRRWTAEMV ADFEVAVERL GGVDAATGKK
LAEALSAHDV TRDQCASRLQ KHREKIKSNA DARATFMASK RPRGDATDAS TPRARVSEED
ARAPVSLKSL LERWNLGDVV NTAGVSRRIK CRLVPVCRTK DDERARIARG EPVICASAEY
ASEVFDAIGT ARDGLAWATR SSFSNFDAER GKVAMRSGWG APGTHVLSNR DMTIVTSFAE
VVDPANEKPL NLQMFHREDT AVPVLSRKLK WPSEEEFFGI EGEDAPGRLL DDATRVSARG
AMTWWHLDDC GEFVCQVGLP EAGEAAEDVL LGPTGKPVVK LFIFAQRKDY AWVAQDAEMN
KSYKNCALDL FDTPDHYYPT ASEMCHPSSA PLDVSSPKAF DGAATSDDAE DPCPTFWVAP
LEAGGPPLLS PPNIIHCVLT VRDCVMCEER RLSLAYMDEV LYFQRRAARW CEPPIFYAFV
REDLSDTEKA RSNAMRPLVK MLNDLKRVGA TDGDAYRFAR CLTSLRVLAN HSPEFYALDA
DGVAEARKSI DKLESWLADD SNCEFVEKIQ AAVKADPRAV EDAELAESMM SETLGVLNLA
DGRSCAVVHE RGRPRWGPVR NSKSLVDKDR KDMKNAIRSG TLDALLLAYR RDII