Gene OSTLU_19017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19017 
SymbolSDG3503 
ID5006750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp247408 
End bp250350 
Gene Length2943 bp 
Protein Length980 aa 
Translation table 
GC content58% 
IMG OID640422171 
Productpredicted protein 
Protein accessionXP_001422533 
Protein GI145356635 
COG category[R] General function prediction only 
COG ID[COG2940] Proteins containing SET domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.238798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC GCGCCGCGCG CGCGCCGCGC GCGGCGTCGC CGCGCGGCGC GGACGCGACG 
CGACGGCGCG ACGGCGCGAT CGACGCCGAC GACGCCGACG CGACGGCGAC CGCGCGCGCG
CGCGCGAGCG CGAACGACGC GCGCGCGCGA TCGGCGGCGG TGCTGGACGT CGTCGATCTC
ACGCTCGACG TCGAGGCGCC GCGCGCGGCG ACGGCGACGG CGACGGCGAC GGCGGACGCG
CGAGGCGACG CGACGCGCGG CGACGACGCC GAGGCGGTGA AGATCCTCGA CGACGGTTTG
CCGACGTCGA AGCCGAAGGC GAGGGATGGA AAGCCGACGG CGCGATCGGG TGAGAATGGA
AAGTCGCGCG CGTCGGGGTC GGGGCGGATC GAACGCGTGG CGGAGGTGCT GCAGTCGCCC
GAGAAGGCGT GGCCGCGAGA GGAGGCGGCG GCGGAGGACG ACGACGACGC GAAGGTTTCG
GTGAGTTTTG AACTCGAGTT CTGCGCGGCG ATGCGAGAGG GGTTGGAGGG AAGACGGACG
GAGGCGATGG AGGCGATGGA GACGCTCGAA GCGACGGTGG GATCGAGGGT GTTTCAGAAC
GCGGACGCGG CGGCGGCGGA GGCGCGGACG AGGACGAGGA GCGCGGCGAA GCGCGCGCGT
AGGATTTTTC GGGAGGAAAC TAAGCTTTGG GCGGCGGCGG TGGTGAAAGC GCAGCAACTG
ACGATAGACG ATGACGTGCC GTCGACGTCG GAACCCGTCG CGATGGAGTT GCCGTTTTGG
CAGCGCATCT CGAGCGAGCA CCTGGGCGCG TCGAGCGCGA GGAATACGAA GATTTTCACG
CAGCGCACCG AACGGCCGTA TTCGAGTGAT CCGACCGATG TAAGCGGCTG GAAGCTGCAA
ACAATAGTAG GGCGACAATC TGTACCTCGA ATAGACATCG GTAGCGCCAA GGCGGTGCCG
CCCTACGCTT ACTTTGCGTA CTCGACGCAC TGCAACTCGT ACGAGGCGGA AGGAAACGTC
TCGCGCCTAC TCTTCAGGGA CGACGATGGA GAGTTCTTAG AATCAGATCC CGTCGATCGG
CGCGAAGACG AGAGTAATGA GCTCACTCGC GAGCAGGAAA TCATCATGTG CGCGATATGC
GCGGAGTTTA GCGAGTTCAT TTTGACAGAA AAAGAAGTCG TGCGAGGGGT GAATCGCGAA
GACGGCGTGA AAGCCGTCGT GGTACAAACA GCCGAATATT TGAACTTGGA CGAAAACCAA
GTCAAGGATT GGTTTGATGA GACGCGCACG AAGCACAGCA CGAGCCGTGC ATGGTGTATG
TTTCTCGAAG TTGCCTCGCA CGTACGCAAG TTGATGGGCT TCTCCTCGGC GCATTGGCGC
GCGAAGATGG CAAACACATT CAGCGTTCTC GAAACTCTCG GGATCAGCGA ATTGTTCTGG
CGGAAGTTTT CGCGAATCAT CATCAATTGT CCGACGCTCG CTCCGTTGAA AAAGCCTGTC
ATCGTGTTTG ACAATCTCAA CGAAGCCATG GATCAGCTCG CCGGCATGTT TTGTCCGCGA
TGCTTCATCT TTGATTGCAG AACGCACGGG TCGTTGCAGC CCAAGTCGGA AGGCAGGAAG
CTTGATGCGG AAAGAAAGCT TGCATGGCGC GAGCGCATGG CAAAGAGCGG CATGTCTGCG
GAAAAGCCGT TAGCTGAGCG ACGATGTTCG ACAGATTGCT GGTATCAAAC AGAAGAGTAC
AAGTACTACT CTGCGCAGAC AACCTGCGCA CCATGCGATC CCACAGAAAC TCTCAATCGT
CCGTCGACGA AAGATCCGTT CATCGAGACG ACGAGGAAAT GGCGCAACGC GATGGATATT
GAAGTCTTGA AGAAGGCTGT CAAAATAATC GGTGAGAAAA CCACGGCGTG CGAGGCAGCG
TTGTTCTTTG GCCGTCGTCG CACGTGCGCC GAAGTCGGGA AGCAAATGCA CTGTTTGGAT
CTCATCAACC TTGGAACTGT GGTGAAGGAA GAAGAGCGCG ACGCGATGGA TGAAGATACC
GACGAATTGA GTAATCCGAA GAAGCGCAAA CGCGCGCCGA CGGGGGTCAA AAATCCAACA
ATTGCGCGAC GATTGAAGAT GCAAAAGGAT GCCGATTTTC TGGAAACGCA ATACTCCCCG
TGCGAATGCG TCGGCGCTTG TGACGCTAAC ACGTGCTCCT GCATTAAGAA TGGTACCTTT
TGTGAGAGAT TTTGCAACTG TGGACCGAAG TGTCACAACG AGTTCGAGGG TTGCAAGTGC
GACAGTACGA AGCGCGCAAC GTGCGGCACA AGAACGTGTC CGTGCTACGC CGCCGGTCGC
GAATGCACGC CAGATAAATG TAAACGGTGT TGCAAGACCG CTGATGCGTA CTCTTTGCCC
GCTCGTAAAA GGTATGGCCT CGTCGATCCG AACATGCAAC TGCCCATGCC GGCGTTTCCG
TGTGAGAACA TGAAGCTACA ACTTCGACAG AAGGAGCACA TTTGTTTGGG TCGAAGCGGT
GTTGCCGGTT GGGGTGCGTT CGTGTTGAAA GGCGCTCGGA AAGGAGAGTT CATCGGCGAA
TACGTCGGCG AACTCGTGAC TCAGGACGAA GCCGAACGTC GAGGAACGGT GTACGATGTC
AACAACTGCT CGTACTTGTT CAATCTCAAC AGCGAATGGT GCGTCGACGC TCAATACAGA
GGGAACAAAC TGCGCTTTGC CAATCACTCG AAGAACCCGA ATTGCGTGCC TCGCGTTCTC
GCGGTGAATG GTGATCATCG ACTGGCGCTG ATATCAGACA AAGACATCAA ACCAGGCGAT
GAATTACTGT TCGACTACAA TTACAAGGAC GAAGTCGCAC CCGACTGGCA CGAGAAAAAC
GCATCGACGT TGCCCAAGTC GAAGCACCTT CCAACGAAAA GCGCGAAGAA ATCATCGAAC
TGA
 
Protein sequence
MTTRAARAPR AASPRGADAT RRRDGAIDAD DADATATARA RASANDARAR SAAVLDVVDL 
TLDVEAPRAA TATATATADA RGDATRGDDA EAVKILDDGL PTSKPKARDG KPTARSGENG
KSRASGSGRI ERVAEVLQSP EKAWPREEAA AEDDDDAKVS VSFELEFCAA MREGLEGRRT
EAMEAMETLE ATVGSRVFQN ADAAAAEART RTRSAAKRAR RIFREETKLW AAAVVKAQQL
TIDDDVPSTS EPVAMELPFW QRISSEHLGA SSARNTKIFT QRTERPYSSD PTDVSGWKLQ
TIVGRQSVPR IDIGSAKAVP PYAYFAYSTH CNSYEAEGNV SRLLFRDDDG EFLESDPVDR
REDESNELTR EQEIIMCAIC AEFSEFILTE KEVVRGVNRE DGVKAVVVQT AEYLNLDENQ
VKDWFDETRT KHSTSRAWCM FLEVASHVRK LMGFSSAHWR AKMANTFSVL ETLGISELFW
RKFSRIIINC PTLAPLKKPV IVFDNLNEAM DQLAGMFCPR CFIFDCRTHG SLQPKSEGRK
LDAERKLAWR ERMAKSGMSA EKPLAERRCS TDCWYQTEEY KYYSAQTTCA PCDPTETLNR
PSTKDPFIET TRKWRNAMDI EVLKKAVKII GEKTTACEAA LFFGRRRTCA EVGKQMHCLD
LINLGTVVKE EERDAMDEDT DELSNPKKRK RAPTGVKNPT IARRLKMQKD ADFLETQYSP
CECVGACDAN TCSCIKNGTF CERFCNCGPK CHNEFEGCKC DSTKRATCGT RTCPCYAAGR
ECTPDKCKRC CKTADAYSLP ARKRYGLVDP NMQLPMPAFP CENMKLQLRQ KEHICLGRSG
VAGWGAFVLK GARKGEFIGE YVGELVTQDE AERRGTVYDV NNCSYLFNLN SEWCVDAQYR
GNKLRFANHS KNPNCVPRVL AVNGDHRLAL ISDKDIKPGD ELLFDYNYKD EVAPDWHEKN
ASTLPKSKHL PTKSAKKSSN