Gene OSTLU_31513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31513 
Symbol 
ID5001761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp73997 
End bp76395 
Gene Length2399 bp 
Protein Length756 aa 
Translation table 
GC content57% 
IMG OID640417182 
Productpredicted protein 
Protein accessionXP_001417909 
Protein GI145346879 
COG category[A] RNA processing and modification
[D] Cell cycle control, cell division, chromosome partitioning
[K] Transcription 
COG ID[COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCGCGCGTC ACGACCGACC GACCGACGCG CGCGGTCTCG CGACGAAGAC GAAGGTAAAG 
ACGCTTTCAG CGCCGCGCCC TTCGTGCGAA CGACGACGCC GACGACGACG GAAGAGAGAC
GCGAACCATG CGCATCCAAA TCAAAGGCGG CGTGTGGAAA AACACCGAAG ATGAGATCTT
AAAGGCGGCG GTGATGAAGT ACGGCAAGAA CCAGTGGCCG AGAATCGCGT CGCTGTTGAA
TCGGAAGTCG GCGAAGCAGT GCAAAGCGCG ATGGTTCGAA TGGCTCGATC CGAGCATCAA
AAAGACGGAG TGGACGCGGG AGGAGGACGA AAAGCTGCTG CACCTGGCGA AATTGATGCC
GACGCAGTGG AGAACGATCG CGCCCGTGGT CGGACGGACG CCGAGTCAGT GCTTGGAGAG
ATATGAGAAG TTACTCGATG CGGCGTGCGC GAAGGATGAC GATTACGACG CCGGAGATGA
TCCGAGAAGA CTGCGTCCGG GGGAAATTGA CCCTAACCCA GAGACGAAGC CGGCGAAACC
GGATGCGGTG GATATGGATG AAGACGAGAA GGAGATGCTC GCTGAGGCTC GAGCGAGACT
TGCGAACACG AAGGGTAAAA AAGCGAAGCG AAAAGCACGG GAGAAGCAGT TGGAGGAGGC
GAGAAGGTTG GCTGAGTTGC AAAAGAAGCG CGAACTGAAG GCGGCAGGAA TCGCACACGT
GCGGCGCGCA AAGCGCGTTC GAGGCGTGGA TTACAACGCC GAAATCGCGT TTGAACGAAA
GCCGGACGCA GTGATGTACG ATACGCGCGA AGAAGACGAA GCATTTGCAA AGCAGCAATC
TGCAAAGGTG TTTAAACCAA TTTCGCTCGC CGAGCTCGAA GGGAAGAAGA GCGCAAAGCA
ATTAGACGAA GAGAGCAAGA AGCGTGAGGC GGCGAAACAG AAAATGCAAG AGCGTCGCGA
TATGCCCGGT GCAGTACAAC AAGCGCTCAA AGTGAACGAC GCGTCGTTTT TTCGACGATC
GAAGCTCATG TTACCGACGC CGCAAGTGTC TGACCGAGAG CTGGAGGACA TCGCAAAGAT
TGGCAAAGGA GGCGTCGGCT TGCTCGACGA CGGCAGCGCG ACGCCTGCAT CTGGGCTATT
AGGATCATAC GGGCAAACGC CGGCGACCTC GTCCGGATTA GCCGGGCGAA CGCCGATGCG
AACGCCTCAA GTCGGGGGCG ACGCGATTTT GATCGAGGCC CAGCAGCAAG CCGCCCGACG
TCAACAACAG TCGACTTTAT TCGGTGGTGC TGAAGAGGCG GCGGCCGTCA TGCCCACTGA
CTTTGCTGGT GCGACGCCAA GCCACGCGAA AGCGGCACCG ACGCCGTCGC GAAGTGATGT
GAGCTCGCAC ATGGGCGCGA CGCCGTCACT GCACGGGCAA ACGCCCATTC GCGACGGATT
GAACATCAAC GACCAGTATG CCTCGCACTT CGGCGATCTC TCGGCGCGAG AGCGACGTGC
ACACACTGCG TCGACTGCTG CCTCATTGAA GAGTGCGTTC ATGTCGCTTC CGAAGCCACA
AAACGAATAT CAAATTGATC TCCCAGACGA GCCGATGGAA GACGAGCCGA TGGAGGACGC
CGTCGTGGAA GATGAAGCCG ACGTTCGTGC GCGCGAAGCC GCGGCCTTGG CGGAGTATGA
AGCGATTCAA CGCCGTAAGC GCTCCCAAGC TGTCCAGCGC GATTTACCTC GACCGACTGA
GTTGACGCCA GTCGCGCCGC TCGCCGAGGA TTCGATTTCG AAGCTCGTCA ACGAAGAGGC
GCACGCCTTG CTTGAGAACG ACATCGCCAA ATATGGCAAG AAGTCCAAGT CCGCGCCCGC
ACTCGAAGAT TTTGACGAAT CTTTGCTCGT GGCGGCCCGT CGACTCGTCG ATACGGAGGC
TGATGAGATG TTGCGAGAGC AAAACGTGTC GAGAGAAGAT TTCGCCGAAG CCTTCTCCGC
TGCGCTCGTC GCGGAGCGGA AGAAACTTAT TTTCGTACCG AGTCTCAATG CGCAGATCTC
TGTCGATGAA GCGTCGAAAG AACAACAGCT CGAAGCGGCA AAAGCGACTT TCGAACTCGT
TAGAGGGGAA ATGGAGAAGG ACGCAAAACG CGCAGCCAAG TTGGAGCAAA AATGTATCCT
CCTCACGGCC GGGCTTCAGA AGCGGAATGG AGAATTGTGC AACAAACTGA AGAAGACTGT
GGAGGAAGTC AAGGCTTTGT CCACTGAGGC GGCGTCTTAC GCCGTTTTGC ACGTGCAAGA
AGAACGCGCA GCGCCTAATA GAATCGAATA CTGGCTCGAG CTTGTGGAAG CCGCGCGAAC
GCGCGAAAAG CTTCTTCAAG AGAAGTTTGA GACCCTCACG CGACAGTTGA ACGCGTAAC
 
Protein sequence
MRIQIKGGVW KNTEDEILKA AVMKYGKNQW PRIASLLNRK SAKQCKARWF EWLDPSIKKT 
EWTREEDEKL LHLAKLMPTQ WRTIAPVVGR TPSQCLERYE KLLDAACAKD DDYDAGDDPR
RLRPGEIDPN PETKPAKPDA VDMDEDEKEM LAEARARLAN TKGKKAKRKA REKQLEEARR
LAELQKKREL KAAGIAHVRR AKRVRGVDYN AEIAFERKPD AVMYDTREED EAFAKQQSAK
VFKPISLAEL EGKKSAKQLD EESKKREAAK QKMQERRDMP GAVQQALKVN DASFFRRSKL
MLPTPQVSDR ELEDIAKIGK GGVGLLDDGS ATPASGLLGS YGQTPATSSG LAGRTPMRTP
QVGGDAILIE AQQQAARRQQ QSTLFGGAEE AAAVMPTDFA GATPSHAKAA PTPSRSDVSS
HMGATPSLHG QTPIRDGLNI NDQYASHFGD LSARERRAHT ASTAASLKSA FMSLPKPQNE
YQIDLPDEPM EDEPMEDAVV EDEADVRARE AAALAEYEAI QRRKRSQAVQ RDLPRPTELT
PVAPLAEDSI SKLVNEEAHA LLENDIAKYG KKSKSAPALE DFDESLLVAA RRLVDTEADE
MLREQNVSRE DFAEAFSAAL VAERKKLIFV PSLNAQISVD EASKEQQLEA AKATFELVRG
EMEKDAKRAA KLEQKCILLT AGLQKRNGEL CNKLKKTVEE VKALSTEAAS YAVLHVQEER
AAPNRIEYWL ELVEAARTRE KLLQEKFETL TRQLNA