Gene OSTLU_25549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25549 
Symbol 
ID5005479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp564788 
End bp566483 
Gene Length1696 bp 
Protein Length403 aa 
Translation table 
GC content57% 
IMG OID640420900 
Productpredicted protein 
Protein accessionXP_001421504 
Protein GI145354463 
COG category[A] RNA processing and modification
[D] Cell cycle control, cell division, chromosome partitioning
[K] Transcription 
COG ID[COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.51719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.155521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCCGCGATCG AGGCGCGTCG CATTTCCGAC GCCCACGCGT CAACGCACCG CGAGAATCGC 
GCGCACGCCG GGAAACGCGC GAAGAGCGCA CCGAGGAGGC GCAATTAACG CTCGCCGTTG
ACTGCGAAAC GTCGCCGGCG AATCGAATCG AGCTCGCGTT CGGCGCGCGC GACGATCGCG
GCGGGTCGAC GCGGTCGGCG ACGACGCGCG CGCGTGATCG AGGCGCGGTC TCAGCGTCGA
CGGGACGCGT CGAAGCGTCG CGTCGATCGC ACGTTCGAGG GTGACGGCTG ACTGGCAAGA
TTCGGGGCGT CATCGAACGC GCGATGGCTC GCACGCGAGG CGAGGAGCCG GCGGAGAAGT
CGGCGACGAT CGCGTTGGCT GATATCGCCG CCGCGACTGA GACGATTAAG AAGGCTCACA
TGAAGAAGGA ATCGAAGAAA CGTAAGGCGG AACTCGACGT CAGGGCGCCG TTGAGCAAGC
GACAGCTGGA CGACGACGCG GCGTACGCGG CGTTGATGCA CGACAAGTCA AAGGATGCAG
ACTTGCCGGT GGCGCAGGCT TCGGAGAAGG AATTGCAAAC TGTTTCTGAT AGTAGCGAGT
GGGACGTCAA GTTTCCGGGG CAGAGATTCG GGCAGTGGAG CACGCTCGAG GTGGAACAGA
TGAAGCGCTC GCTCGAGAAA TGGGCCAACG AGCACGGGCT CGCAGAAGAT TTCATGAATG
GCAACTATGA GTTCTTGTTC AACCGTCGTC AAAAGCAAGG AGGTAAAGGT GCACACTTAC
CGCTATCTGA GCGACGCGCG TTTATCGAAG TCGCTCGCGA GACGCCGACA AGAAACGCCA
AGCAAATTTA CGGCTGGATT TTGAGAAACA TGGACAAAAA GTCGAAGTCG GGGAAGTGGC
AGAAGGAGGA GACGGAAGCT TTGCTCGAGC AATACACGAA ACTGGGCCCG AAATGGTCCA
AGATCGCGGA AATAGTCGGC AGGCCGGCGT CGGCGTGCCG TGACAAGTGG CGTCTCGCCA
AGGGAGGTGA ACACAAAAAG TCGGGGCACT GGAGCCAGGA AGAAACCGAC AAGTTGTGTG
AGCTCGTGAA GGAACACTTC CGCCAGCGAG GCGCGGAAGC TGGATGCGGG CCGGGAACGG
GCAACGAACA CCTTTCACTT CGCGACAATA TCAACTGGGT CACCATCTCT GCCAAAATGG
GCACTCGAAA CGAGCAGGCT TGTTTGCAAC GATGGTATCA AATCTCGCCT CCAATGACGA
GTACGGGCGA GTGGGATGTC GAACAAGACT ACGAGATGTT GAATAACGTC ATCAAGTACA
GATCGATGAC TGCCGAAGCT GTGCCATGGG CGTCGACTGT TCGGGGCCGT GATTTGTCTC
GAATCATGCG ACGGTGGAAA TTGCTTTCGT CCAAAATCTC TGGACACGTC GACATGGCAT
TCCGCGAACT CGTGCTCCAA GTTTGCAAGA GTAAGGACTA CAAAGACCTC GTTATCAAGG
CGCAAGCTTT GGTCAAGTCT TCCTCGAGCG CGTGACTACG AGCTTAAGAT GCGATGATGT
TTTCACCTAC CCACTGTACC ATCCACATCA CCTGCCCACT GTACCATCCA TATTCATGTC
ATAATTTTTT CCGTTTCAGC GTGAGTTTTG AAACAGGCTC GTCATACAAT TTAGATAATT
CCTTAGTCAT GCAAGA
 
Protein sequence
MARTRGEEPA EKSATIALAD IAAATETIKK AHMKKESKKR KAELDVRAPL SKRQLDDDAA 
YAALMHDKSK DADLPVAQAS EKELQTVSDS SEWDVKFPGQ RFGQWSTLEV EQMKRSLEKW
ANEHGLAEDF MNGNYEFLFN RRQKQGGKGA HLPLSERRAF IEVARETPTR NAKQIYGWIL
RNMDKKSKSG KWQKEETEAL LEQYTKLGPK WSKIAEIVGR PASACRDKWR LAKGGEHKKS
GHWSQEETDK LCELVKEHFR QRGAEAGCGP GTGNEHLSLR DNINWVTISA KMGTRNEQAC
LQRWYQISPP MTSTGEWDVE QDYEMLNNVI KYRSMTAEAV PWASTVRGRD LSRIMRRWKL
LSSKISGHVD MAFRELVLQV CKSKDYKDLV IKAQALVKSS SSA