Gene OSTLU_33329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33329 
Symbol 
ID5003626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp42506 
End bp44511 
Gene Length2006 bp 
Protein Length664 aa 
Translation table 
GC content60% 
IMG OID640419047 
Productpredicted protein 
Protein accessionXP_001419474 
Protein GI145350138 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG5459] Predicted rRNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.335485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCGT ATGATGCGGT CACGGAAGAG GCGCTGCGAC GGGCGGTGCG GGACGCGTTC 
GGGGGCGAGC GAGGGGTTCG CGTAGGACGA CGCGTCTCGC GGCGGTTCGA CGCGCTGCGC
GAAGCGCTGA AAAGATCGAG CGGCGGACGC GTGGTGAACG CGTTGGAGAC GAGCGCGGGT
GAATTAGATG CGAAACAACT CGATGGACGA TCAGCTTCGG AGGTAGGCGC AACGAGAGGG
CTGGAGGGCG GCGCGCTCGC TTCGTGGGAC GACGACATCG AGTGGGGCGC GCGGTTAAGC
AAGCGAAAGC TGCGTAGAGT GGAATCGCCC GTCGACGTCG AGCGAGCGGG AGAAAAATCG
TCGGTGGAGG TGAAATATGA CGAACTCGAT GCCGCTGTGT ACGCAGTGAT GCGAAGTCCG
ATGACGATGG GGGCATTGAA GAGGATTTTG TACGAGACGC GCGATCGCTT GGGCACGAGT
TTCAAGCCGA GGAGCGTTTT AGATTTTGGG TCGGGGCCGG TCCCGACGAC TCTGTTCGCC
GTGCGCGCGG TTTTCGGAGA TGATGTTGGC ACGCATGGCG GGCGTGATAC CGATGTTAAG
GTGGCGTTCG TCGATAGCAA CCCAGGGATG ATGCGTTTTG CGAGACGCGT GACTGGATAC
GCGAAGAATA TAGAGCAAGA ACGTCGAACG ACCGCGGCGC TTGAACGCGC GTCGACGACG
TCAGAGGACA AAATAGGTGT TAGTGAAGAG CTCAGCGACG TTTTAGACGA CGTCGACGCG
TCTGTGACGC GAGTATTGGA CTTTAGCGAT TTCGAAGCGG TGGAGGATGA GCGCCACGGG
TTTCGGCCCG CGAGCGCGTT GAGATTGAAA ACGCCTCACC CTTGGCACGA AAGTGAAGGC
ATCCGCACGA GCGCGAGTCT TCGTGGCGCC AATCGACGAG GTGGATTTGA CGTCGTTGTT
TCGTCGTACG CGCTGGGCGA AATTCCCGAT AATCACGTCG TCAACGCGCG CGGACGCGAA
GTGCGAAATC AACGACAGCT CGACGTAACG ATTCGTCAGT TATGGGATAA AGTCGCTCCG
GGTGGCATCT TAGTCTTGGC CGAACCTGGA ACACCGCGAG GGAGCTTACT CATTCGTCGC
GCGAGAGCGA TGCTTCTCGA CGTTGCTCGA CGCGATATGG AACAGGACGC GCGAAGACTC
GACTTTGAAC CGAGCGAAGA CGCCGTGGAG GCGTACGTCG TGGCGCCTTG CCAGCACGAC
AAAGCGTGTC CGCTCAAGGA TGTCAACGCT GAGGATGGCT TTTCGACGTG GTGCCACTTT
CCTCAGCGAA CGCTGCGAAG CGCGTACGTG CGAGAAATGA AACATGGCGC CAGACCATAC
CAGGATGAAA AGTTTTCATA CGTCGTACTG CGCAAGATGC GTCGAAGCGC GGCGCGCAGA
GACGCCGAGC GCGCGACGCA AGCCGCTCGC GACGCGCTCG CGGCGCAAAA ATCCGCGACT
CCGAACGACG AAAGAAACGG AGACGAAGAA GACGAAGAAG ACGAAGACGA AGAATATCAC
GAAAGTCTCG CGCGCGAGTC TTTCGAGGAT TGGTCGCGCG TCATTCGTCA GCCCATGAAG
CGTAAAGGCC ACGTCGTCTT TGAGCTCTGC GCCCCCTCTG GCGAACTCGA GCGCGTCACC
GTCGCCAGAT CTCACGGCGA TCTCATCGGC CGTGACGGCT ACAAGTACGC CCGCAAGCTT
CGTTGGGGTG ATTTATGGCC CTTCACGCAC AAAACCGTCG TCAAACCCGG CGATCAGCGC
GCGTTCGAGC TCGAGGCTTC GCGTTTCGAG AGCGACTTTC TTCGGTCTCT CCGTCGCGAC
CGCGCCGCGC GTCTCGTCGC GTCTCCCGCG CCTCTCGACC CGATCGCGCG CGACGAGATC
GAGATCGAGC TCGACGACGA CGAGCTCGAC GACGAGCTCG TCGATCTTTG GAATAGCCGC
GCCGGCGCGC GCTAGACGCG CCGGTC
 
Protein sequence
MTSYDAVTEE ALRRAVRDAF GGERGVRVGR RVSRRFDALR EALKRSSGGR VVNALETSAG 
ELDAKQLDGR SASEVGATRG LEGGALASWD DDIEWGARLS KRKLRRVESP VDVERAGEKS
SVEVKYDELD AAVYAVMRSP MTMGALKRIL YETRDRLGTS FKPRSVLDFG SGPVPTTLFA
VRAVFGDDVG THGGRDTDVK VAFVDSNPGM MRFARRVTGY AKNIEQERRT TAALERASTT
SEDKIGVSEE LSDVLDDVDA SVTRVLDFSD FEAVEDERHG FRPASALRLK TPHPWHESEG
IRTSASLRGA NRRGGFDVVV SSYALGEIPD NHVVNARGRE VRNQRQLDVT IRQLWDKVAP
GGILVLAEPG TPRGSLLIRR ARAMLLDVAR RDMEQDARRL DFEPSEDAVE AYVVAPCQHD
KACPLKDVNA EDGFSTWCHF PQRTLRSAYV REMKHGARPY QDEKFSYVVL RKMRRSAARR
DAERATQAAR DALAAQKSAT PNDERNGDEE DEEDEDEEYH ESLARESFED WSRVIRQPMK
RKGHVVFELC APSGELERVT VARSHGDLIG RDGYKYARKL RWGDLWPFTH KTVVKPGDQR
AFELEASRFE SDFLRSLRRD RAARLVASPA PLDPIARDEI EIELDDDELD DELVDLWNSR
AGAR