Gene OSTLU_39526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39526 
Symbol 
ID4999868 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp326136 
End bp327806 
Gene Length1671 bp 
Protein Length536 aa 
Translation table 
GC content64% 
IMG OID640415289 
Productpredicted protein 
Protein accessionXP_001415457 
Protein GI145340698 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR00479] 23S rRNA (uracil-5-)-methyltransferase RumA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.612078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCGCG CGTTCGCCCG CGTCGTCCTC GCGCGCGCGC TCGCCGCCGC GCGCGCCCCC 
GTCGCGCACG CCGCGCGTCG TCTCGCCGTC GCGCGTCCGC CCTGCGCCGC GAAATCCACC
GCCGCCGCCG CCGCGTCCGA CCGCGGCGAC GTTCGCGTCG GCGACGAAGT CTCCGTGGAG
TGCGTCGATT TCGCCACGTC GGGCGAGGGC GTGTGCAAGC TCGCGAATGG GATGGTTTTG
CTCTGCGACG GCGCGACGCC CGGAGAGGTG GTGCGCGCGC GGGTGACGAA GCTGCGGAAG
AAGCTGGCGC ACGGCGCGAA GACGGCGACG GAACGCGCGG CGCCGAACGC GGTGACGGCG
CCGTGCCCGC ACCACGAACA GTGCGGAGGG TGCGCGTGGC AGCACGTGAA TTACGACGCG
CAGGTGGGGC ATAAGCGGAA TCGGGTGGTG GACGTGCTGG CGAGAATATA TAAAGCGGGC
GAGGACGCCG AGGCGAAAGT CGGCGCGTGC GTCGGCGCGG ACGAGACGTC GAGGTATAGG
AATAAAATGG AGTTCGCGTT CGCGAGTGGA ACGAGAGGAA AGACGGTCGT CGGGTTGCGA
CCGCGAGGGG CGAACGATTC GGTGGTGGAT TTAAGCGGTG GATGTCTGTT GCAGAGCGAG
GAGGCGGATC GCGTGTTGGC GGCGATTCGG GAGACGCTGG AACGCGCGGA CGGGCGCTTG
GAGGCGTTCG ATCGCACGAG CGGCGAAGGC ACCCTTAGAA GCGTGACGAT TCGCACGGCT
GGTGGCGAGC GCGGCGGTGA GAAGGCTGTG ATGGTCGATT TAGCGACGAC GGCTTCGCCG
AACGAACTGA AAACGGGACC GCTCGCGGGA CTCATCGACG TCGTCTCGAA GGTACCGGGC
GTGGTGTCCG TGGTGCACAC TTCCGTGCCG AGCGAAGCCG AACTCCGACG CGCGGGCGGT
GGACGCTCGT CAAAGTTCGT CAAGGGTGGC ACGACGACGA AGGCTGGGTC AACTAAGAAA
GTCGAGGCGG TGTTCGGCGA GAACAAATTA GTCGAAACGC TCAACGGCAT CGACTTTGAA
CTTTCTTCGG CGTCCTTCTT TCAGACCAAC ACTGAGCAGG CTGCGAGATT GGTGCGACAG
GTTCGCGAGG CGTGCGCTTT CAGCGGCGAT AAGTCTGAGA TCGTGCTCGA CCTGTTTTGC
GGTGTCGGTA CGATGGGACT CAGCGTCGCG AGCGACTGCT CGCGAGTGAT GGGATGGGAA
GTCGTTCCAG AGGCGGTGAA AGACGCGAAA CGCAACGCCG AGCTGAACAA CATCACGAAC
GCCAAATTCT ATCGCGTTGA TTTGGCGAGA TTAAATCCGT CCAAAGGCCC GAAAGGTCTT
CTCACGACGC CGAAAGGCAA AGAGCTTCCC ATGCCGGACA TCGTCATCAC GGACCCGGCG
AGGCCTGGTA TGGACTCCGC ACTCATCGCG ATCCTGCGCA CAATCGGTGC TCGGCGTATC
GTCTACGTGT CGTGTAATCC CGCGACGCAA GCGCGAGACT TGCTACTTCT CACGGCGCCG
TCGGAGGGCG CGGACGACGT CGCGTACGAG CTCAAAACCG TCACGCCCGT CGATATGTTT
CCTCACACGA CGCACGTCGA GTCCGTCGCC GTGCTCGAAC GCAAAGCTTA G
 
Protein sequence
MPRAFARVVL ARALAAARAP VAHAARRLAV ARPPCAAKST AAAAASDRGD VRVGDEVSVE 
CVDFATSGEG VCKLANGMVL LCDGATPGEV VRARVTKLRK KLAHGAKTAT ERAAPNAVTA
PCPHHEQCGG CAWQHVNYDA QVGHKRNRVV DVLARIYKAG EDAEAKVGAC VGADETSRYR
NKMEFAFASG TRGKTVVGLR PRGANDSVVD LSGGCLLQSE EADRVLAAIR ETLERADGRL
EAFDRTSGEG TLRSVTIRTA GGERGGEKAV MVDLATTASP NELKTGPLAG LIDVVSKVPG
VVSVVHTSGG TTTKAGSTKK VEAVFGENKL VETLNGIDFE LSSASFFQTN TEQAARLVRQ
VREACAFSGD KSEIVLDLFC GVGTMGLSVA SDCSRVMGWE VVPEAVKDAK RNAELNNITN
AKFYRVDLAR LNPSKGPKGL LTTPKGKELP MPDIVITDPA RPGMDSALIA ILRTIGARRI
VYVSCNPATQ ARDLLLLTAP SEGADDVAYE LKTVTPVDMF PHTTHVESVA VLERKA