Gene OSTLU_86714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_86714 
Symbol 
ID5000788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp426785 
End bp428041 
Gene Length1257 bp 
Protein Length418 aa 
Translation table 
GC content61% 
IMG OID640416209 
Productpredicted protein 
Protein accessionXP_001416659 
Protein GI145344269 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR02143] tRNA (uracil-5-)-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.305478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0142919 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCGCG CGTCTGTGGC GCTCGGCGCG CGTCGTCTCT CGAATGGGTA CCTTTTCGCG 
TGCCCGCGCG CCGCCGGCGC GCGCGCGCGA CGACGATACC GCCCGAGCGG CGCGACGTGT
CGACTCGACG TAGAATGTCG TCCGTCGTCG TACGACGAAG AATTGCGAGA GAAGACGGCG
CACGCGCGCG CGCACTTTGA CGGCGTGGCG ACGCTGCCGA GCCGCACGGA GACGTTCGAG
AGCGCGCGAG AGGAGTTTCG CATGCGCGCC GAATTCAAGG TGTGGCACGA CGGCGATAGG
AGTTACTTCG CCATGTGTGA TTCCGATCGA CCGAAGGACC CGGTGGAGGT GGTGGACTTT
CCGATGGCGA GCGCGCGCAT TCGCGAACTC ATGCCAGAGT TGCTGCGCGA GGTGACGGCG
ACGGAGACGC TGCGAAGAAA GCTGTTTCAA GCGAATTTCC TCACCACGAC GACCGGAGAT
GCGGTGGTGA GTTTGTTGTA CCACAGACAG TTGGATGAAG ATTGGGAACG CGAGGCGGAG
GCGATGCGGG CGCGGTTGGG GATCGACGTC ATCGGTCGCG CGAGGAAGCA AAAACTTGTC
CTCGCCAAGG ATTTCGTGAC CGAGAAGGTG TTGATCGATG GGAAGGAATT TAGTTACAAG
CAGTTGGAGG GGTCGTTCAC GCAACCGAAC GCTGGAATTG CGGCGCAAAT GCTCGCGTGG
GCTCGCTCGG CGGCGGTGAG CGATTCACCA GACGTCGTCG CGGCGGCGAC GCCGCGAGAC
GCGAATCGAG ATTTTCTCGA GCTCTATTGC GGCAACGGAC ATTTCACGAT CGCGCTCGCG
CCCTTGTTTC GCAAATGTTT GGCGACTGAA ATATCGAAAT CGTCCGTCGC CGCCGCGCAC
GTGAACATGG CCGCGAACGG AATCGACAAC GTCGTCACCG CGAGACTATC GGCGGAGGAG
CTGTGCGACG CCCTCGACGG CGGACGCGAG TACACGCGCC TGAAGGATAT AGATCTCACG
ACGTACGATT TGAGCACCGT GCTCGTCGAT CCGCCGCGCG CCGGCATGGG CGACGAAGTG
AGCAAATTCT GCGCGCGCTT CGACAGGATC ATTTACATCT CGTGCAATCC GGAAACTTTG
GCGCGAGACT GTAAAATTCT CGGCGAGACG CACGAAATCA AACGATTCGC CGTCTTCGAC
CAGTTCCCTT ACACCCCGCA CTTAGAGTCG GGCGCGCTCT TGGTGAAGCG CGCGTGA
 
Protein sequence
MFRASVALGA RRLSNGYLFA CPRAAGARAR RRYRPSGATC RLDVECRPSS YDEELREKTA 
HARAHFDGVA TLPSRTETFE SAREEFRMRA EFKVWHDGDR SYFAMCDSDR PKDPVEVVDF
PMASARIREL MPELLREVTA TETLRRKLFQ ANFLTTTTGD AVVSLLYHRQ LDEDWEREAE
AMRARLGIDV IGRARKQKLV LAKDFVTEKV LIDGKEFSYK QLEGSFTQPN AGIAAQMLAW
ARSAAVSDSP DVVAAATPRD ANRDFLELYC GNGHFTIALA PLFRKCLATE ISKSSVAAAH
VNMAANGIDN VVTARLSAEE LCDALDGGRE YTRLKDIDLT TYDLSTVLVD PPRAGMGDEV
SKFCARFDRI IYISCNPETL ARDCKILGET HEIKRFAVFD QFPYTPHLES GALLVKRA