Gene OSTLU_34867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34867 
Symbol 
ID5003729 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp155396 
End bp157444 
Gene Length2049 bp 
Protein Length652 aa 
Translation table 
GC content55% 
IMG OID640419150 
Productpredicted protein 
Protein accessionXP_001419728 
Protein GI145350681 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.437532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCACG CGCCGTCGTC GGCATCGTGG CGCGAGAACG ACGCGATCGA GCACGTGGAG 
GAGCGACTCG TCGGCCTCGA GCTCTACGCC AACGCCGCCG CCGGATTCGA TGGTGTGCTG
AAAGAACGCT ACAGCGATTT CATCGTGCGG GAGATCGACA TCGAGAGCGG TGAGGCGGTG
GTTTTGAATG AACTGAGCGC GACGTGCGAT GTAGAGATCG ACGCGCGAGA GCGGAAGAAG
TTTGAGGCGA TGAAGGCGAA AGTGCAAGCC ATGGAACGCG AGCGCGGGCC GGACGACGAA
GCGGAGGACG AGGACGCGGC GAAGGATGAC GTGGAGGGGG ATAAAGAAGA GAGTATTGAG
GAGGCAATGA AAGAATTCGA AGCGCTGTGC GGTGCAGAGG ACGCGCGGCG ACTGAGAGAA
TTTTTAGCCA CTCCCGGGGT GACGCGTCGA AGTCAGAAGA CGCACGATGG AAAGGCGGAA
ACGCCCCAAC CGCTCGTGCT CGAGCCGACG ACGGATAAAG CGAAACGGAC GCAAATTCAT
CAATTTTTCA AGAAACACTT TTTATTACCT ACGGACAACG TCGTGGAGTC GAACGAGGAA
GAGAAGGAAG CTCTCAAGAA CTTGAAAAAG CCGTCTTCGA GCGTGCGCGT GCATGCCGCC
GTCAAGCAAG GCAAGAAGCG AACGCGCGTG GAGGCAATGG ATCATCGAGC GGTGGGAAAC
TTTTGGCCCG AAGGCGTTCC AGAACATGTG CGATTTGCGT TTTGCAAAGA AAACAAAGAG
TCTTACGAGA TGCTCAACGT CATAGCCCGG GCCTTGAAGG TGAACTTCAA GTCTATTGGC
GTCGCCGGAA CGAAGGACAA GCGCGGAGTG ACAACGCAAC ACGTCACCGT GCACAGAGTT
CGGGCGAAAA GGTTGGCCAA GCTGGTACTT TATGGGTGTA AAATTGGTAA CTATACGTAT
GTCGACAGAC AACTTGGTTT CGGAGACCAT TGCGGGAACG AGTTTGAGAT AACGCTCCGA
GGCATCGACC CAGACGTCGT CGGGAACGTG GAGGAGGCAG TGCGCGCACT CAAGTCTTCA
GGGACCATCA ACTACTACGG TTTGCAGAGA TTTGGTAGCG CTGGGGGCAA ACACGCAACG
CATAAAATTG GAATTGAACT TTTACGTGGC GAATGGCAAG CTGCGATCGA CGCTTTGCTG
CTGCCGCGCG AAGGCGAGCG CGACGATGTC ATGAAGGCGC GCTTGAAATG GATGGAAACA
AAGGATCCCA ATGAGACTTT GAAGTTGATG CCACGCTGGT GCGCGGCAGA GCGCTGTGTG
CTCGAGCGCA TGTCAAAAGT TCGCTCCACG GACTTGGTCG GATCGCTGTT GGCCGTGCCG
AAGCAGATTA GACTGATGTA CATTCACGCC TACCAGGCAT ACTTGTTTAA TCGTGTCGTG
TCGGAGCGCA TTCGTAAGTA CGGAGTCAAC ACGGTCGTCG AAGGTGACTT AGTGCTCGAA
GAGGGAAACT GTGCCGGAGA TGAAGGCGAA GACGATATGA ATGGCGATAC TCGGGTGAGT
ATGCCGAGGG TTCGCGTAGT GACAGCCGAG GAAGCCGCTT TGGGTGCGAT TGACTCGTCG
CTCGTGGTGC TGCCGCTTCC TGGAAACTCG ATAACGTACC CAACAAATTT GGGTGATGTT
TACGATCGAT TCGCCGCGGA GGATGGAATC AGTTTGAATA CTACGACGCA TTCGGTTCGT
GAATTCGCAA TCAACTCATT CACCGGTGAC TACCGTAGAT GCTTTCTCAA ACCCACAAAC
GTATCGCACA CCGTCATTTC GTACGCGGAC GCGGCGGCGG ATTTGGTTTT GACCGATCTC
GATCGCATCA ATGGCATCAC CGAACGCACC ATCGAAGACG GCCCTTTACG TGCCGTAACG
GTGAAGTTCA CTCTGCCCCC GTCTTCTTAC GCCACCATGG TTCTTCGGGA GTTGATGAAG
GCGAACACCT CGGTGAGCTC GCACAAGCGC AAGACGCTCG ACGCGCGAGC GGCGGCTTCC
GTAGAGTAG
 
Protein sequence
MSHAPSSASW RENDAIEHVE ERLVGLELYA NAAAGFDGVL KERYSDFIVR EIDIESGEAV 
VLNELSATCD VEIDARERKK FEAMKAKSIE EAMKEFEALC GAEDARRLRE FLATPGVTRR
SQKTHDGKAE TPQPLVLEPT TDKAKRTQIH QFFKKHFLLP TDNVVESNEE EKEALKNLKK
PSSSVRVHAA VKQGKKRTRV EAMDHRAVGN FWPEGVPEHV RFAFCKENKE SYEMLNVIAR
ALKVNFKSIG VAGTKDKRGV TTQHVTVHRV RAKRLAKLVL YGCKIGNYTY VDRQLGFGDH
CGNEFEITLR GIDPDVVGNV EEAVRALKSS GTINYYGLQR FGSAGGKHAT HKIGIELLRG
EWQAAIDALL LPREGERDDV MKARLKWMET KDPNETLKLM PRWCAAERCV LERMSKVRST
DLVGSLLAVP KQIRLMYIHA YQAYLFNRVV SERIRKYGVN TVVEGDLVLE EGNCAGDEGE
DDMNGDTRVS MPRVRVVTAE EAALGAIDSS LVVLPLPGNS ITYPTNLGDV YDRFAAEDGI
SLNTTTHSVR EFAINSFTGD YRRCFLKPTN VSHTVISYAD AAADLVLTDL DRINGITERT
IEDGPLRAVT VKFTLPPSSY ATMVLRELMK ANTSVSSHKR KTLDARAAAS VE