Gene OSTLU_43090 details


Gene Information

Locus tag: OSTLU_43090
Symbol:
ID: 5005483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 140363
End bp: 141721
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table:
GC content: 59%
IMG OID: 640420904
Product: predicted protein
Protein accession: XP_001421216
Protein GI: 145353856
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1736] Diphthamide synthase subunit DPH2
TIGRFAM ID: [TIGR00322] diphthamide biosynthesis protein 2-related domain
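
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the final codon of this unspliced CDS is a stop codon. Both points are assumptions inferred from the listed numbers, not statements from the record. A minimal Python sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length = gene_length_bp(140363, 141721)
print(length)                     # 1359
print(protein_length_aa(length))  # 452
```

Both results match the Gene Length and Protein Length fields.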


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0109024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0260674
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG CCGACCCACC GTCGACGTCG TCCGCGGTGG ACGTCCGCGA AACGACCACC 
GCGCCCGTGG TTCGGCGTTT CGTTCGCAAT CAGGTGCGCA CGCGACGCTC GCGCGTTGTC
CCACTCGCCG AAAAGTGTCC CGAACCGAAA CTGACCGCGC CGACGCAGAT TCCGGACGAG
ATCGCCAACG ATCCCGAGCT CGAGACGGCA ATGAAAGTGT TGCCGGTGAA TTACGACTTC
GAGGTGAAGA AGACGGTGTG GCGGTTGCGA CAGGTGGGGG TGAAGACGCT CGCGCTGCAG
TTTCCCGAAG GTTTGTTGCT GTACGCGACG ACGCTGTCGG ATATATTTCA AACGTTCGCG
GGCGTGCGAG ACGTGGTGAT ACTGGGAGAC GTGACGTACG GTGCGTGTTG CGTGGATGAT
TACACCGCGG AGTCGCTGGG GTGCGACTTT TTAGTGCATT ACGGGCACTC GTGCTTGGTG
CCGGTGGACG TGACGCGAAT GAAGTGTCTG TACGTCTTTG TGGATATTTC GTTCGACGTC
GGGCACTTGT GCGCGTCTGT GGAGCACAAT TTTAAACCTG GGTCGAGGTT GATCCTGGCG
GGGACGATTC AGTTCGCGAG TGCGATCCAA GAGACGCGGA CGCGATTGGC GGAAAGGTAT
CCCTCGCTCG CGGTACCGCA AGCCAAGCCG TTGTCGCCGG GAGAGGTTTT GGGATGCACG
GCACCTGTGA TCGAAGACGC GAAGGATAGA GACGCAATAG TGTTCGTCGC CGATGGACGA
TTCCATCTCG AGGCGATTAT GATCGCCAAT CCGACGGTGC CGGCGTTTCG CTATGATCCG
TACCAGCGCA TTCTGACGCG CGAAGAGTAC GCGCACAAGG AGATGCGTTC GGTGCGTAAA
AGCATGGTAT CTCGAGCGAA AGACGCAAAA ACATTCGGCA TAGTTCTCGG CACGCTCGGT
CGTCAGGGAA ATCCGGCGAT TCTGGAACAT TTAATGTCTC TCATGCGCGT CAAGGGTCGA
GAGTACGTCG TGTTTTTAAT CAGTGAGATG AACCCTGCGA AGATGGCCGC GCTCGAGGGC
CTCGACGCGT TCGTCCAAGT GGCTTGTCCG CGATTATCGA TCGATTGGGG GGAAGAATTC
GATCGACCGG TGTTGACGCC GTATGAGGCG GAAGTCGCGC TCGATAACGT CGAGCCGTGG
TGGTTACTGG CTGGCGTCGC CCCGGGCGAG GAATACGCAC CTTACCCGAT GGACTACTAC
GCCAAAGACG GCGGCTCGTG GAGCAGTAGC TACCACAAGC AAACAGGTAA GAATGGCAAG
CCCAAGCGCG CGCCAGTGCG CATCGAAACC GCCGAGTGA
 
Protein sequence
MTDADPPSTS SAVDVRETTT APVVRRFVRN QVRTRRSRVV PLAEKCPEPK LTAPTQIPDE 
IANDPELETA MKVLPVNYDF EVKKTVWRLR QVGVKTLALQ FPEGLLLYAT TLSDIFQTFA
GVRDVVILGD VTYGACCVDD YTAESLGCDF LVHYGHSCLV PVDVTRMKCL YVFVDISFDV
GHLCASVEHN FKPGSRLILA GTIQFASAIQ ETRTRLAERY PSLAVPQAKP LSPGEVLGCT
APVIEDAKDR DAIVFVADGR FHLEAIMIAN PTVPAFRYDP YQRILTREEY AHKEMRSVRK
SMVSRAKDAK TFGIVLGTLG RQGNPAILEH LMSLMRVKGR EYVVFLISEM NPAKMAALEG
LDAFVQVACP RLSIDWGEEF DRPVLTPYEA EVALDNVEPW WLLAGVAPGE EYAPYPMDYY
AKDGGSWSSS YHKQTGKNGK PKRAPVRIET AE
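
The gene and protein sequences above can be cross-checked with a short script. The sketch below strips the spacing between the 10-character blocks, computes the GC fraction, and translates with the standard genetic code. The record leaves the Translation table field empty, so the choice of the standard code is an assumption, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGACCGACG CCGACCCACC GTCGACGTCG TCCGCGGTGG ACGTCCGCGA AACGACCACC"
print(translate(first_line))  # MTDADPPSTSSAVDVRETTT
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.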