Gene OSTLU_32667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32667 
Symbol 
ID5003026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp449498 
End bp451051 
Gene Length1554 bp 
Protein Length517 aa 
Translation table 
GC content66% 
IMG OID640418447 
Productpredicted protein 
Protein accessionXP_001418709 
Protein GI145348549 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1736] Diphthamide synthase subunit DPH2 
TIGRFAM ID[TIGR00272] diphthamide biosynthesis protein 2
[TIGR00322] diphthamide biosynthesis protein 2-related domain 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.563535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.65168 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACG ACGACGCGAC GTTCGACGTC GCGTACGACG TCGCGCGAAC CGCGTCGTGG 
ATCCGCGACG GCGCGTTCTC GCGCGTCGCC CTGCAGTTAC CCGACGACAA GCTTCCCGAC
GCCGCCAGGC TCGCGCGCGC GCTTCAAACC GCGCTCGCGC GCGACGACGA CGCCGCGCGA
GCGTCGCGCG AGGTCTTCGT CCTCGCGGAC ACGACGTTCG GGTCGTGCTG CGTCGACGAA
GTCGCGGCGG CGCACCGGGA CGCCGACGCC ATCGTACACT TCGGACGCGC GTGCTGCTCA
CCGACGAGCC GAACGCCGGC GCGGTTGGTG TTCGATCGTA AAGAAATCGA TTGCGCTCGA
TGCGCGCGCG CGCTCGAGGC GCACGCCGCG GCGTCGGCGA GGCGCGGGGC GCGCGCGGTG
GTGGCGCTGT GCGACCAGGA ATACTTTTGG GCGCTCGACG AGTTGAAGCG AGCTTCGACG
AGCTCGACCG CGAACGTTCA GGGCGCGGAG ATCTTCATCG CGGACGCGGT TGAGGTCGAA
GTCGATCCGA GCCGCGATTC GAGAGTGCGA GGGAGGGATG AGGACGCGTC GACGACGCGA
GTCGGCGCGT CGCGATTTCG ACCGCCGAAT GGGACGACGA ACGAGGATTG CGCGTACGTG
TGGATCGGTG CGACGGGGCC GGCGATGACG CATGCGATGC TGGTTTTGGG CGATTGCGCG
GCGAAGTTCG GTGGGATGGC GCAGTACGAT CCATCGGTGG ATGGCGACGC GGTGCGCGTC
GAGGCGGACG GCGCCGGCGA AGCCGCGAGG GCGTTGAAGC GAAGGAGATT CTTGATAGCG
AAGGCCAAGG AAGCCAGGGT GGTTGGCATC ATCGCCGGTA CTTTAGGTGT CGCGGGTTAC
CGCGAAATGA TCGAAAACTT GCGCAAGTTG ATTGCGAATA GTGGGCGTAA GAGTTACACC
GTCGTCGCGG GAAAGCCGAA TCCGCAAAAG TTGGCCAACT TCCCCGAGAT TGAGGTCTTC
ATAATGGTGA GCTGCGAGTT GACGGCGTTG ATGGACGGGC GCGATTACAT GCAACCGATC
ATAACCCCGT ACGAGGCCAC GATCGCGTTC ACGCCGGGAA AGATGTGGAT GGGCGAAGTC
AAGCTCGATT TCGCGTCGGT GCCGACGTTC GAAGACGTCG TCGCGAATGG CGACGACGAC
GACGACGTTC AACCAGAGTT CAGCCTCGTT TCTGGCACGT ACATATCACC GGTGAACGCC
TCGTCGTCCG CGCACGACGC CGACGACATC GACGCCCTCA CCGGCACCGA GCTCGCTCGC
CGCGCCGAGG GCGCGCTCTC CCTCCGCGCC TCCGGCGTCT CCGACGCCGT CGTCACCTCC
GGCGCCGAAT ACCTCATCTC CAAGCGCACG TACACCGGTC TCGAGCCCGG CCCGAAGCGC
GACGACGAAA CGGGCGCCAT CGCGGACGCC CCGCTCGAAG CCGCGCGCGG TCTCTCCGGC
CGCGCCAAAT CGTACGCCGA CGAGTCGCCC GCGTCCGCCG CCACCGACCC CTAG
 
Protein sequence
MRDDDATFDV AYDVARTASW IRDGAFSRVA LQLPDDKLPD AARLARALQT ALARDDDAAR 
ASREVFVLAD TTFGSCCVDE VAAAHRDADA IVHFGRACCS PTSRTPARLV FDRKEIDCAR
CARALEAHAA ASARRGARAV VALCDQEYFW ALDELKRAST SSTANVQGAE IFIADAVEVE
VDPSRDSRVR GRDEDASTTR VGASRFRPPN GTTNEDCAYV WIGATGPAMT HAMLVLGDCA
AKFGGMAQYD PSVDGDAVRV EADGAGEAAR ALKRRRFLIA KAKEARVVGI IAGTLGVAGY
REMIENLRKL IANSGRKSYT VVAGKPNPQK LANFPEIEVF IMVSCELTAL MDGRDYMQPI
ITPYEATIAF TPGKMWMGEV KLDFASVPTF EDVVANGDDD DDVQPEFSLV SGTYISPVNA
SSSAHDADDI DALTGTELAR RAEGALSLRA SGVSDAVVTS GAEYLISKRT YTGLEPGPKR
DDETGAIADA PLEAARGLSG RAKSYADESP ASAATDP