Gene OSTLU_29766 details

Gene Information

Locus tag: OSTLU_29766
Symbol:
ID: 5006997
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 247778
End bp: 249457
Gene Length: 1680 bp
Protein Length: 471 aa
Translation table:
GC content: 58%
IMG OID: 640422418
Product: predicted protein
Protein accession: XP_001422850
Protein GI: 145357284
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.0025927
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00128965
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGACGAGGC CGATCACGAC GAGTGCGTGA AGAACGACGC GAGCGACGCG ACGCGACGCG 
ACGCGAGACG ACGCGACGCG CGAGACGACG CGACGAGGGC GATCGCGACG GTCGTCGGAA
AGATGTCCGG AAACCATCCG CGAGACGCGG CGACGACCGG AGACGCGACG ACGGCGACGG
GAGAGCGTTG GGACGGGCGC GAAAGATGGG AGAGCGCGCG ACGGTGCGAA TGGAGGCGCG
CGAGGACGAA GGACTGACGA CGCGCGCGGT TTCGGGGACG ACGCAGTATG TTGCATCGGC
GCGGGATACG TGGGCGGGCC GACGATGGCG ATGATCGCGA AGAAGTGCCC GCAGATCTCG
GTCACGGTGG TGGACATCTC GCAGCCGAGG ATCGACGCGT GGAACTCGAG CGAGCTGCCG
ATTTATGAGC CCGGGTTGGA TGAGATCGTG TTCGAGTGCA GAGGGAAGAA CTTGTTCTTT
TCGACGGATG TCGAAGGGGC TATTCGAGAC TGCGAGATGA TTTTCGTCTC CGTGAACACG
CCGACGAAGA AGACTGGTTT AGGAAAAGGT AAGGCGGCGG ATTTGACGTA CTGGGAGTTG
GCGGCGCGCA CCATCGCGGC GTGCTCCGAG AGTGATAAGA TCATCGTTGA GAAGTCTACG
GTGCCCGTGC GAACGGCGGA GGCCATTGAA AAGGTGCTCC AGCGCAACTG CCCGCACGAC
GGCGTGCGAT TCGACATCTT GTCCAACCCA GAATTCTTGG CCGAAGGTAC GGCCATTGTA
GACTTGGATG CGCCCGATCG CGTGTTGATC GGGGGTAAGA TTGAAAACGC CAAAGGTCAA
GCGGCGGTGG ACGCGCTCGT AGGCGTGTAC TCCAACTGGG TACCGAAAGA GAACATTCTG
ACCGCTAACT TGTGGTCTGC CGAGCTCTCA AAGCTCGCTG CGAACGCATT CTTGGCGCAG
CGCATTTCTT CCATCAACGC CATGAGCGCG CTTTGTGAAG CGACCGGCGC GGATGTCCAG
CAAGTTTCGC ACGCCATCGG CACCGACACG CGTATCGGCT CCAAATTTTT GAACGCCTCG
GTCGGTTTCG GTGGCTCGTG CTTCCAAAAG GATATCTTAA ACTTGGCATA CATTTGTGAG
TGCCACGGCT TGCCGGAAGT GGCGGAGTAC TGGCACAGCG TTGTCGGCAT GAACGACTAC
CAAAAGAGCC GCTTCGTCAA GCGCATGATT TCCGCCATGT TCAACACGAT CAGCGGCAAG
AAGATTTCAA TGCTCGGTTA CGCTTTTAAG AAAGATACGG GTGATACTCG TGAGTCTCCG
GCTATCGACG TCGGTCACGG CCTCATCGAA GACGGTGCGA AGCTCAATAT TTATGACCCG
AAGGTTGCTG CGGCGCAAAT CGCGTTGGAC ATGGGCGAGG AAGCGATGAA ATCCATCACG
TGCTGCAAGA CTCACACTGA GTCTTTGACT GGTGCGCACG CCGTGTGCAT TATGACGGAA
TGGGACGAGT TCAAGACGTA CGACTGGGAA GCGATCTATG GCGTCATGCA AAAGCCGGCC
TTTGTGTTCG ACGGCCGCCT CATCTTGGAC CACCAGAAGT TGAAGGACAT TGGTTTCATT
GTGTATGCGC TTGGGAAGCC TTTAGATCCT TTCCTTTGCT CCGCCGAAGG CGCACCGTAA
 
Protein sequence
MTRPITTICC IGAGYVGGPT MAMIAKKCPQ ISVTVVDISQ PRIDAWNSSE LPIYEPGLDE 
IVFECRGKNL FFSTDVEGAI RDCEMIFVSV NTPTKKTGLG KGKAADLTYW ELAARTIAAC
SESDKIIVEK STVPVRTAEA IEKVLQRNCP HDGVRFDILS NPEFLAEGTA IVDLDAPDRV
LIGGKIENAK GQAAVDALVG VYSNWVPKEN ILTANLWSAE LSKLAANAFL AQRISSINAM
SALCEATGAD VQQVSHAIGT DTRIGSKFLN ASVGFGGSCF QKDILNLAYI CECHGLPEVA
EYWHSVVGMN DYQKSRFVKR MISAMFNTIS GKKISMLGYA FKKDTGDTRE SPAIDVGHGL
IEDGAKLNIY DPKVAAAQIA LDMGEEAMKS ITCCKTHTES LTGAHAVCIM TEWDEFKTYD
WEAIYGVMQK PAFVFDGRLI LDHQKLKDIG FIVYALGKPL DPFLCSAEGA P
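
The sequence listings above are grouped into blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of one way to do that. The function names `clean_sequence`, `gc_fraction`, and `span_length` are hypothetical helpers written for this illustration, not part of any database tool, and the coordinate arithmetic assumes the Start bp and End bp fields are 1-based and inclusive (which is consistent with the reported gene length of 1680 bp).

```python
def clean_sequence(blocks: str) -> str:
    """Join a space- and newline-grouped sequence listing into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


# First line of the gene sequence listing from this record, used as sample input.
first_line = "ATGACGAGGC CGATCACGAC GAGTGCGTGA AGAACGACGC GAGCGACGCG ACGCGACGCG"
seq = clean_sequence(first_line)

print(len(seq))                     # number of bases on that listing line
print(span_length(247778, 249457))  # gene length implied by the coordinates
print(round(gc_fraction(seq), 3))   # GC fraction of the sample line only
```

Passing the full gene sequence listing to `clean_sequence` and then to `gc_fraction` would give a value that can be compared against the GC content field of the record.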