Gene OSTLU_3884 details

Gene Information

Locus tag: OSTLU_3884
Symbol:
ID: 5006405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009372
Strand:
Start bp: 118503
End bp: 119561
Gene Length: 1059 bp
Protein Length: 353 aa
Translation table:
GC content: 51%
IMG OID: 640421826
Product: predicted protein
Protein accession: XP_001422305
Protein GI: 145356159
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
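
The coordinate and length fields in this record can be cross-checked with a short calculation. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the gene is unspliced, as listed. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 118503
END_BP = 119561


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def codon_count(length_bp: int) -> int:
    """Number of complete codons in a coding sequence of the given length."""
    return length_bp // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)               # 1059
print(codon_count(length_bp))  # 353
```

Under these assumptions the 1059 bp span is exactly 353 codons, which matches the listed protein length and suggests the stated coordinates do not include a stop codon.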


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.0376581
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGCTGGCT TCATCGGTAT GGGTCGTCTG GGATTGTGTA CCGCCCTCAA GTTTGAGCAA 
GCGGGTTGGG ATGTCTGTGG GTCTGACGTC TTTCCGTCGT ACGTTGAAAG TATCAACGAC
AAGAGTCTTC GAAGCAAAGA GCCCGGAGTC GAAGAGGCTT TGCGAAAGAG CACGCGTTTG
CGAGCGACTC TGAATCTCCT TGATGTCCTA GAGCATGCCG ATATCGTTTT CATTCTCGTC
GCGACTCCAA CCAGCGCGGG CGAAGAGGCC TACGACACGA CAACTCTGAG CAGGGTCTTG
AGTGATATCG CTAAATTAAG GCCGACAAAC AAACACATCG TGATTTGCTG CACAGTGTTG
CCTGGTTACA TCTCGAACAT CGGTAGTTAC CTTATCGAAA GTTGCACTGG ATGTAGTTTG
AGTTACAATC CAGAGTTCAT CGCCCAGGGT GAGATCATGA AGGGCCTCAG TGAACCCGAC
GTGGTGCTCA TAGGGGAAGG AAGCGAAGAA GCCGGTGATA TTCTACAGTT TTTATACGAG
ACCGCGACGT CCAATGAGCC TCGGATCTGT CGCATGTCTC CGCAAAGCGC TGAGATAATG
AAACTAAGCG TAAACTGCTT TGTTACCACG AAGATCAGCT TCGCCAATAT GATTGGTGAC
ATTGCTGACG CGACGCCTGG TGCAGACAAA TTCGACATCC TTAGAGCCGT TGGTCAGGAT
ACACGTGTCG GCCACCGGTG CATCCTTCCA GGTTACGGCT TCGGAGGTCC TTGCTTTCCA
CGTGATAATA GAGCACTCGG AATGTACGCA CGCAAGGTTG GAATCACTCC TTCAATTTGC
GACGCGACGG ATGAATACAA CCGACTTCAC GCGGACGCCA TGGTAAAGGC TCTTTTGGAA
CAAAAACTAG AGCATTATAC CATCAGTGAT GTTGCTTACA AGCCACAGTG TCCGGTGGAT
ATTATTGAAG AGTCGCAACC ACTCGAAGTG GCCAAGAGAC TTGTTCAAGC AGGCAAGCGA
GTCGTTATAC GCGATCGACC CGCCATCATC GAGCTCGTA
 
Protein sequence
RAGFIGMGRL GLCTALKFEQ AGWDVCGSDV FPSYVESIND KSLRSKEPGV EEALRKSTRL 
RATLNLLDVL EHADIVFILV ATPTSAGEEA YDTTTLSRVL SDIAKLRPTN KHIVICCTVL
PGYISNIGSY LIESCTGCSL SYNPEFIAQG EIMKGLSEPD VVLIGEGSEE AGDILQFLYE
TATSNEPRIC RMSPQSAEIM KLSVNCFVTT KISFANMIGD IADATPGADK FDILRAVGQD
TRVGHRCILP GYGFGGPCFP RDNRALGMYA RKVGITPSIC DATDEYNRLH ADAMVKALLE
QKLEHYTISD VAYKPQCPVD IIEESQPLEV AKRLVQAGKR VVIRDRPAII ELV
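
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below, in Python, shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the method the database used for its reported GC content is not stated in the record, so the result of this calculation may differ from the listed value by rounding or definition.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short illustration.
first_line = "CGCGCTGGCT TCATCGGTAT GGGTCGTCTG GGATTGTGTA CCGCCCTCAA GTTTGAGCAA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.2f}")  # GC fraction of this one line only
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` gives the whole-gene figure, and `len()` of the cleaned gene and protein strings can be compared against the Gene Length and Protein Length fields.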