Gene OSTLU_37521 details

Gene Information

Locus tag: OSTLU_37521
Symbol:
ID: 5006092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 377040
End bp: 378044
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table:
GC content: 67%
IMG OID: 640421513
Product: predicted protein
Protein accession: XP_001421923
Protein GI: 145355344
COG category: [R] General function prediction only
COG ID: [COG0656] Aldo/keto reductases, related to diketogulonate reductase
TIGRFAM ID:
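
The length fields in this record agree with its coordinates if the start and end positions are read as 1-based and inclusive. The page does not state its coordinate convention, so that reading is an assumption, and the helper names below are hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(377040, 378044)
print(length, protein_length_aa(length))  # prints "1005 334"
```

Both results match the Gene Length (1005 bp) and Protein Length (334 aa) fields listed for this gene.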


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value: 0.414085
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00268426
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGCGCG CGCTCAGCGC GCGACGCACG AAGGCGCGCG CGGTGACGCC GCGCTGCGAC 
GCGTCGCGCG TCGCGCGACT CGCGACGGGA CGCGTCGTGT CGCGCGTCGG ATTCGGCACC
GCGGCGTGGG GCGACGAGAC GCGCGGGTTC GGGACGCGCT ACCGCGAGCG CGACCTCGCG
GCGGCGCTAT CGCGCGCGCT CGAGCGAGGC GTCACGTTCG TCGACACCGC GGAGACGTAC
GGGGCAAGCG CGCGGGCGTT CGAACAGGGC GCGGAGGAGA TGGTGAGACG CGCGAGGACG
ACGGCGAGGC GAGACGACGC GCGAGACGAC GCGTTCGTCG GAACGAAAGT GTTGACGGTG
CCGTGGACGA ACGTGAGCGC GGGAGGGGAC GTGCGATCGA CGACGAAGAG CTTGGTGGAC
GCGATCGAGG CGTCGGTGGG GAGGAACGGG GGGGAGGCGT ACGATTTGGT GTCGATTCAT
TTTCCGTTTC CGACGTGGAC GCAGAGCGCG CTGTGCGACG CGCTCGCGGA GGCGACGGAG
CGAGGGCTGT GTCGGGCGGT GGGGGTGAGT AATTACGACG TAAGGCAGAT GACGGAGGCG
CATGGGTTGT TGGCGAAGCG TGGGATCGCG TTGGCGACGA ATCAGGTGAA ATATTCCGTG
CTCGATCGAG GCGCCGAAAA GAGCGGGGTG CTCGCCGCGG CGCGGGATTT AGACGTCGCC
GTCGTGGCGT ATTCGCCCTT GAGCGGTGGG GCGCTGCGGA CGAGCGCGGA CCCGGAGATT
CGCACGTTGG ACAAGTTGCT CGAGTTCATC GGCGCCGTCA ACGGTGGTTG GACGTCGGCG
CAGGTGGCGT TGAACTATCT CGTCCGCAAG GGCGCGATTC CGATTCCGAG TTGTACGAGC
GTCGCGCGCG CCGACGCCAT CGCGGACGTC CTCGAATTCG AGCTCGGCGT CGAAGACATC
GAGACTATCG ATGAAAAAAT GGATTACATC GAACGCAAGT CGTGA
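
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any composition statistic such as GC content can be computed. The sketch below shows one way to do that. The function names are hypothetical, and only the first line of the sequence is used as input, so the printed value describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGGCGCG CGCTCAGCGC GCGACGCACG AAGGCGCGCG CGGTGACGCC GCGCTGCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions over all lines of the gene sequence would give a figure comparable to the GC content field in the gene information.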
 
Protein sequence
MRRALSARRT KARAVTPRCD ASRVARLATG RVVSRVGFGT AAWGDETRGF GTRYRERDLA 
AALSRALERG VTFVDTAETY GASARAFEQG AEEMVRRART TARRDDARDD AFVGTKVLTV
PWTNVSAGGD VRSTTKSLVD AIEASVGRNG GEAYDLVSIH FPFPTWTQSA LCDALAEATE
RGLCRAVGVS NYDVRQMTEA HGLLAKRGIA LATNQVKYSV LDRGAEKSGV LAAARDLDVA
VVAYSPLSGG ALRTSADPEI RTLDKLLEFI GAVNGGWTSA QVALNYLVRK GAIPIPSCTS
VARADAIADV LEFELGVEDI ETIDEKMDYI ERKS
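
The protein sequence can be checked against the gene sequence by translating codon by codon. The Translation table field of this record is empty, so the sketch below assumes the standard genetic code (NCBI translation table 1). The `translate` function is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 45 bases of the gene sequence above.
print(translate("ATGCGGCGCG CGCTCAGCGC GCGACGCACG"))  # prints "MRRALSARRT"
print(translate("GAGACTATCG ATGAAAAAAT GGATTACATC GAACGCAAGT CGTGA"))  # prints "ETIDEKMDYIERKS"
```

The two outputs match the first and last blocks of the protein sequence listed above, which is consistent with the standard code applying to this gene.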