Gene OSTLU_51237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_51237 
Symbol 
ID5005059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp20698 
End bp21846 
Gene Length1149 bp 
Protein Length360 aa 
Translation table 
GC content68% 
IMG OID640420480 
Productpredicted protein 
Protein accessionXP_001421029 
Protein GI145353457 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value0.587591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCGCGTCGAT GCGCCCTCGC GTCGCGTCGC GATCGACGCG TCGCGATCGA CGCGCGTCGA 
GCGCGCGCGC GTCGCCGACG GCGCCTAACG TGCGGACGAT CCTCGGGACG ATGACGTTCG
GGTGGCGACA CGCGAGCGAA GCGTGCGACG ACGACGCGAG CGCGCGAATG CTGGACGCGT
TCGCGCGCGC CGGGCACGAC GAGATCGACA CCGCGATCGC GTACGCGAAC GGGGAGACGG
AGCGAATCCT CGGACGCGTC GACGCGGGAC GGCGCGCGCG CGTCGACACG AAGGCGAATC
CGTGGCCGGG CGGGACGATG ACGCCGAGCG CGGGACGAGG GGGGTTAGGG GCGAACGAAC
TGCGGGCGCA GGTGCGACGG AGCGTGGAAT CGCTGCGAGG GACGAAGATT CGAACGCTGT
ATTTACACGC GCCGGACGCG GACACGACGC TGGAGGAGGC GCTGCGAGAG TGCGAACGGC
TGCGCGTCGA GGAGCGCGCG TTCGAAGACG TGGGACTGTC GAATTTTTCG GCGTGGGAGA
CGGTCAAGGC GCACGAGCTG TGCGAAAAGT ACGGGTGGAA GAGACCGACG ATTTATCAGG
GGATGTACAA CGCGCTGACG CGAAACGTCG AGGCGGAGTT GGTGCCGGCG CTGCGGGCGA
CGAAGATGCG CTTCGCGGCG TACAATCCCC TCTGCGGAGG GTTGTTGACG GGGAAATACA
AGGGCAACAC CGACGTCGGC GCGGTGTCCG GCGGGCGATT CGCCGGGAAC GACATGTATC
AGTCTCGATT TTGGTTGCCG TGCTATCACG AAGCCGTGGC CGAGGTGGTG GAGGCGTGCG
AGAAGCGCGG CGTCGCGCCC GCGGACGCCT CGCTGCGATG GCTCTACCGG CACTCCGCGT
TGGACGGCGC CGAGGGCGAC GCCGTCATCG TCGGCGCGTC GAGCGCGGCG CAGCTCGAGG
CGAATTTAGC GAGCGCCGCG CGCGAAGAGC CGCTGCACCG GGACATTCTC GACGCCTTCG
ACGCGGGTTG GGAAAAGTGT CGCGCGTCCG CCGCGCCGTA CTTTCGCGGC CACTGTAAAA
TCGCGCGTTG AAGTCATCAT CAAGTCACTC ATCGCAGTCA CCATCGTCGC GCCGTACTTT
CGCGGCCAC
 
Protein sequence
MRPRVASRST RRDRRASSAR ASPTAPNVRT ILGTMTFGWR HASEACDDDA SARMLDAFAR 
AGHDEIDTAI AYANGETERI LGRVDAGRRA RVDTKANPWP GGTMTPSAGR GGLGANELRA
QVRRSVESLR GTKIRTLYLH APDADTTLEE ALRECERLRV EERAFEDVGL SNFSAWETVK
AHELCEKYGW KRPTIYQGMY NALTRNVEAE LVPALRATKM RFAAYNPLCG GLLTGKYKGN
TDVGAVSGGR FAGNDMYQSR FWLPCYHEAV AEVVEACEKR GVAPADASLR WLYRHSALDG
AEGDAVIVGA SSAAQLEANL ASAAREEPLH RDILDAFDAG WEKCRASAAP YFRGHCKIAR