Gene OSTLU_50750 details

Gene Information

Locus tag: OSTLU_50750
Symbol:
ID: 5004022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 371416
End bp: 373346
Gene Length: 1931 bp
Protein Length: 523 aa
Translation table:
GC content: 61%
IMG OID: 640419443
Product: predicted protein
Protein accession: XP_001420154
Protein GI: 145351589
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0113332
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCGC CGCCGCCGGG ACGATGCGGA CGGGTGGAAC AATTTATTAA TAACGCGTGG 
GTGCGCGCGC CGCAGGCTGC GGCGCAGGCG CAGGCGCTCG GCGGTGCCGC GAGGACGCTG
CCGGTGGTGA ACCCGCACGA CAATAAAAAC ATCGGTGCCG TCGGTGCGGG CGATCGCGCG
ATGATCGACG ACGCGGTGCG GGCGGCGCGC AAGGGATATA AGGTGTGGAG CGCCACGCCG
GGGCGCGAAA GGTCGCGGGT GCTGCGGGGG ATTGCGAGAG GGATCGAACG GCGGAAACGC
GCGCTCGCCG AGCTGGAGAC GCTGGACGCG GGGAAGCCGA TCGAGGAGAG CGAGTGGGAT
ATCGATGATG TCAGCGCGTG CTTTGATTAT TACGCCGATC GGTGCGATGA AGTGTTCGGG
GATAAGGCGT ACGCGGAGGA GGATGTGAAG TTACCCATGG ACGAGTTCGC GGGACGGTTG
CGACGCGAGG CGCTGGGGGT GATCGGTTTG ATCACGCCTT GGAACTATCC GTTGCTCATG
GCGACGTGGA AAGTCGCGCC CGCGCTCGCG AGTGGATGTG CGGTGGTGCT GAAACCGAGT
GAGCTCGCGA GCTTGACGTG TCAGGTGTTG GGGGACGTGT GCGTCGAAGC AGGGTTGCCG
CCGGGGGCGT TCAACGTAGT CACGGGGCGA GGCGACGAGG CTGGCGCTGC GCTGTGCGCA
CACAAGGGTG TGGATAAAAT ATCGTTCACT GGCTCGTTGC AAACCGGCCG CATCATCATG
AGCGCGTGCG CGAAAGATGT CAAGCCCGTA TCGTTGGAAT TGGGCGGCAA GAGCGCGTTG
GTCATTTTCG ACGACTGCGA TTTGGAAAAG GCGGTGGAGT GGGCTCTGTT TGGGTGTTTC
TGGACGAATG GTCAGATCTG CAGCGCGACG TCGCGCGTCT TCATTCACGA GCGCATTCGA
GAGAAGTTCT TAGCGCGTCT CAAGGAGGCT GCGGAAGCCA TTCCGTACGG CAACCCACTC
GTCAAGGGGT GTCGCCTCGG CCCGCTCGTG AGCGAAGGGC AGTACAAAAA GGTGATGAAA
ATGGTCGAAC GCGCGAAGCG CAAGGGATAC ACGCTTTTGA CGGGCGGCAA GAAACCGAGC
GATCCGGACT GTCGAGAGGG GTTCTATCTC GAACCCACGG TATTCGTCGA CGTCCCTATG
GATGCGGAAG TGTGGCGAGA AGAAATTTTT GGCCCGGTGA TGTGCGTCAA GACGTTCGCC
TCTGAAACGG AAGTCGTGGC GATGGCGAAC GATTCCGATT ACGCCTTAGC CGCGGCGGTG
ATCACAGACG ACTTAGCGCG ACGCGAGCGT ATGACTGCCG CGTTTGACAC CGGGATCGTG
TGGGTGAATT GCTCGCAACC CTGCTTCGCT CAGCTCCCGT GGGGCGGTCG CAAGCGAAGC
GGATTCGGTC GCGACCTCGG CGCGAACGGC ATGGACAAGT ACTTGCACCA AAAGCAAGTC
GTCACCTACG TGTCCGAAGA CCCGTTCGCG TGGTACCCCA TGTTCGACGC CAAGCCGAAC
AGCAAGCTGT AGCGCGTGTA CTATCGTCGT CGATGAGTGA ATGAATAAAT ATGTATTTCA
AGATTTCAAC GCGAGCGTCG AGCGCTCCTT CGCCCTCCGG CGACTCAACT CGTTCGTCTC
GACGCCTCAC GGCGGCAACG CCGCGGTAAT TTGCACCTCA ACCAACAATT CAGGTCGCGC
CAACTCGCTC TGCACGCACG CCCGCGTGGG TTTATTATCC GGATCGATCC ACGCGTTGTA
CACGGCGTTG AACTCGGGCG CGTCTTTGAT GTCCCGCAGC CAGCACATCG AGGTCAATAT
TCGCGACTTA TCCGTCCCCG CCATCGCGAG CAGTTTGTCA AACTTTTCCA GCGTTTCGAT
CGTTTGTTCT C
 
Protein sequence
MAAPPPGRCG RVEQFINNAW VRAPQAAAQA QALGGAARTL PVVNPHDNKN IGAVGAGDRA 
MIDDAVRAAR KGYKVWSATP GRERSRVLRG IARGIERRKR ALAELETLDA GKPIEESEWD
IDDVSACFDY YADRCDEVFG DKAYAEEDVK LPMDEFAGRL RREALGVIGL ITPWNYPLLM
ATWKVAPALA SGCAVVLKPS ELASLTCQVL GDVCVEAGLP PGAFNVVTGR GDEAGAALCA
HKGVDKISFT GSLQTGRIIM SACAKDVKPV SLELGGKSAL VIFDDCDLEK AVEWALFGCF
WTNGQICSAT SRVFIHERIR EKFLARLKEA AEAIPYGNPL VKGCRLGPLV SEGQYKKVMK
MVERAKRKGY TLLTGGKKPS DPDCREGFYL EPTVFVDVPM DAEVWREEIF GPVMCVKTFA
SETEVVAMAN DSDYALAAAV ITDDLARRER MTAAFDTGIV WVNCSQPCFA QLPWGGRKRS
GFGRDLGANG MDKYLHQKQV VTYVSEDPFA WYPMFDAKPN SKL
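
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, not part of the original record, of how the blocks could be cleaned up and how the reported gene length relates to the Start bp and End bp values, assuming inclusive 1-based coordinates. The function names (clean_sequence, gc_fraction, span_length) are hypothetical helpers introduced here for illustration only.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = (
        "ATGGCCGCGC CGCCGCCGGG ACGATGCGGA "
        "CGGGTGGAAC AATTTATTAA TAACGCGTGG"
    )
    seq = clean_sequence(first_line)
    print(len(seq))                     # six blocks of ten bases
    print(span_length(371416, 373346))  # compare with the Gene Length field
    print(round(gc_fraction(seq), 2))   # GC fraction of this one line only
```

Applied to the full gene sequence, gc_fraction can be compared against the GC content field in the record, bearing in mind that the record rounds to a whole percent.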