Gene OSTLU_42872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42872 
Symbol 
ID5003438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp62629 
End bp64140 
Gene Length1512 bp 
Protein Length503 aa 
Translation table 
GC content62% 
IMG OID640418859 
Productpredicted protein 
Protein accessionXP_001419272 
Protein GI145349712 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.701812 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACCG CCGGTGAAGA CGACGACGCG CGTCGAAGCG CGCTCGCCGC GGCGCTGAAA 
CTGCGAAACG CAAATTTGCT TCGTTTCGCG GGCGCGAGCG CGACGGACGC GTCGCGAGGC
GCGTCGCGCG CGAGCGAAAC GTTCGAGGTG AAGAACCCGG CTACAAACGC GACGCTCGCG
ACGGTGCTCT CCACGCCGCG CGCCGCGATT TCCGACGTCT TGAGGCGAAG CGAAGACGCA
CAGAAAGTGT GGGCGACGGA ATTCAGCGCG CACGCGCGAG GAAAGGTGAT CAGACGATGG
TTTGAGCTCG TCGAGGCGAA CGCCGAGGAC TTGGCGAGAA TCATGACGGC GGAACAAGGC
AAACCTTTGA TAGAGTCCCG GGGAGAGGTG GCGTACGCGG CATCGTTTTT AGAGTGGTTC
GCGGAAGAGG GGAAGCGCGT GTACGGTGAT GTCGTGCCTT CGTCGTCGAC GGGGACGAGA
ATCATGGTTG TGAAGCAACC AGTCGGCGTG ACGGCGGCGA TTACACCGTG GAATTTTCCT
TTGGCGATGA TCACGAGGAA AGCTGGTGCG GCGCTCGCGG CGGGGTGTTC GATGGTCGTC
AAGCCGAGTG AAGAAACGCC GCTGAGCGCG TTTGCGCTCG GGGTGCTCGC GGAGCAAGCT
GGGTGCCCGG ATGGTGTTTT ACAATTCATC GTGGGCGATC CGAGCGCGAT AGGCGCGGCG
CTGTGCGAGT CTCCCGTCGT TCGAAAAATT ACCTTCACCG GAAGCACGCG CGTGGGAAAG
TTATTGATGA AGCAGAGTGC GGACACCGTG AAGCGCGTAA GCATGGAGTT GGGTGGTAAC
GCGCCGTTTG TGGTGTGCGC CGACGCCGAC GTGGACGCCG CCGTTCAGGG CGCGATGGCG
AGCAAGTTTC GTAACGCGGG TCAAACGTGC GTATGCGCGC AACGTTTCAT CGTGCACGCG
TCCGTGGAGG CTGAATTCGT GCAAAAGCTC GCGGACGCTG CGAGCGCGCT CGTTATGGGC
GATGGCTTGG AAAATGAAGA CGCCACGCAA GGCCCGTTGA TCAACGCCGC GCACGCCGAA
AAAGTCGATT CGCACGTTCG CGACGCGATG AGCAAAGGCG CGGTGTGTCA CACCGGTGGC
AAGCGCGCGC ACGGTAGCTT TTATGAACCG ACCGTGTTGT CCAAGTGCAC GGAAGACATG
CTGGTGATGC GCGAAGAAAC GTTCGGACCT GTCGCCGCGG TGACGACGTT CGTCGACGAC
GCCGAAGCCA TTCGCATCGC CAACGCCACC ACCGCCGGTT TAGCGTCGTA CGTGTACACC
TCTGACGTCA AGCGTACGTT TTACTTTAGC GAAAAGCTTG ACTTTGGTAT CGTGGGCGTG
AACACGGGCG CCATATCCAC CGCCCAAGCT CCGTTCGGCG GGACGAAGGA GAGCGGGATC
GGTCGCGAGG GCGGCAAGGA CGGCGTTCAC GAGTACGTCG AGCAGAAATA CGTCTGCGTC
GGCGGCCTTT AG
 
Protein sequence
MSTAGEDDDA RRSALAAALK LRNANLLRFA GASATDASRG ASRASETFEV KNPATNATLA 
TVLSTPRAAI SDVLRRSEDA QKVWATEFSA HARGKVIRRW FELVEANAED LARIMTAEQG
KPLIESRGEV AYAASFLEWF AEEGKRVYGD VVPSSSTGTR IMVVKQPVGV TAAITPWNFP
LAMITRKAGA ALAAGCSMVV KPSEETPLSA FALGVLAEQA GCPDGVLQFI VGDPSAIGAA
LCESPVVRKI TFTGSTRVGK LLMKQSADTV KRVSMELGGN APFVVCADAD VDAAVQGAMA
SKFRNAGQTC VCAQRFIVHA SVEAEFVQKL ADAASALVMG DGLENEDATQ GPLINAAHAE
KVDSHVRDAM SKGAVCHTGG KRAHGSFYEP TVLSKCTEDM LVMREETFGP VAAVTTFVDD
AEAIRIANAT TAGLASYVYT SDVKRTFYFS EKLDFGIVGV NTGAISTAQA PFGGTKESGI
GREGGKDGVH EYVEQKYVCV GGL