Gene OSTLU_16034 details


Gene Information

Locus tag: OSTLU_16034
Symbol:
ID: 5003059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 162408
End bp: 163940
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table:
GC content: 58%
IMG OID: 640418480
Product: predicted protein
Protein accession: XP_001418857
Protein GI: 145348852
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
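The coordinates and lengths listed above are mutually consistent: 163940 - 162408 + 1 = 1533 bp, and 1533 / 3 = 511 codons, which leaves 510 amino acids once the stop codon is excluded. A minimal Python sketch of that check follows. It assumes 1-based, inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(162408, 163940)
print(length_bp, protein_length(length_bp))
```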


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0673665
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.542124
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGCGG AGGAAGTGCG AGAGAGAATC GCCAAAGCGG CGAAGGCGCA GAGAGAGTGG 
GCGAAAAGCT CGTTCGCGAC GCGTCGGAAG CTGCTGCGGG TGATACAGCG GTTCATATTG
GAAGAGCAGG ATACGATTTG TCGAGTGAGC GCGCGAGACA GCGGTAAGCC GTTGGTGGAC
GCGGCGTTCG GGGAGGTTCT GGTGACGCTG GAGAAGATTC GATGGCTGTG CAACGAGGGC
GAGCGGTGGT TGAAACCCGA GAAGCGATCG ACCGGGGCCA TGATGTTTTA TAAGAAGGCG
CGGGTGGAGT ATCATCCGGT GGGCGTCATG GGCGCAATCG TGCCGTGGAA TTATCCGTTT
CACAACGTTT TCAATCCTTT GGTGGCGAAC TTGTTCGCGG GGAACGCGCT CGTCGTCAAG
GTGAGTGAGT ACGCGAGCTG GAGCTCGCAG TACTACGGTC GCGTCATCGA TGCCGCGCTT
GATGCCGTGG GAGCGCCGCG CGACTTGGTG CAAATCATCA CCGGTTACGG CGAAGCGGGC
AGCGCGTTGG TCACTGGCGG TGTGCAAAAG GTAGTCTTCG TCGGATCCAC TGGCATCGGA
CGTAAGGTTA TGGAGGCAGC GGCGAAGACT TTGACTCCGG TCGTCCTCGA ACTCGGTGGT
AAAGATCCGT TCATCGTCTG CGCAGACGCG GATCTCAAGC AGTGCGTTCC CATGGCGCTG
CGCGGCGCGT TTCAATCGTG CGGTCAAAAC TGCGCGGGAG CCGAACGATT CTACGTGCAT
GAGAAGATTC ACGACAAGTT TTTAGCCAAA GTACTGGAGT CGGCGAGGAA GTTACGTCAA
GGATGGGCGC TGAGCTCGTC CGTGGATTGT GGGGCGATGT GCATGCCAAA GCAAGCGCAG
TACGTGCAGT CTCTCATCGA CGACGCCGTC GCACGTGGTG CGACCGTCCA CGTCGGTGGT
AAGATCGAAC TGGGCGCCCA GGGTGGTCAG TTCTACCCTC CGACAGTGAT CTCTGGCATC
ACGCACGATA TGCGAATCGC TCGCGAAGAG GTCTTTGGTC CCGTGCTCGC CATCGTCAAG
ACAAAGAGCG ACGAGGAATC CATCTCGCTC GCGAACGACT GCGACTTTGG TCTCGGATCA
AATGTTTTCA CGCGCTCAAC GAGACGCGCA GAAAAGCTTG GCTCACAGCT GGAGGCCGGT
ATGACTTCCA TCAATGACTT TTGCTCGACG TACATGGCGC AGTCCCTTCC CTTTGGCGGC
GTCAAGGAAT CCGGCTTCGA CCGCTTCGCC GGTATTGAAG GACTCCGCGG TTGCTGCGTC
CCGAAATCCG TCGTCGTCGA TCGATTTCCA TGGCTCATGA AGACCAATAT CCCTCCTCCG
TTGTGCTACC CCGTGGCGGA TAACGCGTTT GCCTTTTGCA AGGCGCTGTC GCGCATGTTC
TTCGGCTTGA ACGTCGCGCA ACAGTTCGGT GGATTGTTGT CGCTCGCCAA GTGCTTCCTC
ATGCCATCGA ATTCTTACAC CAAGTACGAT TAA
 
Protein sequence
MRAEEVRERI AKAAKAQREW AKSSFATRRK LLRVIQRFIL EEQDTICRVS ARDSGKPLVD 
AAFGEVLVTL EKIRWLCNEG ERWLKPEKRS TGAMMFYKKA RVEYHPVGVM GAIVPWNYPF
HNVFNPLVAN LFAGNALVVK VSEYASWSSQ YYGRVIDAAL DAVGAPRDLV QIITGYGEAG
SALVTGGVQK VVFVGSTGIG RKVMEAAAKT LTPVVLELGG KDPFIVCADA DLKQCVPMAL
RGAFQSCGQN CAGAERFYVH EKIHDKFLAK VLESARKLRQ GWALSSSVDC GAMCMPKQAQ
YVQSLIDDAV ARGATVHVGG KIELGAQGGQ FYPPTVISGI THDMRIAREE VFGPVLAIVK
TKSDEESISL ANDCDFGLGS NVFTRSTRRA EKLGSQLEAG MTSINDFCST YMAQSLPFGG
VKESGFDRFA GIEGLRGCCV PKSVVVDRFP WLMKTNIPPP LCYPVADNAF AFCKALSRMF
FGLNVAQQFG GLLSLAKCFL MPSNSYTKYD
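
Both sequences are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before any analysis. The Python sketch below shows one way to do that, and then to compute GC content and translate the coding sequence. Because the Translation table field of this record is blank, the use of the standard genetic code (NCBI table 1) is an assumption. All function names are hypothetical.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases are reported as X.
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first 18 bases of the gene sequence above.
print(translate("ATGCGGGCGG AGGAAGTG"))
```

Applied to the full gene sequence, the results of `gc_content` and `translate` can be compared against the GC content and protein sequence reported in this record.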