Gene OSTLU_16870 details


Gene Information

Locus tag: OSTLU_16870
Symbol:
ID: 5003834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 524214
End bp: 526646
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table:
GC content: 59%
IMG OID: 640419255
Product: predicted protein
Protein accession: XP_001419832
Protein GI: 145350899
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase; [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00657] aspartate kinase
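
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon; the record does not state either convention, but both are consistent with the values it reports. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 524214
end_bp = 526646

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which codes for no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2433
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 810
```

Both results match the Gene Length (2433 bp) and Protein Length (810 aa) fields of the record.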


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.254094
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0243125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGTCG CGGTGAAGGG TGAACCAAAG GTGACGGATT GCTTGATAAA CGCGACGGAT 
ATGGCGGCGA AACGGGACGA TACGTACGCG GCGGAGTTGA CGAAGCTCGA GGATAAGCAC
GTGTCCACGG CGAAGGCTTT GTTGACGGAT AAAGCCGAGT ACGACGCGTA CATCGCGGCG
TTCAACGAGG AATTGAACGA TTTACGAGCG ATGTTTAAGG CTATTTACAT CGCCGGTTGC
TCGACCGATG CGTTCGGGGA TTTCGTCGTC GGACACGGAG AGCTCTGGAC GGCGCGCTTG
TGCGCGGCGA CGATTCGCTG CAAGGGGGGC AAGGCGGTCT GGATCGACGC GCGAGACATT
CTCGTCGTCA CAGAATCCGA GGACGGCGGC GTGGACGTCG ACTACAGCTT GTCCAACGCG
AACTTGGATA AGTGGTACGA CGAGCATATG CAAGAGGGCG CCGTGGTCAT GGTGACTGGT
TTCATCGCGA GGACGCCGGA GGGCGTGCCG ACGACGCTCA AACGTAACGG TTCCGATTAC
TCCGCCACAA TTTTCGGTGC GCTCACGCAA GCGAGAAATA TCACCATCTG GACCGACGTC
GATGGCGTAT TCAGCGCCGA TCCTCGCCGC GTGAAGGGGG CCAAGTGCCT GAACTCGATT
TCCTACAACG AAGCGTGGGA ACTCGCGTAC TTTGGCGCCA ACGTTCTCCA CCCGCGCACG
ACTTTACCGG CGATGAAGTA CAACATCCCA GTCACGTTGC GCAACTACTT CAACCAAGCC
GCGCCAGGGA CGTCCATCGG CATGGCGTGC CCGCTGCCCG CCGGGGATGA GGGCAACGTC
GGCAAGTTTG AGACTCGCGA CATGAGCGGC GAGCTCGTCA AGGGTATCGC CACCATCGAC
GACGTGTGCC TCATCAACGT CGAGGGTACG GGCATGGTTG GTGTGCCGGG CACGGCCAAC
ACTGTGTTCA AAGCCGTCAA GGAAGCTGGC TGCAACGTGG TTATGATTTC TCAAGCTTCC
TCGGAGCACT CCATTTGCTT TGCGGTTCGC TCGCACGAGG CGGACGCGGC AGTCGCGGCG
CTCAACAAGA CGTTCGAGAA AGCCATCGCC GCCGGTCGCA TCTCTCGAAT CCTCCCTTTG
AAGGATTGTT CCATCTTGGC CATCGTCGGG CAGAACATGT GCCAAACGCC GGGCGTGTCG
GCGATGTTGT TTGAAGCGCT GGCGCAGAGC GCCGTCAACG TCATCGCCAT CGCGCAAGGG
GCGTCCGAGT ACAACATCAC GGTCGTCGTC TCCAAGAAGG ACGTCAATAA GGCCCTTCAA
GCCGCTCATG GTCGATTCTA CCTCTCCAAG ACCGCGATTA GCGTCGGTCT CGTGGGACCA
GGCCTCGTCG GCAAGACGTT GCTTCGCCAA ATGAAGGAAC AACTCGAAAC GCTCCAAGAC
GAGTTTGCGG TTGAACTGCG CGTGGTTGCC ATCACGGGCG GTCGTAAGAT GTTGCTCAGC
GACGGTGCGA TCGACTTAGA TTCGTGGGAA GACGAATACG CTAGTGGTGT GCAGGCGAAC
ATGGACGACT TCACAAAGCA CGTGCTCGAG TCGGAGGCGC CAAACCAAAT CATCGTCGAC
TGCTCCGCCT CCGACGTCGT CGCGGGTCAC TACAAGGAGT GGTTGTCCAA GGGATTGCAC
GTCGTCACGC CGAATAAGAA GGCGAACAGC GGACCGCTCG CGTATTACAA GGAATTGCGT
TCCATTCAAC GTAATTCGTA CACGCACTAC TTTTATGAAG GCACCGTCGG CGCCGGCTTG
CCCATCATCG CCACGCTTCA AAGCCTGCGA AACTCGGGCG ACAAGGTCGA GCGCATCGAA
GGCATCTTCT CCGGTACGCT TTCTTACATC TTCAACACCT TGGAACCGGG CAAGAAGTTC
AGCGACATCG TGGCGCAGGC CAAGGAAGCG GGATACACCG AACCCGATCC CCGCGACGAC
TTGAGCGGTA TGGACGTCGC TCGCAAGGTG ACCATCTTGG CGCGCGAATG CGGTTTGAAC
ATCGAGCTCA GCGATGTCCC GATTCAATCT CTCGTTCCCG AGCCGTTGCG TGACATCGAA
AGCGTGGATG AGTTCATGAA GGAGCTTCCC AAGTACGACG GCGATATTTT GTCCAAGCAA
GAAGAAGCCG CCGCCGCGGG CGAAGTCTTG TGCTTCGTCG GTGTCGTCGA CGTCAAGAAC
GGCACGGGCT CGGTCGAACT CCGTCGTTAC CCCGCCGACC ATCCGTTCGC TCAGCTCAAG
GGCAGCGACA ACATCGTCTC CTTCACCACA AAGTATTACA CGTCCGCTGG ACCCTTAGTC
GTTCGCGGTC CGGGCGCCGG CGCCGAGGTC ACCGCCGGCG GCGTCTTCGG CGACATCTTG
CGCGTGTGCG CCTACCTCGG CGCTCCGTCA TAA
 
Protein sequence
MGVAVKGEPK VTDCLINATD MAAKRDDTYA AELTKLEDKH VSTAKALLTD KAEYDAYIAA 
FNEELNDLRA MFKAIYIAGC STDAFGDFVV GHGELWTARL CAATIRCKGG KAVWIDARDI
LVVTESEDGG VDVDYSLSNA NLDKWYDEHM QEGAVVMVTG FIARTPEGVP TTLKRNGSDY
SATIFGALTQ ARNITIWTDV DGVFSADPRR VKGAKCLNSI SYNEAWELAY FGANVLHPRT
TLPAMKYNIP VTLRNYFNQA APGTSIGMAC PLPAGDEGNV GKFETRDMSG ELVKGIATID
DVCLINVEGT GMVGVPGTAN TVFKAVKEAG CNVVMISQAS SEHSICFAVR SHEADAAVAA
LNKTFEKAIA AGRISRILPL KDCSILAIVG QNMCQTPGVS AMLFEALAQS AVNVIAIAQG
ASEYNITVVV SKKDVNKALQ AAHGRFYLSK TAISVGLVGP GLVGKTLLRQ MKEQLETLQD
EFAVELRVVA ITGGRKMLLS DGAIDLDSWE DEYASGVQAN MDDFTKHVLE SEAPNQIIVD
CSASDVVAGH YKEWLSKGLH VVTPNKKANS GPLAYYKELR SIQRNSYTHY FYEGTVGAGL
PIIATLQSLR NSGDKVERIE GIFSGTLSYI FNTLEPGKKF SDIVAQAKEA GYTEPDPRDD
LSGMDVARKV TILARECGLN IELSDVPIQS LVPEPLRDIE SVDEFMKELP KYDGDILSKQ
EEAAAAGEVL CFVGVVDVKN GTGSVELRRY PADHPFAQLK GSDNIVSFTT KYYTSAGPLV
VRGPGAGAEV TAGGVFGDIL RVCAYLGAPS
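
The gene and protein sequences above can be related to each other by translating the coding sequence codon by codon. The following is a minimal sketch, not the database's own method: it assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty, and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first three and the last four 10-base blocks of the gene sequence are used as input.

```python
# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a DNA string."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and final 33 bases of the gene sequence listed above.
first_bases = "ATGGGCGTCG CGGTGAAGGG TGAACCAAAG"
last_bases = "CGCGTGTGCG CCTACCTCGG CGCTCCGTCA TAA"

print(translate(first_bases))  # MGVAVKGEPK
print(translate(last_bases))   # RVCAYLGAPS
```

The two outputs agree with the first and last ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.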