Gene OSTLU_40785 details


Gene Information

Locus tag: OSTLU_40785
Symbol:
ID: 5002206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 536665
End bp: 538158
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table:
GC content: 59%
IMG OID: 640417627
Product: predicted protein
Protein accession: XP_001418517
Protein GI: 145348146
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
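
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 536665
end_bp = 538158
gene_length_bp = 1494
protein_length_aa = 497

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that does not appear in the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three checks agree with the record: the span is 1494 bp, which is 498 codons, or 497 amino acids plus one stop codon.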


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.725056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.204992
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACGAG GGCGAACGAC GACGAACGCG AGGACTAACC CGGAGCGCGG CGCGCGCGAC 
GCGAGCAGGG CGCAGGCGAC GGCGAGCAAG TCGAAGGCGA GGAGCGCGGA CGTCAGCGCG
ACGGGGGCGA ATTTGAAGGA GTGGAACCCG AAGTCTTGGC GACAGCGCGA GGCGCTGCAA
CAACCGAACT ATGAAAATCA GGCTGAGCTG GAGGAGGCGC TGAAGGTGAT CGCCAACCGG
CCGCCGCTCG TGTTCGCGGG AGAAGCGCGC GACTTGCAGG AAAAGCTCGC GAACGCGGCG
GCTGGTAACG CGTTCGTCTT GTTCGGTGGT GATTGTGCGG AAAGTTTTAG AGATTTCACG
TCGGATAACG TGCGGGATAC GTACCGGGTT TTGCTGCAAA TGTCCGTCGT GTTGATGTAC
GGCTCGGGCG TGCCCGTGGT GAAGCTGGGA CGAATGGCTG GCCAGTTCGC CAAACCCCGT
TCCGAAGACT TGGAAACGAT CGATGGGCTC TCGTTGCCGT CGTACAGAGG CGATAACATC
AACAGCTGTG AGTTCACCCC GGAAGCGCGT CGGCCGGACC CTTCGCGTTT GGTCAAGGCG
TACGATCAGT CGTGCGCCAC GCTCAACTTG CTGCGAGCTT TCAGTAACGG AGGGTACGCC
GCGATGACGC GCGTGAGCGA TTGGAATTTG GATTTTATGG AAAACACCGA ACGAGGCAGT
CAATATGAGG ATCTTGCGCA GCGCGTCGAC GCCGCGATTG ACTTTATGGC GGCGTGCGGC
ATCGACGAAA CTCATCCGTC GATGCAAGAG ACATCCTTCT TCACGGCGCA CGAAGCTCTT
CACCTGGGTT ATGAAGAATC GCTCACGCGT TTGGACTCCA CGACCGAGGA GCATTACGGC
TGTTCCGCGC ATTTCTTGTG GTGTGGTGAG CGCACGCGCC AACCCGAAGG CGCACACATG
GAATACTTCC GCGGTATTTC CAACCCGATC GGCATCAAGA TTTCCGACAA GAGCGACGGC
GAGGGTGTGG TGAGCTTGGT GAAGAAGTTG AACCCGGACA ACGTCCCGGG TCGCATCACC
CTCATCTCTC GCATGGGTGC TGCCAAGTTG CGCGAGCATC TTCCGCGTCT CATCACCGCC
ATTGAAGACG CCGGGCTCAA CGTGTTGTGG GTTACGGATC CCATGCACGG GAACACCATC
AAGACTGATA ACGGTTTCAA GACGCGTCCG TTCGAGGCGG TGCGCGACGA AATTATGGCA
TTCTTTGAAG TGCACGAAAA GATGGGTACT TATCCTGGTG GGGTTCACTT AGAGATGACG
GGGCAAAACG TCACCGAGTG CACGGGCGGC ATCATGGACG TTTCGGTGTC TGATTTGGAA
AAGCGCTACC TCACCCATTG TGATCCGCGC TTGAACGCGA GCCAAGCCAT CGAGCTTGCG
TTTTTGATGG CTAGCGAGTT GAACGATATG CGTCGTCGCC GCGCGGCGCA ATAA
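
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as a short demonstration; pasting the full sequence in its place would give the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a demonstration.
first_line = "ATGGGACGAG GGCGAACGAC GACGAACGCG AGGACTAACC CGGAGCGCGG CGCGCGCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 3))
```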
 
Protein sequence
MGRGRTTTNA RTNPERGARD ASRAQATASK SKARSADVSA TGANLKEWNP KSWRQREALQ 
QPNYENQAEL EEALKVIANR PPLVFAGEAR DLQEKLANAA AGNAFVLFGG DCAESFRDFT
SDNVRDTYRV LLQMSVVLMY GSGVPVVKLG RMAGQFAKPR SEDLETIDGL SLPSYRGDNI
NSCEFTPEAR RPDPSRLVKA YDQSCATLNL LRAFSNGGYA AMTRVSDWNL DFMENTERGS
QYEDLAQRVD AAIDFMAACG IDETHPSMQE TSFFTAHEAL HLGYEESLTR LDSTTEEHYG
CSAHFLWCGE RTRQPEGAHM EYFRGISNPI GIKISDKSDG EGVVSLVKKL NPDNVPGRIT
LISRMGAAKL REHLPRLITA IEDAGLNVLW VTDPMHGNTI KTDNGFKTRP FEAVRDEIMA
FFEVHEKMGT YPGGVHLEMT GQNVTECTGG IMDVSVSDLE KRYLTHCDPR LNASQAIELA
FLMASELNDM RRRRAAQ
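
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only: the Translation table field of the record is empty, so the standard genetic code (NCBI translation table 1) is assumed here, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Assumption: standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGGACGAG GGCGAACGAC GACGAACGCG"))  # MGRGRTTTNA
```

The output matches the first ten residues of the protein sequence listed above, and translating the final twelve bases (GCGGCGCAATAA) gives AAQ followed by a stop codon, consistent with the end of the protein.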