Gene OSTLU_43874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43874 
Symbol 
ID5006630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp380245 
End bp381555 
Gene Length1311 bp 
Protein Length436 aa 
Translation table 
GC content58% 
IMG OID640422051 
Productpredicted protein 
Protein accessionXP_001422572 
Protein GI145356716 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.949222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAGC TCACGTTGAA GCCGATGAAG AAAATTGAAG GCACGGTTCG ATTGCCGGGG 
TCTAAGTCGC TGTCAAACCG CATTCTGCTC CTCGCGGCGC TCGCAGAAGG AACGACCAAG
GTGGAAAACT TGTTGGATAG CGACGACATT CGTTACATGG TTGACGCGCT TAAAGTTCTG
GGGCTGTCCT TCACGGAAGA CAGAGAGAAC AACATTTTGG AAATCACCGG TTGTGGGGGT
AAGCTCCCGG TCGAGGGCGC GGAGTTGTTC CTCGGTAACG CCGGGACGGC GATGCGACCG
CTCACCGCCG CCGTGGCTGC GGCGGGTAAA GGCACGTTCA TCCTCGATGG TGTCGAGCGT
ATGCGTGAAA GACCAATTCA AGATCTTGTT GATGGTTTGG TTCAGCTTGG TGTCAAGGCG
GAGTGCACGA TGGGTACGGG TTGCCCGCCG GTGAAGGTTG AGGCGAACGG CTTGCCTGGC
GGTCGCGTCG AGCTCAGCGG TTCCGTGAGC TCGCAATACC TCACCGCGCT CCTCATGGCG
GCCCCACTTT GCGAAGGGTC GATTGAAATC GTAATTGTCG ATGAGCTCAT CTCCAAACCG
TACGTTGAGA TGACTATCAC ACTCATGGAA CGCTTCGGTG TCAAGGTAGA AAAGGCGGAC
GACCTCCAGA GCTTTAAGAT TCAAGGCGGA CAAAAGTACA TCTCCCCGGG CAGTGCATTT
GTTGAAGGCG ATGCTTCGTC CGCTTCCTAC TTCCTCGCCG GTGCCACCAT CACCGGTGGT
ACCGTCACTG TCATCGGTTG CGGTTCGGAG AGCATCCAAG GCGACACAAA CTTTGCGTAC
ACGATGGAGC AAATGGGCGC GACGCTCGAG TGGGGTCCGA ACTCGGTCAC CTGCACCGGT
CCCAAGGGCC CGCTCAAGGC GATTGATGTG AACATGAACG CCATGCCCGA CGCTGCGATG
ACTCTTGCCG TCGCTGCACT CTTCGCTGAC GGCATCACTA CCATTCGTGA TGTCGCGAGC
TGGCGCGTGA AGGAAACCGA GCGCATGATT GCGATTTGCA CCGAGCTGCG CAAGCTCGGT
TGCGACGTCT TCGAAGGCGC GGATTACTGC GTCATCACCC CTCCTCACAA GCTCGACCCC
CCGGCCAAGA TGAAGGCCAA CGTTGACATC GATACCTACG ACGATCACCG CATGGCCATG
GCTTTCGCCT TGGCCGCGTG CGGCGACGTC GACGTCGTCA TCAACGACCC GAAGTGCACG
AAGAAGACTT TCCCGACGTA CTTTGACGTC CTCAAGTCCG TCGCCAAGTG A
 
Protein sequence
MEQLTLKPMK KIEGTVRLPG SKSLSNRILL LAALAEGTTK VENLLDSDDI RYMVDALKVL 
GLSFTEDREN NILEITGCGG KLPVEGAELF LGNAGTAMRP LTAAVAAAGK GTFILDGVER
MRERPIQDLV DGLVQLGVKA ECTMGTGCPP VKVEANGLPG GRVELSGSVS SQYLTALLMA
APLCEGSIEI VIVDELISKP YVEMTITLME RFGVKVEKAD DLQSFKIQGG QKYISPGSAF
VEGDASSASY FLAGATITGG TVTVIGCGSE SIQGDTNFAY TMEQMGATLE WGPNSVTCTG
PKGPLKAIDV NMNAMPDAAM TLAVAALFAD GITTIRDVAS WRVKETERMI AICTELRKLG
CDVFEGADYC VITPPHKLDP PAKMKANVDI DTYDDHRMAM AFALAACGDV DVVINDPKCT
KKTFPTYFDV LKSVAK