Gene OSTLU_29930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29930 
Symbol 
ID5000271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp209414 
End bp210852 
Gene Length1439 bp 
Protein Length458 aa 
Translation table 
GC content64% 
IMG OID640415692 
Productpredicted protein 
Protein accessionXP_001416103 
Protein GI145342038 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGCACACGC GCGCGCGCGA CGCTCGAAAC CGCGCCCGAA CGCCGAGAAC ACATCGCGCG 
CGATGCGCAC CGCGCGCGCG ACGTCGAAAC CGAGCGCGCG CGCGGCGCGA TCGACGACGA
AGTCGACTCG AACGGCGACC GGGCGGACGC GCCCCGCGCG GTGTCCCGAC GCCGGCGACG
CGCGTCGAGG CGCTCGAACG ATCCCGCGCG CGGGAAGCAC GTTTGGACGG ATATTCCGCG
TCACCACGTT CGGGGAATCG CACGGTGGCG GCGTTGGGTG CGTCGTGGAT GGGGTGCCGC
CGCGACTGCG CGTCACGCGC GAGGAGCTGC AGTTCGAGCT CGACCGGCGA CGCCCGGGAC
AGAGCCGAAT CACGACCCCG CGAAACGAAG AGGACTCGTG CGAGATACTG AGCGGGGTCG
GGCTCGATGG CGTCACGCTG GGCACGCCCG TGGCGGTGCT GGTGAGGAAT AAAGACCAGC
GAAGCCAAGA CTACGGAGAA ATCGCGGTGG CGTATCGACC GTCGCACGCG GATGCGACGT
ATGATATGAA ATACGGCGTC CGAGCGATCG CCGGTGGTGG ACGAAGCAGT GCTAGAGAAA
CTATCGGACG CGTCGCGGCC GGTGCGATCG CGAAGAAGGT GCTCAAGGAG GTGGCGGGGA
CGGAAATTTT GGCGTACGTG AGCGCGGTGC GCGACGTGAA AACCACCGCG GTGAACCACG
AGACTATGAC GATGGATGAC GTTGAGTCAA ACATCGTGCG GTGCCCGGAC GAGAGTTGCG
CGCAAAAGAT GATCGATGCG ATCGATGAGG TTCGGGTGAA GGGGGACTCG TGCGGGGGCG
TGGTGACGTG CGTCGTGCGC AACCCACCGC GAGGCTTGGG TGCGCCCGCG TTCGACAAGC
TCGAAGCCGA TTTGGCTAAG GCGATGTTGA GCTTACCGGC GACGAAAGGT TTCGAAATCG
GTAGCGGTTT CGACGGCACG TTGCAAAAGG GTAGCGAGCA CAACGACGAG TTTTTCATGG
ATAGCGAAAA GGGTTTGCGT ACGCGCACGA ACCGCTCCGG CGGTATCCAG GGTGGCATCT
CCAACGGGGA GATGATCGAG ATGAAGATTG CGTTCAAACC GACGTCGACG ATCACACAGG
CGCAAAATAC GGTGAACCGC GATGGGGTGG AGACGGAGCT CAAGGCTCGC GGTCGACACG
ACCCGTGCGT GGTCCCGCGC GCGGTGCCGA TGGTGGAAGC CATGGTCGCG CTCACGCTCG
TGGATCACTT GATGCTTCAG CACGCACAAT GCAACTTGAT CGACGCTGGA GATTTGACTG
AGCTCGTTCA AGGAAACCTG CCCACTCTTT ACGACCCCGA AGCCATCGCC GCTGCGGCCG
CGGCGTCCAA GGCGCAAATG ACCACGAAGG ACATGTCTGA CGCGTTCAGC GAAGATTAA
 
Protein sequence
MRTARATSKP SARAARSTTK STRTATGRTR PARCPDAGDA RRGARTIPRA GSTFGRIFRV 
TTFGESHGGG VGCVVDGVPP RLRVTREELQ FELDRRRPGQ SRITTPRNEE DSCEILSGVG
LDGVTLGTPV AVLVRNKDQR SQDYGEIAVA YRPSHADATY DMKYGVRAIA GGGRSSARET
IGRVAAGAIA KKVLKEVAGT EILAYVSAVR DVKTTAVNHE TMTMDDVESN IVRCPDESCA
QKMIDAIDEV RVKGDSCGGV VTCVVRNPPR GLGAPAFDKL EADLAKAMLS LPATKGFEIG
SGFDGTLQKG SEHNDEFFMD SEKGLRTRTN RSGGIQGGIS NGEMIEMKIA FKPTSTITQA
QNTVNRDGVE TELKARGRHD PCVVPRAVPM VEAMVALTLV DHLMLQHAQC NLIDAGDLTE
LVQGNLPTLY DPEAIAAAAA ASKAQMTTKD MSDAFSED