Gene OSTLU_32688 details


Gene Information

Locus tag: OSTLU_32688
Symbol:
ID: 5002734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 482691
End bp: 484172
Gene Length: 1482 bp
Protein Length: 417 aa
Translation table:
GC content: 65%
IMG OID: 640418155
Product: predicted protein
Protein accession: XP_001418947
Protein GI: 145349037
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0289228
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0344017
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGGCACACGA CGGCGCGGCG CGCGACGACG CGGACGCGGT GAACGACGGG CGTCAGACGC 
GAGACGAAGA CGACGCGCGG GTGACGCGCG ACGCGCGAAC GGACGGGCGC GATGGCGACG
GCGCGAACGA TGGGCGCGGC GGTGACGACG ACGACGACGG GGGGGCGTCG CGAGGCGACG
CGCGCGACGG GCGCGGCGAC GCGAGGCGCG CGCGTCGTGC GACGCGTGAT GACGCCGCCG
ACGCAGGGGA AGATCTCGGA GCAGTCGCGA CCGGACGCGA ACGGGAGGTA CGGCGCGTAC
GGGGGGAAGT ACGTGCCGGA GACGCTGATC CCGGCGCTGC GGGCGCTGGA GAAGGAGTAC
GAGGCGATCA AGACGGACCC GGCGTTCCAG GCGGAGCTGA AGGATATTCT GAAGGATTAC
GTCGGACGCG AGAATCCGCT GTACTACGCC GAAAGGTTGA GCGAACACTT CAAGGACGCG
AACGGGGAAG GGCCGGACGT GTACCTGAAG CGCGAAGACC TGAACCACAC GGGGGCGCAC
AAGATCAACA ACGCGGTCGG GCAAGCGCTG TTGGCGAAGC GAATGGGGAA GAAGCGCATC
ATCGCTGAGA CCGGGGCGGG ACAACACGGC GTGGCGACGG CGACGGTGTG CGCGCGATTC
GGGTTGGAGT GTATCATTTA CATGGGCGCG GCGGATATGG AGCGACAAAA GCTCAACGTG
TTCCGCATGC GTTTGCTCGG CGCCACGGTT CGACCGGTGC GCGCGGGCAC GGCCACGCTC
AAGGATGCGA CGTCTGAGGC TATTCGTGAC TGGGTGACGA ACGTTGAGGA CACGCACTAC
ATCCTCGGCT CGGTCGCGGG CCCGCACCCG TATCCGATGA TGGTGCGCGA CTTCCACGCC
GTCATCGGTC AAGAGACTAG AAGACAAGCC ATGGAGAAAT GGGGCGGTTT GCCGGACATC
CTCGTCGCGT GCGTTGGCGG TGGCTCCAAC GCCATGGGTC TGTTCCACGA GTTCATCGAC
GACGAATCCG TGCGCATCAT CGGCGTCGAA GCCGGCGGCG AAGGCATCGA GCCGGGCCAA
AAGCACGCCG CGACGCTCAC CTTGGGCACC CCGGGCGTGC TTCACGGCTC GTTCTCGTAC
TTGATTCAAG ATGAAGAGGG TCAAATCGTT GAGCCGCACT CCATCTCCGC CGGTCTCGAT
TACCCGGGCA TCGGTCCGGA GCACGCCTTC TTGAAGGATT TCGGTCGCGC CGAGTACCAC
GCCATCACCG ACAAGGAAGC GCTCGACGCT TTCGTCGCCA CCTCTCGTCT CGAGGGTATC
ATCCCTGCCC TTGAAACGTC CCACGCCTTG GCGTACTTGT GGAAGCTCTG CCCTGGTCTC
CCCAACGGCA CCAAGGTTGT CCTCAACTGC AGCGGCCGCG GCGACAAGGA CGTCAACACC
GCCGCCAAGT TTTTGGACAT CAGCGGTGAG GTCGACGGGT GA
 
Protein sequence
MTPPTQGKIS EQSRPDANGR YGAYGGKYVP ETLIPALRAL EKEYEAIKTD PAFQAELKDI 
LKDYVGRENP LYYAERLSEH FKDANGEGPD VYLKREDLNH TGAHKINNAV GQALLAKRMG
KKRIIAETGA GQHGVATATV CARFGLECII YMGAADMERQ KLNVFRMRLL GATVRPVRAG
TATLKDATSE AIRDWVTNVE DTHYILGSVA GPHPYPMMVR DFHAVIGQET RRQAMEKWGG
LPDILVACVG GGSNAMGLFH EFIDDESVRI IGVEAGGEGI EPGQKHAATL TLGTPGVLHG
SFSYLIQDEE GQIVEPHSIS AGLDYPGIGP EHAFLKDFGR AEYHAITDKE ALDAFVATSR
LEGIIPALET SHALAYLWKL CPGLPNGTKV VLNCSGRGDK DVNTAAKFLD ISGEVDG