Gene OSTLU_23942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_23942 
Symbol 
ID5000000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp839641 
End bp840771 
Gene Length1131 bp 
Protein Length346 aa 
Translation table 
GC content69% 
IMG OID640415421 
Productpredicted protein 
Protein accessionXP_001415617 
Protein GI145341026 
COG category[R] General function prediction only 
COG ID[COG0456] Acetyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0967488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GATCGACCGC GCGACGCGCG CGATGGCGTC GCGCGCGGTG TCGATCGACG CGCGCGCGCG 
CGGCGACGCG ATGGCGTCGC GCGCGACGCG CGGCGCGACG CGGGACGGCG CGCGACGCGC
GACGCGCGGA ACGCAGACGA CGACGACGCG ACGACGCGCG ATGACGACGA CGCGATGCGA
TGGGCTCGCG CGGACGGGAC GCGGCGGCGC GACGAGGGAC GCGTCGACGT CGACGATGAT
GGGCGCGGGC GCGGGCGCGC GGGGAGGGGT GGCGCGACGC GCGCTGGCGG ATCGACCGAC
GCGAGAGGCG GCGGAGGACG AGGACGCGCG CGCGACGGGC GCGAGCCGGG TGAAGTCGAG
GGGGGGGGTG GACGTCGTCG TCGCGACGAA TGACTTCGCG TTTGAGATCG CGGCGAATCT
CAGGGCGACG GCGTTTTACG ATGATCTCGC GGAGAGGCAC GAGATGCCGT TTCCGCCGCG
GTTCACGCCG ACGTTTCATC GGGAGTTCGC GCAGCGCGAA CGCAAGGCGC TGCGGGAGCG
GACGACGAGA CGCGTGGGGC CGGCGCTGGA GTCGCGATGT TTCATGGCGG ATTGCGAAGG
ATTGGGGTTA GTGGGGTGTT TGGACGTCAG CGTGCGCGAG GGGCCGTGCG CGAGTCAGAT
CAACGGCGTG TGCGTGCCGG AGGGGGCGTC GTACGCGTAC GTGGACAACG TGGCGGTGGA
CGCCGCGGCT CGTCGACGAG GGTCGGCGAA GCTCATGATG GAGTGCGCGA GCGACTGGGT
CGAAGAGCGT GGAATCACGG AAATCTGGAC GCACGTGCAC TGCGATAACG TGGGCGCGCG
AAGATTGTAC CACGCGTACG GTTTCCGGGC GCCCAGCGGC TCGCATCCGG AACAAGGCTT
GCCGAATTAC TTCAACGGCG AGCGATTGAA GGGCCTAATC TTAATGCGAG CCCCTGTGCC
GCTGGTGTAC GAGGCGCGCG TTGACGCGGT TTGCGGATGC GGCGCGTGCT TCGCGCGCGT
GGACGAGTGC ATCTGCATCA AACCCGCGGT CGCCGCGCGT TAACGCCGCC TCGAGTCGAG
ACTACTTAAA CTTTGTTGTA CTTTTCCTGC CCGCCGTACT ATATGCGCGC T
 
Protein sequence
MASRAVSIDA RARGDAMASR ATRGATRDGA RRATRGTQTT TTRRRAMTTT RCDGLARTGR 
GGATRDASTS TMMGAGAGAR GGVARRALAD RPTREAAEDE DARATGASRV KSRGGVDVVV
ATNDFAFEIA ANLRATAFYD DLAERHEMPF PPRFTPTFHR EFAQRERKAL RERTTRRVGP
ALESRCFMAD CEGLGLVGCL DVSVREGPCA SQINGVCVPE GASYAYVDNV AVDAAARRRG
SAKLMMECAS DWVEERGITE IWTHVHCDNV GARRLYHAYG FRAPSGSHPE QGLPNYFNGE
RLKGLILMRA PVPLVYEARV DAVCGCGACF ARVDECICIK PAVAAR