Gene OSTLU_17574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17574 
Symbol 
ID5004657 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp233558 
End bp234658 
Gene Length1101 bp 
Protein Length323 aa 
Translation table 
GC content55% 
IMG OID640420078 
Productpredicted protein 
Protein accessionXP_001420626 
Protein GI145352595 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.769732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.690692 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCG CGGTGACGCG CGAGGACGCG CGCGAGAAAG TCCCCACGGT GTCGCTCGAA 
CTGTACGCGA AAGATTTCGA AAAGTTTGTC CACGACCTGG GAGAGGCGTA CGAGCGATAC
GGATTCGTCA TCTTGGAAAA TCATGGGATC GATCTGGAAC TCATCGAACG CGCGATGGAA
CGCTCGAAGG CGTTCTTTGC GTGCGATGAG GAATGTAAGA AGAAGTACAC GCTGCGCGGG
TTGGGCGGCG CGCGCGGGTA CACGGCGTTC GGAGTGGAGA CGGCGAAGGG AGCGAACGCG
CCTGGTGCGT GAAGGCGATG CGAAAAGTGA TACTTTTTTG TTTTGATGCG TCGCGCTATA
GCGCGTGTGA TGGATTCGAC GACGCGATTC GTTTGACTGA CGCGAAGGAT AACGTCGCCT
CTCGCGCGCG CAGATTTGAA AGAGTTTTGG CACCTCGGGC GAGACTTGCC GGCGGGCCAT
AAGTACGAGT CGACGATGCC GCCGAATGTG GATGACGTGG TGGAAGTCGA AGGGTTTAGC
GAAGTGAATA AGCAAATCTT CAACGCTCTA GATGAGCTCG GGAATTGCGT GCTCGAGGCG
CTGGCGGTGC ATCTGGAACA ACCGCGCGAG TATTTCGCAG ATAAGACGAA CGAAGGCAAC
AGTATTTTGC GTATCATTCA CTATCCACCA ATTCCGCCTG ATGCTGCGGG GCGCGTGCGC
GCTGGTGCGC ACGAGGATAT CAATCTCATC ACGCTCCTGC TCGGCGCCGA CGAGGGTGGT
TTGCAACTCT TGGGAAAGGA TGGAGAGTGG CTCGAAGTCA ACCCGCCTCC CGGATGCGTG
ACGTGCAACA TCGGTGACAT GCTGCAACGC CTGTCGAATC ACAAGCTTCC TTCGACGACG
CACCGTGTCG TGAATCCTCC CGCGGAACGG GCGCACATTC CTCGATATTC CATGCCATTC
TTTTTACATC CAAATCCAGA CTTCGTCATC AAAACGCTGG AGTCGTGCAT TTCGGATCAG
TATCCCAATC GCTATCCAGA GCCCATTAGC TCTGATGAAT ACTTGCAGCA ACGCCTTCGA
GAGATTAAAC TCAAATCATG A
 
Protein sequence
MTPAVTREDA REKVPTVSLE LYAKDFEKFV HDLGEAYERY GFVILENHGI DLELIERAME 
RSKAFFACDE ECKKKYTLRG LGGARGYTAF GVETAKGANA PDLKEFWHLG RDLPAGHKYE
STMPPNVDDV VEVEGFSEVN KQIFNALDEL GNCVLEALAV HLEQPREYFA DKTNEGNSIL
RIIHYPPIPP DAAGRVRAGA HEDINLITLL LGADEGGLQL LGKDGEWLEV NPPPGCVTCN
IGDMLQRLSN HKLPSTTHRV VNPPAERAHI PRYSMPFFLH PNPDFVIKTL ESCISDQYPN
RYPEPISSDE YLQQRLREIK LKS