Gene OSTLU_39543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39543 
Symbol 
ID4999924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp550217 
End bp551284 
Gene Length1068 bp 
Protein Length355 aa 
Translation table 
GC content55% 
IMG OID640415345 
Productpredicted protein 
Protein accessionXP_001415530 
Protein GI145340849 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.159789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCGG TTGCCAGCCA GCTCCCGGTG CTGTCGCTGC CGCGAGAAGA CGCCGACGAA 
AACTCACACC GCGCGTTCGC CGCGGAGCTG TGTCGAACGT GTCACGACGT TGGATTCTTT
TATTTATCGA ATCACGGCGT CGAGCAGGCG TGCGACAACG TTCTCGACGC GTCGAGGGCG
TTCTTCGCGT TGCCACTGGC TGAGAAGAAA GAGATAGACT ACACGCGAAG TCGAGCGTTT
CGAGGATACA TGTACGACGG CGCCGAAAAC ACCGCTGGAA AGCCAGATCG AAGAGAGCAG
ATTGAATTCG GTGTCGAGTG CGCCGAAACG TGCGCCACCG AAGGGCCATA TTACGAGCGA
TTAAAAGGTC CAAATCAGTG GCCGGCTCAG GTGCCGCTTC GCGCACCAGT GGAAGATTTC
CAAAACAAAA TGGCGACGTT GAGTCGAAGA ATCATGACTT ATTTGGCGAT CGGACTAGAC
CTCGACGGCG GTTATTTCGA TTCCATGTTC GGCGACGAAC CGAACGTGCA GATGAAAATT
TGCCGGTATC CGCCGAGCGA CGGATCCGTC GGCGAACACA GCGACACTGG AATTCTTTCT
TTTGTCGTCC AAGACTCGGT CGGTGGTTTG CAAGTGCAGC TTCACGATAG CGGTGAATGG
ATTGATGCGC CGCCGATCGA CGGAACGCTC GTCGTCAACT TGGGGGAGAT GATTCAACTA
ATCACCGGCG GCTACTTTCT TGCCACGCCG CATCGCGTTC AAAACCTGAA CGGCAGCCAG
GCGCGATACT CGGTGCCTTA CTTTTGGAAT CCGGAGCTAG ACTATCGAGT CGAATTTATA
GAGTCCTCCA TATTGGACAG TTTGGTTTGG CATCGGCCGC GACCGTCGGA CGATTCCTTC
AGAGCGACGG GATCGCACGG CGGCGCGAAT CGACTCATCT CAGCGTACGG CGAAAACGCG
TTCAAATCGT TCGCACGGTC TCATCCCGTA GTCATGCGAA GGCATCATTC AGACTTAGTG
ATGAACCCCG ACGGTAGTAT AACTAGTCCT AGCAGCTCAA TATTATAA
 
Protein sequence
MASVASQLPV LSLPREDADE NSHRAFAAEL CRTCHDVGFF YLSNHGVEQA CDNVLDASRA 
FFALPLAEKK EIDYTRSRAF RGYMYDGAEN TAGKPDRREQ IEFGVECAET CATEGPYYER
LKGPNQWPAQ VPLRAPVEDF QNKMATLSRR IMTYLAIGLD LDGGYFDSMF GDEPNVQMKI
CRYPPSDGSV GEHSDTGILS FVVQDSVGGL QVQLHDSGEW IDAPPIDGTL VVNLGEMIQL
ITGGYFLATP HRVQNLNGSQ ARYSVPYFWN PELDYRVEFI ESSILDSLVW HRPRPSDDSF
RATGSHGGAN RLISAYGENA FKSFARSHPV VMRRHHSDLV MNPDGSITSP SSSIL