Gene OSTLU_29289 details

Gene Information

Locus tag: OSTLU_29289
Symbol:
ID: 4999741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 1138973
End bp: 1141076
Gene Length: 2104 bp
Protein Length: 584 aa
Translation table:
GC content: 58%
IMG OID: 640415162
Product: predicted protein
Protein accession: XP_001416038
Protein GI: 145341895
COG category: [R] General function prediction only
COG ID: [COG3491] Isopenicillin N synthase and related dioxygenases
TIGRFAM ID:
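
The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
print(gene_length(1138973, 1141076))  # 2104
```

Under that assumption the result matches the 2104 bp reported in the record.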


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTGAGCGAC GACTCTCCTC GTTCGATCGC TCGTCGCGTC ATGCACGCGA GCGACAGCGT 
GCCGCGAGTA TCTCTCATCA AGCTCGCGGA CGGCGACGAC GTCGAAATCG CGCGTCTGGG
TGAAGCGCTG ACGTCGAGCG AGCGTGTCGG TTACTTCGAG CTCGACGCTC GAGACGTCGG
CGAGGCGTCA ACGTCGCGCG TCGCGCCGTT CGACGTCGCG CGCACGTACG ACATCGCCCG
AACGTTCTTC TCGCTGCCAG AAGACGTGAA GGCGCTGTAC GTACACTCGC AATACGCGAA
CGAGAGCGGT GGTTTCGTGC CCCTGCTGGA AGAGTACTCT TACCAAAAGA AAACAGCCGC
GCTCGTCGAG TCGTTTGATG TCGTTCGAGA GTTGAGCTCA TGTGAGATTG AGCAAGTGCG
AGACGAACGA GGTGACGACG CGGCACGAGG ACTTGGACCT ATGGATTGGC CGGTGGAGGT
GCCCGCGATG CAAAGTGCGT TTTGCAGTTT TTACAGCGCG TGCGATGGCG CGGCGAGGAC
GCTGTATCGA TGCTTTGCGA AGGCGCTTCA CGTCGACGAC GAAGACGTCT GGGTAAAGAA
ATTTGGCAAT ACGTCACACT GCAGCATGCG CGCAATGCGA TATCCATCGA TGAAAGTGGA
TGACGAAGCG AACGAGGAGG ACTCGACTAC GCGACGCAGC GAACGAATCG CCGCGTCGAA
AGTGGAAATA GTTGGTATCT CGGAGCACAC AGACTTTGAA TTCTTCACAC TTCTTCATCA
GACGTGCGAA GGATTAGAGC TCCAAGGGCG AGATGGTGCG TGGCGCTCGG CGCCGGCGTA
CGAGAATGAG GCCATTTTTA CGTGCATTCT GTCGGACGCT TTCGAGATTT TTACAAACGG
AGTCGTACGC GCGACGCCGC ATCGCGTTCG ACCATCTCGC GATGGACGAG ATCGATTATC
GCTCGTGAGA TTCAACGGTC TCAACGACGA CGCCGTCATT GCGCCACTTC CACAGTTTGT
CACTCCGCAT CGTCCGTTGA ATGCAGCGTA CGAACCGCGG ACACAAGGCG ATCACGTCGG
ACAAAACGTC ACTCGCGCCA GCGACAACCT CGCCGATATG ATCGACAAGC AGGTGTACCC
GAAATCAGAG CTTACGAGAC CGCCAAAGCG TTTTGCTCAG CTTCTCGTTT TGGACGTCGC
CAACGGTCGC ATCCTGCTGG GGAAGCACAC GCGCGGCGAA TTCGCCGGCC GCTACACCGG
CTTCATCGCC GAAGTCGACA GCGAGAAGGA TTTAGTCCCA CTGGACGTCG CTCGGTCCGT
CGCACTAGAG GAAGCAGGTC TCAACCCCTT GGCGTGCGAT GCTTTGAACG ATCCGAAAGA
TCTTTTTGAA GCCGCGCGCT TCGTATTTCG CGGTTGGATG CCAGATGGTG GTCTCGCCGT
CGAGCACGAA TTTGTGTGCG CGTTTCGCGA CGGCACGAGC GTCGCGAAGT TGTTTCCAAC
TCACGCGCGC GCGTCAGCTG ACATCATTCC GACGTGGTTT CAACAACAAG AAATACCGTA
TGCAGATATG CCTGAGGACG ACGCGATATG GTATCCCATC GTGTTGGGAC GCTTTTCGAA
ACACGACGGT GTCGACGAAT CGCTCGTCAT TGGGCATTTC GATTTCTCAG GCGACGAAGG
CGAGCTCACC GACCATGCCG TACACGAAGT CGAGTTTCGA CACAGCTCAT TCAATCGTTC
GTCCACCGCG CGCGTTCTCG CACGTCTAGA ACGTCTAGAA GGTAGAAGCG TGTAAGCGTG
TAGAAGACGA TGATGGCACC CTCGAGATGA TTCGCGCGAC GAGCTCGACG CCGACGCCGA
CGCGAGGTCG TGTGTTGCGC GAGTGCATAG TCAGCGGTCG CGAGTGGAAA GCGCTACAGC
GCGTATGAAG ATGGCGACAT CGGCTGGATC GTCGCGGCAT GGCTCGAGCG ATGAGGAGTC
CGACGAAGCG CCGCGCGCTT ACGGAAACGG CTCACGCGCG TACGAACGCG AGTTCGAAGC
GTCTTCAGAC GACGAAAGCT TAGATTACTC GCGCCTTTAG AATCGAGTAT TTAGACTCGC
GCCT
 
Protein sequence
MHASDSVPRV SLIKLADGDD VEIARLGEAL TSSERVGYFE LDARDVGEAS TSRVAPFDVA 
RTYDIARTFF SLPEDVKALY VHSQYANESG GFVPLLEEYS YQKKTAALVE SFDVVRELSS
CEIEQVRDER GDDAARGLGP MDWPVEVPAM QSAFCSFYSA CDGAARTLYR CFAKALHVDD
EDVWVKKFGN TSHCSMRAMR YPSMKVDDEA NEEDSTTRRS ERIAASKVEI VGISEHTDFE
FFTLLHQTCE GLELQGRDGA WRSAPAYENE AIFTCILSDA FEIFTNGVVR ATPHRVRPSR
DGRDRLSLVR FNGLNDDAVI APLPQFVTPH RPLNAAYEPR TQGDHVGQNV TRASDNLADM
IDKQVYPKSE LTRPPKRFAQ LLVLDVANGR ILLGKHTRGE FAGRYTGFIA EVDSEKDLVP
LDVARSVALE EAGLNPLACD ALNDPKDLFE AARFVFRGWM PDGGLAVEHE FVCAFRDGTS
VAKLFPTHAR ASADIIPTWF QQQEIPYADM PEDDAIWYPI VLGRFSKHDG VDESLVIGHF
DFSGDEGELT DHAVHEVEFR HSSFNRSSTA RVLARLERLE GRSV
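
The sequences above are laid out in space-separated blocks of 10 characters. A minimal sketch of how such a block could be joined into one string and its GC fraction computed is shown below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and newlines alike.
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among all bases; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record.
first_line = "CGTGAGCGAC GACTCTCCTC GTTCGATCGC TCGTCGCGTC ATGCACGCGA GCGACAGCGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full gene sequence, the same two helpers give a length and GC percentage that can be compared with the Gene Length and GC content fields in the Gene Information section.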