Gene Information |
Locus tag | OSTLU_29289 |
Symbol | |
ID | 4999741 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 1138973 |
End bp | 1141076 |
Gene Length | 2104 bp |
Protein Length | 584 aa |
Translation table | |
GC content | 58% |
IMG OID | 640415162 |
Product | predicted protein |
Protein accession | XP_001416038 |
Protein GI | 145341895 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTGAGCGAC GACTCTCCTC GTTCGATCGC TCGTCGCGTC ATGCACGCGA GCGACAGCGT GCCGCGAGTA TCTCTCATCA AGCTCGCGGA CGGCGACGAC GTCGAAATCG CGCGTCTGGG TGAAGCGCTG ACGTCGAGCG AGCGTGTCGG TTACTTCGAG CTCGACGCTC GAGACGTCGG CGAGGCGTCA ACGTCGCGCG TCGCGCCGTT CGACGTCGCG CGCACGTACG ACATCGCCCG AACGTTCTTC TCGCTGCCAG AAGACGTGAA GGCGCTGTAC GTACACTCGC AATACGCGAA CGAGAGCGGT GGTTTCGTGC CCCTGCTGGA AGAGTACTCT TACCAAAAGA AAACAGCCGC GCTCGTCGAG TCGTTTGATG TCGTTCGAGA GTTGAGCTCA TGTGAGATTG AGCAAGTGCG AGACGAACGA GGTGACGACG CGGCACGAGG ACTTGGACCT ATGGATTGGC CGGTGGAGGT GCCCGCGATG CAAAGTGCGT TTTGCAGTTT TTACAGCGCG TGCGATGGCG CGGCGAGGAC GCTGTATCGA TGCTTTGCGA AGGCGCTTCA CGTCGACGAC GAAGACGTCT GGGTAAAGAA ATTTGGCAAT ACGTCACACT GCAGCATGCG CGCAATGCGA TATCCATCGA TGAAAGTGGA TGACGAAGCG AACGAGGAGG ACTCGACTAC GCGACGCAGC GAACGAATCG CCGCGTCGAA AGTGGAAATA GTTGGTATCT CGGAGCACAC AGACTTTGAA TTCTTCACAC TTCTTCATCA GACGTGCGAA GGATTAGAGC TCCAAGGGCG AGATGGTGCG TGGCGCTCGG CGCCGGCGTA CGAGAATGAG GCCATTTTTA CGTGCATTCT GTCGGACGCT TTCGAGATTT TTACAAACGG AGTCGTACGC GCGACGCCGC ATCGCGTTCG ACCATCTCGC GATGGACGAG ATCGATTATC GCTCGTGAGA TTCAACGGTC TCAACGACGA CGCCGTCATT GCGCCACTTC CACAGTTTGT CACTCCGCAT CGTCCGTTGA ATGCAGCGTA CGAACCGCGG ACACAAGGCG ATCACGTCGG ACAAAACGTC ACTCGCGCCA GCGACAACCT CGCCGATATG ATCGACAAGC AGGTGTACCC GAAATCAGAG CTTACGAGAC CGCCAAAGCG TTTTGCTCAG CTTCTCGTTT TGGACGTCGC CAACGGTCGC ATCCTGCTGG GGAAGCACAC GCGCGGCGAA TTCGCCGGCC GCTACACCGG CTTCATCGCC GAAGTCGACA GCGAGAAGGA TTTAGTCCCA CTGGACGTCG CTCGGTCCGT CGCACTAGAG GAAGCAGGTC TCAACCCCTT GGCGTGCGAT GCTTTGAACG ATCCGAAAGA TCTTTTTGAA GCCGCGCGCT TCGTATTTCG CGGTTGGATG CCAGATGGTG GTCTCGCCGT CGAGCACGAA TTTGTGTGCG CGTTTCGCGA CGGCACGAGC GTCGCGAAGT TGTTTCCAAC TCACGCGCGC GCGTCAGCTG ACATCATTCC GACGTGGTTT CAACAACAAG AAATACCGTA TGCAGATATG CCTGAGGACG ACGCGATATG GTATCCCATC GTGTTGGGAC GCTTTTCGAA ACACGACGGT GTCGACGAAT CGCTCGTCAT TGGGCATTTC GATTTCTCAG GCGACGAAGG CGAGCTCACC GACCATGCCG TACACGAAGT CGAGTTTCGA CACAGCTCAT TCAATCGTTC GTCCACCGCG CGCGTTCTCG CACGTCTAGA ACGTCTAGAA GGTAGAAGCG TGTAAGCGTG 
TAGAAGACGA TGATGGCACC CTCGAGATGA TTCGCGCGAC GAGCTCGACG CCGACGCCGA CGCGAGGTCG TGTGTTGCGC GAGTGCATAG TCAGCGGTCG CGAGTGGAAA GCGCTACAGC GCGTATGAAG ATGGCGACAT CGGCTGGATC GTCGCGGCAT GGCTCGAGCG ATGAGGAGTC CGACGAAGCG CCGCGCGCTT ACGGAAACGG CTCACGCGCG TACGAACGCG AGTTCGAAGC GTCTTCAGAC GACGAAAGCT TAGATTACTC GCGCCTTTAG AATCGAGTAT TTAGACTCGC GCCT
|
Protein sequence | MHASDSVPRV SLIKLADGDD VEIARLGEAL TSSERVGYFE LDARDVGEAS TSRVAPFDVA RTYDIARTFF SLPEDVKALY VHSQYANESG GFVPLLEEYS YQKKTAALVE SFDVVRELSS CEIEQVRDER GDDAARGLGP MDWPVEVPAM QSAFCSFYSA CDGAARTLYR CFAKALHVDD EDVWVKKFGN TSHCSMRAMR YPSMKVDDEA NEEDSTTRRS ERIAASKVEI VGISEHTDFE FFTLLHQTCE GLELQGRDGA WRSAPAYENE AIFTCILSDA FEIFTNGVVR ATPHRVRPSR DGRDRLSLVR FNGLNDDAVI APLPQFVTPH RPLNAAYEPR TQGDHVGQNV TRASDNLADM IDKQVYPKSE LTRPPKRFAQ LLVLDVANGR ILLGKHTRGE FAGRYTGFIA EVDSEKDLVP LDVARSVALE EAGLNPLACD ALNDPKDLFE AARFVFRGWM PDGGLAVEHE FVCAFRDGTS VAKLFPTHAR ASADIIPTWF QQQEIPYADM PEDDAIWYPI VLGRFSKHDG VDESLVIGHF DFSGDEGELT DHAVHEVEFR HSSFNRSSTA RVLARLERLE GRSV