Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36617 |
Symbol | |
ID | 5006958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 113153 |
End bp | 115807 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | |
GC content | 59% |
IMG OID | 640422379 |
Product | predicted protein |
Protein accession | XP_001422810 |
Protein GI | 145357202 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.000615494 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0024222 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGACGG CGACGGAGAC GCCGACGGAG GCGCCGAAGA CGATATATCT GAAGGATTAC GAGCGACCGG CGTACGCGTT CGAAAGGGTG AATTTGGACT TTGAGCTCGG GGAGGCGACG ACGACGGTGA CGTCGACGAT TCGAGTTCGA CCGGCGAACG ATGCGAATGG GAAATCGTTG TTCCTGAACG GCGATGAATC GGTGGAGTTG GCGGCGATCG AGGTGGACGG CGCGAAATTC ACGACGTACG AACGGACGGG GAAGGGGATC ACGCTGCGCG CGCTGCCGAC GGAGGCGTTT GATTTGCGAG TGACGACGAC GATTAAGCCG CAAGAGAACA CCGCGCTCGA GGGACTTTAT AAGTCGTCTG GGAACTTTTG CACGCAATGC GAGGCGGAAG GTTTCCGACG AATCACGTTT TATCAAGATA GACCGGATGT GATGTCGATA TTCACGACGC GCATCACGGC GGATAAGACG AAGTATCCGG TGCTGCTCGG CAACGGAAAC TTGGTGGATT CTGGAGATTT GGAGGGTGGT AAGCACTTCA CCGTGTGGGA AGATCCGTGG GCAAAACCGT GTTACCTTTT CGCGCTCGTC GCGGGTGATC TCGGGATGGT GGAGGATAAA TTCAAGACGA TGACGGGCAA AGAAGTGACG CTTCGCATCT TCACCGAGAC GCACAACTTG GACAAGTGCG CGCACGCGAT GACGAGCTTG ATCAAATCCA TGAAATGGGA CGAAGACACG TATGGTTTGG AGTACGACTT GGAGCTTTTC AACATCGTCG CCGTGGACGA TTTCAACATG GGCGCCATGG AGAACAAGTC GCTGAACATT TTCAACTCGC GTCTCGTGTT GGCGACGCCG CAGAGCGCCA CGGATGCCGA TTACGCCGCC ATTGAAGGTG TCGTGGCGCA CGAATACTTT CACAACTACA CTGGAAACCG TGTGACGTGT CGTGATTGGT TCCAACTGTC CCTCAAGGAA GGGCTGACTG TGTACAGAGA CCAAGAGTTC AGCGCGGATA TGAACTCGCG AGGCGTCAAG CGCATCGGCG ACGTCTCGCG CCTTCGCATG GCGCAATTCG CCCAAGACGC GGGTCCGATG GCGCACCCGA TTCGCCCGGA ATCGTACATT AAGATGGATA ACTTTTACAC CGTGACAGTT TACGAAAAGG GAGCGGAAGT CGTGCGCATG TACGAGACGC TCCTCGGCAA GGATGGGTTC CGCAAGGGGA TGGATTTGTA CTTTGAACGT CACGACGGTC AAGCGGTGAC GACGGAGGAT TTCTTCGCCG CCATGTGCGA CGCCAACGGT GCGGACTTGT CCACGTTCAA GCCCTGGTAC TCCCAAGCGG GTACGCCGCG CGTCACCGCG AACGGGTCTT ACGACGCCGC CGCGAAGACG TTCACCCTCG AATGCTCGCA AGTGGTTCCG AAAACGCCCG GTCAAGACTC CAAGGTTCCG GTCTTGTGCC CGATCGCCGT TGGTCTCGTT GGTCCCGACG GTGCAGACAT GAACCTCACG ATCGACGGCA AATCCCACGG CACGACGGCG GTGCTTCGTT TCGATCAAGC CTCGGCGACG TACACTTTCA CCGGCGTCGA CGCCAAGCCC GTGCCGAGCA TCTTGCGCAA CTTCAGCGCG CCCGTGCGTT TGACGACCAA CTTGACGCAA GACGACTTGT TGTTCCTCAT GGCGAACGAC TCGGACGCGT TCAACCGATG GGAGGCTGGG CAGACGCTGC TCAGAAACCT CTGCCTGGAT CTGATCAAGG GCGGCGAGCA GTCATTCAAG ATGAACGACG CCATCACGGC GGCGATGCGC ACGATTCTTT CGGGCGCCAA GGCTGCCGAC GCGGACAAGG CGTTCATCGC GCGCGCCATG ATGGTGCCTT CCGAGGGCGA GCTGAGCGAC ATGCTCGAAG AGGGCACGGT GGATCCCGCC GCCGTGCACG CCGCTCGCGA CTTTGTCATG AAGACGCTCG CCACGGAGCT TCGCGCTGAG TTGGAAGCCA CGGCGCAAGC GAACAGCGCC GCGGTGTATT CGAACGAACC CGCCGATCGC GCCGCGCGAT CGCTGAAAAA CGCGTGCATC GGATATTTGT CGTATTTGGA CGCGCCGGAA ATCGCCGCGA TGACGTACGA GCGCTACGTC GCCGCGGACA ACATGACGGA TAAGATTGCC GCCCTGAGCG CGCTCAGCGG CAAAGACTGC GACGAACGCA TCAAAGCCAT CGATGCGTTT TACGCCGAGT GGTCGCACGA CCCGCTCGTC ATGAACAAAT GGCTCAGCAT CCAAGCCGCG TCGTCGCTCC CGAACAACCT CGCCAACGTT CGCGCGCTCG CCGCCGGCTC CGCCTTCGAC ATCAAGAACC CCAACAAAGT GTACTCCCTC ATCGGTGGTT TCTGCGCCTC TCCCACCAAC TTTCACGCCA TCGACGGCTC CGGTTACGAA TTCCTCGCCG ACATCGTCCT CGAGCTCGAC GATCTCAACG GCCAAGTCGC CTCTCGCATG GTGTCCGCGT TTACGCGTTG GCGCAAATTC GAGCCGACGC GCGCGTCGGC GATGAAGGCG CAGCTCGAGC GCATCGCCGC CAAGACGGGT CTGAGCGAAA ACGTCTTCGA GATCGTCTCC AAGTCGCTCG AGTGA
|
Protein sequence | MTTATETPTE APKTIYLKDY ERPAYAFERV NLDFELGEAT TTVTSTIRVR PANDANGKSL FLNGDESVEL AAIEVDGAKF TTYERTGKGI TLRALPTEAF DLRVTTTIKP QENTALEGLY KSSGNFCTQC EAEGFRRITF YQDRPDVMSI FTTRITADKT KYPVLLGNGN LVDSGDLEGG KHFTVWEDPW AKPCYLFALV AGDLGMVEDK FKTMTGKEVT LRIFTETHNL DKCAHAMTSL IKSMKWDEDT YGLEYDLELF NIVAVDDFNM GAMENKSLNI FNSRLVLATP QSATDADYAA IEGVVAHEYF HNYTGNRVTC RDWFQLSLKE GLTVYRDQEF SADMNSRGVK RIGDVSRLRM AQFAQDAGPM AHPIRPESYI KMDNFYTVTV YEKGAEVVRM YETLLGKDGF RKGMDLYFER HDGQAVTTED FFAAMCDANG ADLSTFKPWY SQAGTPRVTA NGSYDAAAKT FTLECSQVVP KTPGQDSKVP VLCPIAVGLV GPDGADMNLT IDGKSHGTTA VLRFDQASAT YTFTGVDAKP VPSILRNFSA PVRLTTNLTQ DDLLFLMAND SDAFNRWEAG QTLLRNLCLD LIKGGEQSFK MNDAITAAMR TILSGAKAAD ADKAFIARAM MVPSEGELSD MLEEGTVDPA AVHAARDFVM KTLATELRAE LEATAQANSA AVYSNEPADR AARSLKNACI GYLSYLDAPE IAAMTYERYV AADNMTDKIA ALSALSGKDC DERIKAIDAF YAEWSHDPLV MNKWLSIQAA SSLPNNLANV RALAAGSAFD IKNPNKVYSL IGGFCASPTN FHAIDGSGYE FLADIVLELD DLNGQVASRM VSAFTRWRKF EPTRASAMKA QLERIAAKTG LSENVFEIVS KSLE
|
| |