Gene Information |
Locus tag | OSTLU_26975 |
Symbol | |
ID | 5004736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 527524 |
End bp | 529571 |
Gene Length | 2048 bp |
Protein Length | 624 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420157 |
Product | possible metalloendopeptidase |
Protein accession | XP_001420876 |
Protein GI | 145353119 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | |
|
|
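The coordinate fields above can be cross-checked against the reported lengths. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive coordinates; the helper names `span_length` and `min_cds_length` are hypothetical and not part of the record.

```python
# Sanity check of the coordinate and length fields in the record above.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def span_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by an inclusive coordinate range."""
    return end_bp - start_bp + 1

def min_cds_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)

gene_len = span_length(527524, 529571)   # Start bp, End bp from the record
cds_len = min_cds_length(624)            # Protein Length from the record

print(gene_len)             # 2048, matching the "Gene Length" field
print(cds_len)              # 1875
print(cds_len <= gene_len)  # True: the coding region fits inside the gene span
```

Under that assumption, the span works out to the listed 2048 bp, and the 1875 bp needed for a 624 aa protein plus stop codon fits within it.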
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCTCGTGTCG CATGCGCGTC ATCGCTCGAG TGATCACGCA TCGCGGATCG GCGTTGAACT CGACCTCGAC GCGCTCGGCC TCGACGCTCG CTCGACGCGC TCGCGAGCGC GCCGTCGTCG GAAACGTCGC CGCGAGAGGA CGAGGACGTC ACGTCATGGC GAACTCGGCG ACGACGACGC CGGCGTCCTT CATCGACGCT TTCAACGATG AATACCTGGC CAAGCACAAG ACGTTCGAGG ATAACTTTTG GGCGACGAAG ATGAACCTGC GAGGGAACGA CGTGGAGGCG CTGACGAAGT CGTTCAACGC GCTGGAGACG TTCATGGGCG ACGCTGAGAC GTTGGCGAAG ACGCGCGAGT TGTTGGCGTC CAAAGACGTC ACGGATGACC AAAGGATTGT GCTGGAGCAA ATCGAAAAAA CGCTCAAGTG CTACATCGTC GAGTCCAAAG AAGCCGTGGC GTTGCGCGAA TCCGCCATCG CGAAGGAGAA CGCGTTGCAA GCGGCGCGCA ATAAACAAGA GCTCGGGTAC ACCGATGTGG ACGGCAAGTT CGTCTCGGCC ACGCCCACGG TGTTGCGCAC GAAAATTCGC TCGAGCGATG AGGAGTGCGT GCGCAAGAGT TGCTGGGAGA CGCTTCGCGC GAACGGTCCC TTTTTGTGCG ACAACGGCTT CCCAGCCATC ATCGCCGAGC GCAACCGCTT CGCGCGCGCG CTCGGTTTCG AGGATTTTTA CGACATGAAG GTGACGCAAG CGGAAGGCTT CAACAAGAAG AAGTGCTTCG AGATGCTCGA TGGCTTGGAG GAAGCTACGC GACCGCTCAT GCACGCCGCG CGCGACAGGC TCAAGGCGGA AAAGGGCGAA GACGCGACCA AGGGATGGAA CACAGCGTAC GCGCTCAGCG GAGAGTTGAC GCAGTTGATT GATCCGTATT ATCCATTTGA AAACGCTCCC GAGGTCTGGG GACGCTCGTT CGGGGCGATG AAGATTGGTT ACAAAGGCAC GACTATGCGA CTGGATTTAT GCGATCGCGT CGGCAAGTAT CCCAACGGTT TTTGCCACTG GCCGACGCCG CCGTTCAAGA AAACTGATGG CACGTGGGTG CCGTCTGAGT CGAACTTTAC CTCGCTCGCC ACGCCTGATG AAATTGGTAG CGGTAACACC GCCTTGACGA CGCTGATGCA CGAAGGCGGA CACGCCGCGC ACTTTGCCAA CATCGTTCAA GGTTCTCCGG TGTTTGCGCA AGAGCGGGCG CCGTTCTCTG TCGCCTTGGC GGAGACACAA TCAATGTTTC TCGACGCCTT GTGCGAAGAC GCTGCGTGGC AAGCGCGATA CGCGAAGGAT CGCAAGGGCA ACGTCATTCC TTGGGAGTTG ATTGAACGAA ATATTCGACA AAAGCACCCT TATAAGGTCA TGGCGCTGCG CGGGATGATC GCCGTGCCGT ACTTTGAGAA GGCTTTGTAC GAGCTTTCGG AAGATCAATT GACGACGGAA AACATTTGCC GCGTCGCTGA TGAGATTGAA GAAAAGATTC AAGGAGGTTT CAGCGGTCGT CCGTTGATGA GCGTGCCGCA CATCTTGGCC GACGAGTCGT CCGCATACTA CCACGGATAC GTGTTCGCAG AGATGGCCGT GCACCAAACT CGAGAGCATT TCTTCCGCAC CGAGGGATAC ATTGTCGATA ACCCAAAAGT CGGTCCAACG CTCGAAGCCG AGTACTGGCG TGCCGGCAGC GGCAAGCCGG GCTTCCTCGG GTTGGTGAAT AATCTCACTG GCAAGCCTTT GTCGCACGAC GCTTGGGTCA 
AAGAACTCGG CGAAGACGTC GAAGAACTTG TCAAGAGTGA ACGTGAGGCT TATGAAAAAT CTCTAGCCGA AGCCACATCT ACGAGCGATG TTGATTTGGA CATGCGCATG CTATTCGTCC ATGGCGACGA GGTCATCGCC GATAGTGCCG AAAACGGAGG TTTCTTGCCG GCGTGTGCAA AGTTCAAGAG CTGGATTCAC GACAAGTGGC CGAAAACCGT GGCAGCGTAG ATTGTACTCG TGCGCGAT
|
Protein sequence | MANSATTTPA SFIDAFNDEY LAKHKTFEDN FWATKMNLRG NDVEALTKSF NALETFMGDA ETLAKTRELL ASKDVTDDQR IVLEQIEKTL KCYIVESKEA VALRESAIAK ENALQAARNK QELGYTDVDG KFVSATPTVL RTKIRSSDEE CVRKSCWETL RANGPFLCDN GFPAIIAERN RFARALGFED FYDMKVTQAE GFNKKKCFEM LDGLEEATRP LMHAARDRLK AEKGEDATKG WNTAYALSGE LTQLIDPYYP FENAPEVWGR SFGAMKIGYK GTTMRLDLCD RVGKYPNGFC HWPTPPFKKT DGTWVPSESN FTSLATPDEI GSGNTALTTL MHEGGHAAHF ANIVQGSPVF AQERAPFSVA LAETQSMFLD ALCEDAAWQA RYAKDRKGNV IPWELIERNI RQKHPYKVMA LRGMIAVPYF EKALYELSED QLTTENICRV ADEIEEKIQG GFSGRPLMSV PHILADESSA YYHGYVFAEM AVHQTREHFF RTEGYIVDNP KVGPTLEAEY WRAGSGKPGF LGLVNNLTGK PLSHDAWVKE LGEDVEELVK SEREAYEKSL AEATSTSDVD LDMRMLFVHG DEVIADSAEN GGFLPACAKF KSWIHDKWPK TVAA
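The relationship between the two sequences above can be illustrated by translating the start of the open reading frame. This is a minimal sketch, assuming the standard genetic code (the Translation table field in the record is empty); the `translate` helper is hypothetical, and the 33-base fragment is the stretch of the gene sequence beginning at the ATG that encodes the first residues of the listed protein.

```python
# Translate the first codons of the open reading frame and compare them with
# the start of the listed protein sequence.
# Assumption: standard genetic code (the record leaves "Translation table" blank).

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)

# Build the codon -> amino acid lookup in TCAG order.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# Fragment copied from the gene sequence above, starting at the ATG.
orf_start = "ATGGC GAACTCGGCG ACGACGACGC CGGCGTCC"
print(translate(orf_start))  # MANSATTTPAS
```

The output matches the first eleven residues of the protein sequence (MANSATTTPA S), which suggests the listed gene sequence is already given in the coding orientation even though the gene lies on the minus strand.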
|
| |