Gene Information |
Locus tag | OSTLU_625 |
Symbol | |
ID | 5004232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 38805 |
End bp | 40775 |
Gene Length | 1971 bp |
Protein Length | 657 aa |
Translation table | |
GC content | 54% |
IMG OID | 640419653 |
Product | predicted protein |
Protein accession | XP_001420059 |
Protein GI | 145351381 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
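The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon (the record does not state this). The function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def codon_count(gene_length_bp: int) -> int:
    """Number of complete codons; raises if the length is not a multiple of 3."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError(f"{gene_length_bp} bp is not a whole number of codons")
    return codons


# Values taken from the record above.
gene_length = feature_length(38805, 40775)
print(gene_length)               # 1971
print(codon_count(gene_length))  # 657
```

Under that assumption the coordinates give 1971 bp, matching the Gene Length field, and 1971 / 3 = 657 codons, matching the Protein Length field.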
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TGGACGGAGC ATCGCGCGCG ATCGACGGCG ACGGCGAGCG CTCAGGGGTT ATACGATCTT CAAGGTTTGC GCGCGCCCGA AGATTTCGAT CGCTTGGCGC GAGAGACGAT ATCGAAGTGC GAGGCGATGG CGCGCGCGCT CAAGGCGGCG GCGCCGAGCG CGGCGAGCGT CGGGGCGTTG GATGAGATAT CTGATGAGGT CTGTCGAGTC GTGGACGTCG CGGAGGTGTG TCGTCACACG CATCCTTCGC GCGAATACGT CATCGCGGCG GAGAAGGCGT ACGTGCGACT ACAAGATTAC GTTGCAAGCT TGAACGCCGA CGGCGACTTG TACGAGGCGT TGCGGAGCGC GCGAGAGAGG GACGCAAAGA ATTTGAGCGA CGAGGGCGCG CGCGTGGCGC TTACTTTGCA AGAAGATTTT GAACGCGGAG GTATTCATCT CGACCCGGCG AGGCGACGGG ATTTCGACGG GAGCTTGTCG CGAAGTCTTG AGCTCGGGAT GGAGTTTCAA CGGAATTTGT TGCGCCCAGA GTTAGCGAGC AGAGTATCGC TCGACAAGAG CACGCTGAGT TCGCTCCCGA AGACAGTGCG AGACCAGTTT CGAAGTAATG ATGGTAAGCT ATGGACGGGG TTGGTAGATT CATCGAACTC GTCGTTGATG TTGAGACACC TGAGGGATTC AAACGCGCGC CGAGACGTCT TCATAGCGGC GAACACTGGA CCCGAGCCAA ACAAAGACGT TTTGGCCAAC CTCATATCTT CTCGTGCTGG TGTCGCGCAT TCTCTAGGAT TCGAAACATA TGCCAAATAT GCCACCGCAC CGTTGCTTGC TCGATCACCC GACGCCGTTC GCGAGTTCTT GTTGAGTCTA TCGGATGTCC TTCGGCAGAG CGCAAAGGAC GAGTACGCCG TCATTGAAAA GTATAATCAC GGCAAACACA TATCGGTGTG GGACAAAACT TACGCGATGG CACAAGCGCG TGGACACGAG TGCGAATTCA ATTCGGCCGC GATTGCGGAG TATTTCCCTT TGGAAGGCGT CATCGTCGGC ATCGGCGAGC TTCTTGCGAG AGTTCTCGGT TTACGCATCG AACTGCAGGA GTTGGCACCT GGCGAAGGAT GGACGAATGA CCTGAAGAAA CTAGTGGTGA AGACTCGTGA TGGAGACATG CGAGGGACGA TTTATTTGGA CTTGCTCCCG AGGCCGGGAA AGTTTAATCA CGCCGCACAC TTTGTCATTC GGTGTAGCCG CATGGTGTCG CCCACAGAAC GGCAGCACCC ATCGGTCGCT CTCGTGTGTA ACTTCCCTCC CGTATCGGGC AAAGGGCGAT CTTTGTTAAG TCACGGAGAA GTGGAAACGT TTTTACACGA GTTTGGTCAC GCCATGCACT CTGTGCTTTC GGATACAGAA TTCCAACATT TGTCTGGAAC TCGAGCGCCA ATGGACATCG TTGAGGTTCC GAGTCATTTA TTCGAGCACT TTGCGTGGGA TCCAAGTGCT TTGAAGCTTC TTGGAAAACA CTACATGACG CACGAGCCCA TACCTGACGC CATGATTTCC GCGTTGCGCA AGTCGAGGAA TATATTTCGT TCGATTGAGT CGCAACAACA GGTTGTCTTC GCGTTGACCG ATCTCGAGCT TCACAACCAA ACATCCGAAC TATCGTCAAA ATCGATCGCT GATCTCGCTG CAACGATTCA AAACGAGCAC AGTATGTTCA AACCTGTGTC CGGTACGAAT TGGGAGCTTA GATTTGGTCA CTTTGTCGGC TACGGAGCGA CATATTACTC GTACCTGTAC 
GCCGATGTCC TGGCCGATGA CATTTGGAAG CGTTACTTTG AGGGCGACAG CCTCGCGGCG GGCGCGGCGG AGAGTCTTCG TGACAAATTA TTGCGACACG GGGGATCGAG AGATCCAGAA AAAGTGATTA GAGATTTGCT AGGAAAGGAT TCGTTGATAG AAGTTAATGG A
|
Protein sequence | WTEHRARSTA TASAQGLYDL QGLRAPEDFD RLARETISKC EAMARALKAA APSAASVGAL DEISDEVCRV VDVAEVCRHT HPSREYVIAA EKAYVRLQDY VASLNADGDL YEALRSARER DAKNLSDEGA RVALTLQEDF ERGGIHLDPA RRRDFDGSLS RSLELGMEFQ RNLLRPELAS RVSLDKSTLS SLPKTVRDQF RSNDGKLWTG LVDSSNSSLM LRHLRDSNAR RDVFIAANTG PEPNKDVLAN LISSRAGVAH SLGFETYAKY ATAPLLARSP DAVREFLLSL SDVLRQSAKD EYAVIEKYNH GKHISVWDKT YAMAQARGHE CEFNSAAIAE YFPLEGVIVG IGELLARVLG LRIELQELAP GEGWTNDLKK LVVKTRDGDM RGTIYLDLLP RPGKFNHAAH FVIRCSRMVS PTERQHPSVA LVCNFPPVSG KGRSLLSHGE VETFLHEFGH AMHSVLSDTE FQHLSGTRAP MDIVEVPSHL FEHFAWDPSA LKLLGKHYMT HEPIPDAMIS ALRKSRNIFR SIESQQQVVF ALTDLELHNQ TSELSSKSIA DLAATIQNEH SMFKPVSGTN WELRFGHFVG YGATYYSYLY ADVLADDIWK RYFEGDSLAA GAAESLRDKL LRHGGSRDPE KVIRDLLGKD SLIEVNG
|
| |
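The gene sequence above appears to be listed in coding orientation, since its first codons translate directly to the start of the listed protein sequence. A minimal sketch of that check follows. Because the Translation table field is blank in this record, the standard genetic code (NCBI translation table 1) is an assumption here, and the helper name `translate` is illustrative only.

```python
# Standard genetic code (assumed: NCBI translation table 1), laid out in
# the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored.

    Unknown codons become 'X' and a trailing partial codon is dropped.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 bases of the gene sequence, copied from the record above.
first_30 = "TGGACGGAGC ATCGCGCGCG ATCGACGGCG"
print(translate(first_30))  # WTEHRARSTA
```

The output matches the first ten residues of the protein sequence listed above.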