Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_41838 |
Symbol | |
ID | 5005193 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 367767 |
End bp | 369751 |
Gene Length | 1985 bp |
Protein Length | 626 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420614 |
Product | predicted protein |
Protein accession | XP_001421129 |
Protein GI | 145353669 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0141589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGG GCGAACGCTC GAACGCGTCG AAGCTCGCGG CGGTGCGAGA GGCGATGGCG AAGCGAGGGG TGCGAGCGGT CGTCGTGCCG TCGCAGGATC CGCACTTTAG GCGCGTCGGC GAAGCGAAGG CGAACGAACG AAACGAGGAA CGACGACGCG CGCGACGGGA AAGACTGACG AACGGGCGAG GGCGTGTTTT TGTGGGGAAC GCAGTGAGTA CGTGGCGGCG TGCTTCGAGC GACGACGATG GTTGAGCGAT TTTACGGGGT CGGCGGGGAC GGTGGTGGTG ACGGACGCGG CGGCGTTGTT GTGGACGGAT GGACGGTATT TCGTGCAGGC TGAAGACGAG CTGAGCGAGG ACTGGACTCT GATGCGAAGT GGGGTGAAGG ATGTGCCGGA CGTGAAGAAG TGGTTGTGCG CGGAGGAGGC GGGACTGGCG TTTACCGGAG CCAAGGTGGG CATCGATCCA AACGTGCACT CGGTGAGCGA GGCGCGAGGT TTGAGAGAAG CGTTGAGCGC GTGCGGGATC GAGTTGATGA GCGTCGAAGA GAACTTGGTA GATTTGGTTT GGAGCGATCG TCCACCGTTC CCGAAGACGC CGCTCAGAGT GCACCCGATG GAGTACGCGG GGAAGAGCGT GGCGGAAAAA TTGGAAAACC TTCGAGAAAA AATGAAGGAA AACGACGCGC AGAAGCTCGT CGTGAGCTCG TTGGATGACG TCATGTGGCT ATGCAATGTT CGAGGCGGTG ATGCACCGTG TAATCCGGTG ACGTTGTCTT ACGTCTTGGT GGGTGAAAAC GACGCTTCGT TTTACGTCGA CACGGACAAG GCGACGCCTG AAGTCGTGGC GCATCTCGCC GAGGCAAACG TGACGATCAA GCCGTACGAA GACATGGCCA AAGACGTGTA TGCCGCGGCA CAGCGCGGTG AGCGACTCTG GATGGACGTC GATAAGGTCT CCATCGCCAT GCTCGAACAG GCTGAAGCCG GAGCCGCCGA AGCGCCCAAG GATGCGAAAA AGGTGAAGAC GGAGAGCGCG CCGTCCGCCA TCAAGGAGGG CACGTGTCCG GTCCCGATCG CAAAGGCGGT GAAGAATGAG GCCGAGATGG CCGGTATGGT CGAAGCCCAC CTCATGGATG GCGCTGCGAT GGCTGAATTC TGGTGCGCGA TCGAGCGAGA CGTCGCCGAG GGGCGCGCCA TTGACGAGTA CGAAGCTGGC GAGAGGGTCT TGGCGTGCCG AGCCAAGCAA AACGGTTTCT TCGAAGAATC GTTCCCGACG ATCGCGGGTG AAGGTCCTCA TGGCGCCGTG GTGCACTACC GTGCTTCGAA AAAGAGCGCG AGGGCTATCG GTAAGGACAG CTTATTACTC TGCGACAGCG GCGGCCAGTA CGCGTGTGGC ACGACGGATG TCACTCGAAC GGTGCACTTC GGAACGCCCA CCGCTCATCA AAAGGAGTGC TACACGCGCG TGCTCCAAGG TCACATCGCA CTCGACCAAA TGGTTTTCCC TGTCGGCACG AAAGGTTTCG TTCTCGACGC CTTTGCGCGA TCGCACCTGT GGGCCAACGG CTTGGATTAC CGTCACGGCA CCGGCCACGG CGTCGGCGCG GCGCTCAACG TGCACGAAGG TCCGCAAGGA ATCTCTCCGC GTTTTGGAAA CATGACGCCC CTTATGCCAG GAATGATCTT GAGCAACGAG CCGGGGTATT ACGAAGACGG TGCGTTCGGT ATCCGCATCG AGACGCTTCT GCAAGTGAAG GAGGCGAAGA CTGCGCACAA CTTCGGAGAC ACTGGATTTT TATGCTTTGA CGTCTTGACG TTGATCCCGA TTCAAACGAA ACTCATGGAC TTGAGCATTA TGAGTGAAAA AGAAATCGCG TGGGTGAACG CGTATCACGA AAAAGTTTGG CAACAAATTT CCCCGCGAGT GTCGGGGGAG ACTAAAACGT GGCTCGAACG CGCGTGTGCA AAGATTTCCA AGTAG
|
Protein sequence | MTTGERSNAS KLAAVREAMA KRGVRAVVVP SQDPHFRRYV AACFERRRWL SDFTGSAGTV VVTDAAALLW TDGRYFVQAE DELSEDWTLM RSGVKDVPDV KKWLCAEEAG LAFTGAKVGI DPNVHSVSEA RGLREALSAC GIELMSVEEN LVDLVWSDRP PFPKTPLRVH PMEYAGKSVA EKLENLREKM KENDAQKLVV SSLDDVMWLC NVRGGDAPCN PVTLSYVLVG ENDASFYVDT DKATPEVVAH LAEANVTIKP YEDMAKDVYA AAQRGERLWM DVDKVSIAML EQAEAGAAEA PKDAKKVKTE SAPSAIKEGT CPVPIAKAVK NEAEMAGMVE AHLMDGAAMA EFWCAIERDV AEGRAIDEYE AGERVLACRA KQNGFFEESF PTIAGEGPHG AVVHYRASKK SARAIGKDSL LLCDSGGQYA CGTTDVTRTV HFGTPTAHQK ECYTRVLQGH IALDQMVFPV GTKGFVLDAF ARSHLWANGL DYRHGTGHGV GAALNVHEGP QGISPRFGNM TPLMPGMILS NEPGYYEDGA FGIRIETLLQ VKEAKTAHNF GDTGFLCFDV LTLIPIQTKL MDLSIMSEKE IAWVNAYHEK VWQQISPRVS GETKTWLERA CAKISK
|
| |