Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25439 |
Symbol | |
ID | 5005009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 440803 |
End bp | 444141 |
Gene Length | 3339 bp |
Protein Length | 1112 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420430 |
Product | predicted protein |
Protein accession | XP_001421003 |
Protein GI | 145353402 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.088116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGGAA TTCGACGCGC GCCAGGCGTG GAGTGTCGAG CGACGAAGAC GAAGACGAGT AAACCGACGA TCGGGACGAA GACGGCGCCG AGCGATGCGT TGAGCCTGGC GAAAGACGTG CGGACGTATT TGGAGTCTGA GAGTAACTTG GACGGAGAGG AACACTCGAG GGAGGTGGTG ATCGCGCCGG TGAGCGGGGA GAAGATTGAG GCTCGAGCGC GAGGCGCGGC GAACGGGGAG TCGACGTCGG CGACTACGGA GAAGGAAATC GTGATTGAGA TCGAGTACGA AACGGACGAA AGCGCGATGG GAATCGCGGC GGATGAATCG TACTTTTTGT GCCCGGGGGC GTCGAGGCGG CCGGCGTGTT GGTTTCCGTG CGTCGAGCGA GGGGACGTGC TGACGACGTT TGATTTCTCG GTGAGCGCGC CGAGGGATTT GCAAGTCATC ACGTCCGCGC ATTGGGATCG CGTCGAGCGG TGCAACGACA TCGATGAAGA CGAAGACGGC TTTAGGCTGC GACATCACTT CACGGGGATG TATCCGACGT TTGCGCACGA AATGCGATTG ATTTGTGGCA GGTTCAAGGC GGTGTCGAGC CCGATCAAAG GCGTGACTTT GTTCGCTCCC AAAGACGGCG ATTACGCCGA GCGCTTGGAG ATCGCGGCCG CGGGTGTGAG TAAGGCTATC GTGGCGTACG AGGAATATCT CGGGCATCCG TACCCTATTT CGTGCTTGAA TATCGTCTTC ATGCCGGACG AGTACGTGGG AGCGCGCGAT AGCTTGGGCG CGTGCATGAA CATTCACTCG GCTAAATGGT TGATAGACCC GACGCTCAAC ACGGCTTTGC TCGATGCGCG CGTGCATATC GCCACCGCCA TCGCGCGACA GTGGTTCGGG GGCGTCGTCG TCCCGGCGGA TACCACGGAC TGTTGGGTCG TGGAAGGACT GGCGCAGTAT CTCGCCGGCG CGTATGTGAA AAAGTTGACG GGATTGAACG AACTGTCGTT CAGGCGCATG CGCGACATGC AATTGACGGC GCGGATGGAC GACGGTGAAT CACTCCCCCC TCTCGCCAGT CGTGCGGCGC GCATTTGGCG CGCCGGACAG TACGCCGGCC CCGACCTTGC CGCCGGCGGG ACGCCGAAGC CACTCTCGGC GAGCGTCGAG CGCGGGTTGC AAGCCAAGGC GGTGACGATC ATTTACATGC TCGAGAAACG CCTCGGGCCG GATGTCATGC AGAAGGTGTT GAAATATTTC GCAGGATTGC ATGTTCGTAG AAATAAGAAG GAGGGCGGGA CGCGCGCTGG TCCCTCCGCG GAGGTGCTGT CGAGCAACGC GCGTTGGATT CACACGCTGC AACTCTTCGA TCACTGCCGC GCGACGGTCA ACTTGGGCAA AGGTGAGGTG AACTCCTTCT TAGAACGCTG GGTGTATGGC GCGGGCACGC CGAAGCTGTG CGTTGGATAC GTTGTGAAAC GTAGGAAAAA CGTCATGGAG TTTGCGGTCA AACTCGAAGG GAGCGTCGCG GCGGCGGCGG CCGACAGAGC GGCACTCGCC GTCGCCAGAA ACCATCGCAC TTCGGTCACC GTGCGCATGC GCGAAGAGAA CCGCCCGGAT GCGAACGATC ACGTCGTATC GCTAGGACAT TCTGCTTGGC AGTTGATGGA GATTCCTCTG CAACCGAAAA TCAAAGATCG TCGCCCGAAG ACAATCATCG AATCTGGAGG TGACCCCGAA CTCATCGCCG CGATGGATTG TCCGGTACGA TACGTCCGCG TGGACCCGGA ATTCGAGTGG ATGGGCAACA TAGAACAAAG TGCGAAGCAG GTCGGGTTAG AGTCGATGAT GGCGCAGATG CTGGAGAAAG AAAAAGACAT CGTCGCGCAA ACCATCGCCG TGGAATTTCT CGGTCGCCGC GTCGCCAACG GATCCGTGAG CGCAGTTCTC GTGCTGGATA AGTGTCTGAA CTCTGAGGAT ACCTTCTGTC GCGTGCGCGC CGAAGCAGCG CTAGCGCTCG GCAAAAGTGC GAGCGAGAAA ACGCAATGGG GCGGATTGAA CGCGTTGATT CGGCATTACA AAAAGTTTCA GTGCGACGCC AACACGGGAA AGCCGAAACA AAACGACTTT AGAGACCTCG CGAAAGTCAT CGTTGACGAA GCCGTCATCA CGGCGCTTGG TTACGTCATG GATAACGGAA TCACCCATCA GGATGCGCTC GAGGCTATCG TTGCCCGACT AAAGTATAAC GACAACGAGG GCAATCCGCA GAGTGATGAC GGATTCGTGG CGACGTGCAT CGCCGCGCTC GGCCGCTGCG TTCCCGCGGA TGGCAAGCAA CTGCAAACCG TCATGCAGAC GATTCATTAC TACATTCGGC GTGATGACAG ATTTCCAAGC GACGACCTTT GCGTGACGTG CGCTGGAATC AGAGCGTTAG GCTTACTCGC TTCGACGATC GACTCGAAAG AGCTCCGCGC CGACGCCGAA CACGTCGCCA AACTTGGATT CGCGCTCGGT AGACAATCCA CCGTGCTCGC TTCCGCGGAC ACGATGATGT ACCTTCGCTT CGTCGAAACT AAGAGCGAGA TCGAAGCGCT CAAGTACATG CTTGAACGAT CGGCGCAAGA GTCTGCGGCG ACAAAGGCTG CCATGTTGTG GAGCGCGTGC GAGTACTTGC AATCCGTCGC CGGGGCGGAT TCTTTGAAGG GCGTCTCGAA GCCGATATTG GCCGACCTCC GTGATCTCGT GCTAAAGGGC GGAAGCGAAA TTGCGAGCGC GGCGTTTTCG ATCCTCGCCA CGCTCGCTTC GCAAGATGAG AGCCTGAGTG AGATTCGAGA ATCCATCGCC GAGGCGATTC GGCGCGCGGG CGATCAAGTC CCAGTCGGCG TCGATGTCCA CGCCGCGCAC GCCGCGCCAA CGGCTGAGGA TGAAGTCAAG CGCGCGGAGA AGGAAGCTAA GCGTGCTCGG AAGCGAGAAC GCGAGCGCTT GCGACAAGCC GCGCGCGCCG AGGTAGACTT GAAAAACGCT GAACAGCGTA GAATCGACGC CGAGGCGGCG TTTGTCGAAC AGCAAGCGGA ACACGTACAA GACGACAAGA CGACCGTCTC CGAGCGCACG GCGACGATGA TGGGATCAGA AGGCGACGCC ACGCCTCAAG GTGGTACTCC GGTGGGTGGC CTGAAGCGCG CGAGAGATCA GAATCCGAGC GTCACGCAGT TTGCCGTGGA GCCTGCGGCG CCCGTCGCGA CGGAGGACGC CGCGCCCGCC GCGACGGAGG ACGCCGCGCC CGCGAAGAAG CTCAAACTTT CGTTCAAAAT TAAAAAACCC ACGACGTAG
|
Protein sequence | MDGIRRAPGV ECRATKTKTS KPTIGTKTAP SDALSLAKDV RTYLESESNL DGEEHSREVV IAPVSGEKIE ARARGAANGE STSATTEKEI VIEIEYETDE SAMGIAADES YFLCPGASRR PACWFPCVER GDVLTTFDFS VSAPRDLQVI TSAHWDRVER CNDIDEDEDG FRLRHHFTGM YPTFAHEMRL ICGRFKAVSS PIKGVTLFAP KDGDYAERLE IAAAGVSKAI VAYEEYLGHP YPISCLNIVF MPDEYVGARD SLGACMNIHS AKWLIDPTLN TALLDARVHI ATAIARQWFG GVVVPADTTD CWVVEGLAQY LAGAYVKKLT GLNELSFRRM RDMQLTARMD DGESLPPLAS RAARIWRAGQ YAGPDLAAGG TPKPLSASVE RGLQAKAVTI IYMLEKRLGP DVMQKVLKYF AGLHVRRNKK EGGTRAGPSA EVLSSNARWI HTLQLFDHCR ATVNLGKGEV NSFLERWVYG AGTPKLCVGY VVKRRKNVME FAVKLEGSVA AAAADRAALA VARNHRTSVT VRMREENRPD ANDHVVSLGH SAWQLMEIPL QPKIKDRRPK TIIESGGDPE LIAAMDCPVR YVRVDPEFEW MGNIEQSAKQ VGLESMMAQM LEKEKDIVAQ TIAVEFLGRR VANGSVSAVL VLDKCLNSED TFCRVRAEAA LALGKSASEK TQWGGLNALI RHYKKFQCDA NTGKPKQNDF RDLAKVIVDE AVITALGYVM DNGITHQDAL EAIVARLKYN DNEGNPQSDD GFVATCIAAL GRCVPADGKQ LQTVMQTIHY YIRRDDRFPS DDLCVTCAGI RALGLLASTI DSKELRADAE HVAKLGFALG RQSTVLASAD TMMYLRFVET KSEIEALKYM LERSAQESAA TKAAMLWSAC EYLQSVAGAD SLKGVSKPIL ADLRDLVLKG GSEIASAAFS ILATLASQDE SLSEIRESIA EAIRRAGDQV PVGVDVHAAH AAPTAEDEVK RAEKEAKRAR KRERERLRQA ARAEVDLKNA EQRRIDAEAA FVEQQAEHVQ DDKTTVSERT ATMMGSEGDA TPQGGTPVGG LKRARDQNPS VTQFAVEPAA PVATEDAAPA ATEDAAPAKK LKLSFKIKKP TT
|
| |