Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119526 |
Symbol | ArbK |
ID | 5000257 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 487033 |
End bp | 488863 |
Gene Length | 1831 bp |
Protein Length | 477 aa |
Translation table | |
GC content | 43% |
IMG OID | 640415678 |
Product | armadillo/beta-catenin repeat protein |
Protein accession | XP_001416407 |
Protein GI | 145343604 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.225141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCTCT CACAGGTCGG TCATGAGCCT GTTTTTGGGT ATTTTGTTCA ACCGACCTAC AGGAAGCATA CGATGAAGCT GTTACTTCGT ACGTAGAAAA CTTCGGAATG GATGCGACTA CAGCCCAGAT TGAAGCTAGA GAGGAGTTTG CCTTGCAGGG CTTCGATATT TCAAGCATTG CGACAAGCTT ATCACAAGTC AACTGCAATA CTTTCACAGC AGTTGTGCAA ACTTTTCTGA CTTCGATCTG TCCTCTTGCT GAGATAGAAG ATGAAACCTG GGGAAATAGT GAAGAAGACA AGCTCACGTC TGCGATCGTC TCATTCGAAT CATATTTGTC AGATCAGTCT GTTTTCCGTC CAAACATCGT CGACTTCGCT TGCACTGCTG GTCAAAGAGA AATCATAGAA ATTCTTCTAA AATTTTGCTG CAAAACAACT TTGGCTCAAA CACTACGCGA CCGCGCTCTA AATTGTTTAA CAGGTGCGTG TAAAAGTTTT TTTCGTATTT TTTGTCTCAT TCGTACCTAC ACAGCGATGA TGATCATACA CGAGAACCGA CGTCTAGTCT TGAAAAGTGC CGCATTCCTC ACTTTACTGG AATGCCGAAC GTTGTCGACT TCCTTGTGCT CTTTCTTAGC AAGTGTCACG AAGGAGCACG GTGATGCGAA AATAATTTTG ATGAAGACCG TACGTTACTT TTCGCTCTTC ATTGAACAAG ACTCAAGAGA TGCCAGGAGC GCACGCTCAC CATCCTCACT GAAGCATTTG GTGCCTCTGA TGTAGTCTTG ATGCGGGCGA TTTGTGAACT CTTCTCCGCG ATTTTAATCG ACGATGACAG AGAGAGTTCA ACAAGCAATA AATATAGACA TGCGCTTATA TTACACGAGC ATGGATTGTT GGCGAAAGTT AATGACGGTA TGTAGTAATT TCCGTCCCTG TGAAATCAGC AATCTGAGGA ACCGCAGCTA TTCTCCAACA AGTACAGTCG AGTCAGCTTG AGACATCTGT GGTCTTATTC AAGCTAGTAA GTAAAGCTTC CCTATGCAAG AATCTTGTTT CACAGCATAT GCAGTTTTCT CACCTGTTGA CAAGCGAGAA GATATGTAGG AGCGTGCCAG AAGATGTTTT AAGTAGTTTA CTGGTATGTG AATTTGATGT TGTGTCGATG TCAAATTTTG TCATGTCCAC ACAGGCAATA ATATCTCGTC GAAGGGACGC AGTATTACTG AACCATTGCT GCAGATGCGT TCGGCGCCTT ATATTGTCCG ACTCGCGCAA AGAAATCCTG CTAAAGTTCG ACGTTATCAA AACTTTTGTC TCCTTAATGA CGTGCGAAGG TGAGCGGTTT ACTACTCTAT CGCATTGTAT CTGAGTGTAA ATTACCCCAG TAAGTTCAGT GCGTGAGAAC TCGATCAGTA TTTTGGCGGC TTTGGCGCTT CGGAATGTAG AAGTTTCTAA TGAATTGAGA GCAAATGGTG GGGTCGATCA GCTGTTAGAG TTGATGCCAC AGGAAACTTG TGGTAAAGTT CTTCGTCAAT GCTGTATTCT CATCAGGAAT ATTGCGGTTA GAAATCAGGA TACCAGGGTA TGAACATCAT TGTTTTACCT TCTTATTTTT TAAAAGTAGC ATATACTAGG ATTATCTCGT GGCCAATAAT GTGACAGCTC AACTCCACGA CCTGAAGGCT TTGCATCCGA GGTCATGTAT TGATGTTGGA AGTGCGGCAC TGCGCGATTT AGGTTGCGAT GACTACGGTC GTGGCTGGGT GCCAAGAACA TTGGTCATGG GCACAGATGG ATCGATTATA AATTCTAGTG ACTTGAACTA G
|
Protein sequence | MRLSQEAYDE AVTSYVENFG MDATTAQIEA REEFALQGFD ISSIATSLSQ VNCNTFTAVV QTFLTSICPL AEIEDETWGN SEEDKLTSAI VSFESYLSDQ SVFRPNIVDF ACTAGQREII EILLKFCCKT TLAQTLRDRA LNCLTAMMII HENRRLVLKS AAFLTLLECR TLSTSLCSFL ASVTKEHGDA KIILMKTERT LTILTEAFGA SDVVLMRAIC ELFSAILIDD DRESSTSNKY RHALILHEHG LLAKVNDAIL QQVQSSQLET SVVLFKLFSH LLTSEKICRS VPEDVLSSLL AIISRRRDAV LLNHCCRCVR RLILSDSRKE ILLKFDVIKT FVSLMTCEVS SVRENSISIL AALALRNVEV SNELRANGGV DQLLELMPQE TCGKVLRQCC ILIRNIAVRN QDTRDYLVAN NVTAQLHDLK ALHPRSCIDV GSAALRDLGC DDYGRGWVPR TLVMGTDGSI INSSDLN
|
| |