Gene Information |
Locus tag | OSTLU_29609 |
Symbol | ARP3504 |
ID | 5006676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 521006 |
End bp | 522346 |
Gene Length | 1341 bp |
Protein Length | 407 aa |
Translation table | |
GC content | 59% |
IMG OID | 640422097 |
Product | predicted protein |
Protein accession | XP_001422774 |
Protein GI | 145357127 |
COG category | [Z] Cytoskeleton |
COG ID | [COG5277] Actin and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00127301 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CATCGCGCCG CTCCGCGCGA GTGACGGCCC GAAGCGCGCC CCGCGCGACG CGACGGCGCC GGCGAACGAA CTCCAATTCC GAATCCAAAG AATCGCACCG GCCATCGATT CATCGACATG ACGGACGCGA GCGGCGCGCT CCCGGCGCGG TTCGCGACGT CGGTGGGACT GACGAACCGT GAGAACGTCG TGGTGTGCGA TACGGGCACG GGATTCGTCA AGGCTGGGTA CGCGGGGGAT GAAGAACCGC GAACGCTGTT CCCGTGCATG GTGGGACGAC CGACGCTGCG GTACGAGGAA GACGCGTTCG ACGATGAAGC GATGAAAGAC GTGTACGTCG GGGACGAGGC GGCGCGAAAA CGCGCGAATT TAGAAATTTC GTACCCGGTG TCGAACGGGG TGGTGCGAGA CTGGGAAGAT ATGGGATTGG TGTGGGACAG GGCGTTTGAG AGTTTGGGAT GCGATACGCG CGAGTGCAAG GTGATGCTCA CGGATCCGCC GTTGAACCCG AAATCGAATC GCGAGCGCAT GATGTCGACG ATGTTTGAGA CGTATGGGTT TCGAGGGGCG TACGTCCAAG TGCAGGCGGT GTTGACGCTG TACGCGCAAG GATTGATGAC GGGAGTCGTC GTGGACTCGG GCGACGGCGT CACGCACGTC GTGCCGGTGG TGGATGGATA TTCGTTCCCA CATCTCACCA AGCGATTGAA CGTGGCGGGA AGGCACGTGA CGACGCGAAT GATTGATTTG TTAACGCGTC GAGGGTACCC GCTGAATCGA ACGTCGGACG TGGAAACGGC GCGTTTGATC AAGGAGGAGT TGTGTTACGT CGCGTACGAC TACAAGCGCG ATTTGCAGTT GGCGCGAGAG ACGACGGCGA CGAACGCGTC GTACACGCTG CCGGACGGGC GAGTCATCAA ATTCGGCCCG GAACGGTTCA TGGGTCCCGA GTGTTTGTTC CAGCCCGATC TCATCGACGT CGAAAGCGAC GGGATCTCCG ACCTCGTCTT CAAGTGCATC CAAGAAAACG AAATCGACAA TCGACGGTCG CTGTATCAAC ACATCGTCTT GTCCGGCGGG AACTCCATGT ACGCCGGCTT ACCCTCGCGA CTCGAGCGCG ACATCAAGCG CCTTTACCTG AAAAACGTCT TGAACGGCGA CAAAGAGGCG ATGAAAAAGT TCAAGATGAA AGTCGAGGCG CCGGCGCACC GCAAGCACAT GGTCTTCGTC GGCGGCGCCG TCTTAGCGGA CATCATGCGA AGCAAGGACG AGTTCTGGAT CTCCAAGCAA GAGTACGAGG AGCAAGGCAT CGAACGAGCG TTGAAAAAAT GCGGCATGTG A
|
Protein sequence | MTDASGALPA RFATSVGLTN RENVVVCDTG TGFVKAGYAG DEEPRTLFPC MVGRPTLRYE EDAFDDEAMK DVYVGDEAAR KRANLEISYP VSNGVVRDWE DMGLVWDRAF ESLGCDTREC KVMLTDPPLN PKSNRERMMS TMFETYGFRG AYVQVQAVLT LYAQGLMTGV VVDSGDGVTH VVPVVDGYSF PHLTKRLNVA GRHVTTRMID LLTRRGYPLN RTSDVETARL IKEELCYVAY DYKRDLQLAR ETTATNASYT LPDGRVIKFG PERFMGPECL FQPDLIDVES DGISDLVFKC IQENEIDNRR SLYQHIVLSG GNSMYAGLPS RLERDIKRLY LKNVLNGDKE AMKKFKMKVE APAHRKHMVF VGGAVLADIM RSKDEFWISK QEYEEQGIER ALKKCGM
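The listed lengths can be cross-checked with a little arithmetic. The sketch below is not part of the record. It assumes the start and end coordinates are 1-based and inclusive, and that the protein is followed by a single stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 521006
end_bp = 522346
listed_gene_length = 1341    # bp
listed_protein_length = 407  # aa

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A 407-residue protein needs 407 codons plus one stop codon (assumed).
coding_nt = (listed_protein_length + 1) * 3

# Nucleotides in the gene sequence not accounted for by the coding region.
extra_nt = listed_gene_length - coding_nt

print(span, coding_nt, extra_nt)  # 1341 1224 117
```

Under these assumptions the span matches the listed gene length, and the 117 nt remainder is consistent with the gene sequence shown, in which the first ATG appears at positions 118 to 120 and the sequence ends in the stop codon TGA.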