Gene Information |
Locus tag | OSTLU_39896 |
Symbol | ARP3501 |
ID | 4999699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 377282 |
End bp | 378532 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | |
GC content | 59% |
IMG OID | 640415120 |
Product | predicted protein |
Protein accession | XP_001415475 |
Protein GI | 145340736 |
COG category | [Z] Cytoskeleton |
COG ID | [COG5277] Actin and related proteins |
TIGRFAM ID | |
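The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon. The record does not state either convention, but both are consistent with the values it reports. Variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 377282
end_bp = 378532

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says so) and ends with one stop codon,
# which is counted in the gene length but not in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # matches the "Gene Length" field
print(protein_length, "aa")  # matches the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields.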
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.213761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCGAGCG ACGCGCACAC GCGCCCGGCC GTGGTGATCG ACAACGGCAC CGGGTACACG AAGATGGGAT TCGCCAAAAA TGTCAATCCG ACGCACGTGA TCCCGACCTG CGTCGCCGAG AACGCGCCCG CGAGCGCGTC GAAGCGGCGG GGCGCGATGG ATGACTTAGA CTTCGCGATC GGTGACGAAG CGATGGCGCT GAGTGGGTCG AGAGACGTGC GATGGCCGAT CAGACACGGA CAAGTGGAGA ATTGGGAACA CATGGAAAAG TTTTGGGAGG CGTCGATTTG TCGATACCTG AGGTGCGATC CCGAGGATCA CTACTTTTTG TTGACCGAAC CGCCGTTGAA TCCGCCGGAG AATCGAGAGT ACACGGCGGA GATCATGTTT GAATCGTTCA ACGCGCCGGG GATGTACATC GGCGTGCAGG CGGTGTTGGC GCTCGCGGCG AGCATAGCGA GCAAGAAGCA GAGTCAGTAC GCGTCGGCGT TGACGGGGAC GGTGATTGAT ATCGGGGATG GGGTGACGCA CGTGATACCG GTGAGTGATG GGTACGTGTT GGGGAGCTCG ATCAAGAGCG TGCCGTTGGC GGGGAGAGAT TTGACGACGT TTGTACAATA TCTGATGCGA GAGCGCGGCG AACGCGTGCC GCCGGAGGAC GCGATGGAGG TGGCGAGAAA GGTTAAGGAG GATTACTGCT ACGTGTGCAA AGATGTTGTG AAAGAGTTCT TGCAGCACGA GCGCATGCCG GGCGAGTACG TGGTGCAAAT ACACGGCGTG CGGGGGAAAA CCGGCGACAC GTGGACGGCG GATGTCGGTT ACGAACGATT TCTCGCCCCA GAGGTCTTCT TCGAGCCCGA GATATACTCG TCGGACTACA TCACCCCGTT ACCAGAGCTA GTGCACCAGG CGATTGCGTC GAGCCCGATC GATACCAGAC GTAATCTGTA CGGTAACATT GTGCTCTCGG GCGGAAGCAC GATGTTTAAA GGATTCGGCA AGCGCATCAA ACGCGACGTC AAAAGGCTCG TGGACGGGCG AATAGCGGCG ACGACGAAAG GCGCCACGTT CGAGTCGAAA GAAGTCGAAG TCGAGGTCGT GACGCACAAC TTCCAGCGCA CCGCAGTTTG GTTCGGTGGA AGCGTACTCG CGTCCACGCC CGGTTTTTAC TCGAGCTGCG TCACCAAAGC CGAGTACGAG GAAAAGGGCG CGAGCGTCGT TCGGCAGAAT CCCGTGTTTC GAGGTATCTA A
Protein sequence | MSSDAHTRPA VVIDNGTGYT KMGFAKNVNP THVIPTCVAE NAPASASKRR GAMDDLDFAI GDEAMALSGS RDVRWPIRHG QVENWEHMEK FWEASICRYL RCDPEDHYFL LTEPPLNPPE NREYTAEIMF ESFNAPGMYI GVQAVLALAA SIASKKQSQY ASALTGTVID IGDGVTHVIP VSDGYVLGSS IKSVPLAGRD LTTFVQYLMR ERGERVPPED AMEVARKVKE DYCYVCKDVV KEFLQHERMP GEYVVQIHGV RGKTGDTWTA DVGYERFLAP EVFFEPEIYS SDYITPLPEL VHQAIASSPI DTRRNLYGNI VLSGGSTMFK GFGKRIKRDV KRLVDGRIAA TTKGATFESK EVEVEVVTHN FQRTAVWFGG SVLASTPGFY SSCVTKAEYE EKGASVVRQN PVFRGI
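The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool. Only the first 30 and last 21 nucleotides of the gene sequence are used here; the full sequence can be passed in the same way.

```python
from itertools import product

# Assumption: standard genetic code (NCBI table 1); the record leaves this field blank.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First 30 and last 21 nucleotides of the gene sequence above.
gene_head = "ATGTCGAGCG ACGCGCACAC GCGCCCGGCC"
gene_tail = "CCCGTGTTTC GAGGTATCTA A"

print(translate(gene_head))  # compare with the start of the protein sequence
print(translate(gene_tail))  # compare with the end of the protein sequence
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field, and `len(translate(...))` on the full sequence can be compared with the Protein Length field.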