Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20837 |
Symbol | ARP |
ID | 7201813 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 429055 |
End bp | 430756 |
Gene Length | 1702 bp |
Protein Length | 463 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | actin related protein |
Protein accession | XP_002180828 |
Protein GI | 219120167 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTAGAGT ATCGCACTCG GCGCTTCTCA CATTCTCTCG CAGCGAATAG TAATTTCCTC CCCTAGCGCT GACTGGAAAT AGAGTCTCGA CTCGTAAGGA AAGAAGTCCC TGTCACGCTG TGAGTGCTCC CCCCGAGCCA ACATGTACTG TGGTGACGAA ACGGGATCCT TCGTCGGCGA CGTCGGTTCC CATACCAGTC GGTTCGGTTA CGGCGGCGAG GACTGTCCCA AATATGTGGT GCCGTCGTAC GTCGCTCGGA ACAAATCGCC GGACGACCGC GCGCGACGCT CTCCCGTACC GAATGCGCCC CACCATCCCC GTTGGGCCGA GGCGGAACTC GCCAGCGCCC TGCGACAAGC GCGAACAGAC GATAATTCGC ATCAACCGTT GGTCGATCCC GTGGCGTACC TAGCGCAGGG TGATTCCGTA CAAGATTGGG ATGCTTACGA ACAGCTCTGG CAGTCGGCTT TTGACGTCAT GCACGTGAGG GAGCGATACA AACACACCAA AGGAGGAGGG AATGTTCGCA AAGAAAAGAA CAGTAACAGT ATAATTGCCA GCGACAATAC TGCAATTGGC GCATCTGGTG TCACGAGCAC GACCATCCGC GACACGATCT CGCAAGACAG CCGCATCGTC CACCCTGTTC TGGCGTCCAC ACCAGGATGC ACCTACAGTG TTGGAGTCGG AGCCAAAGCC ATGGCGTCGG CTCGTCGCCG CGATCTCGTC CACCGCGTCG AATGTCTCAT GGAAAGTCTC GACTGCCCCG CGGCCTTTTT GGCGCCAACT CCAATGCTGA GCGCCTTTGC CTACGGTCGT CAAACCGCAC TGGTCGTGGA CGTGGGAGCG GGAGGTTGCG TCGCCACGCC CGTCGTGGAC GGGCTCTTGT TGACGCAGGC GCAACGACGC AACGGTAGGG GTGGGGACTG GCTGGGGAAC GTCACCTGGC AAGCCTTGCT GGAACAGCGG ACCATTGTCC GTCCACGCTA TCAACAACAC GCCAGTTTTA AACCCGACGA GTCGGCAGCG AAAAACGGGA TCTTTCATCG GTGGGCCATG CAAGATCTCA TGTACGAATT TAGAACGTCC GGTAACGTTG CGGTACCGGC ATGGTGGTAC GACGAAACAG TACCGTTTTG CAAGAGCCCA GCTACCGAAG CCGGTGACGA AATAGTGATC GACCCCATTT CTCCTGGAGG GTCCGAGTCC ATTACGTACG AACTCCCCGA CGGTACGCTG GTGGACTTGA CCAATCGAGT TGGCCGAGAC TTGTGTCGCG TTCCAGAATT GCTCTTTACC GATCAAGTAC CCTTCGTCTC GGCCGATCAG ATCAGCAACT CGAGCGTGCT TATGGAACAC GAGTCACTAA CGAACTTGCC CCTCCACAAG CTCGTGCACG AGTCCCTGGC GGCTGTAGGT GACGTCGACG TCCGCAAGGA CTTGGCGGCC AATATTGTTT TGACGGGGGC GTCCTCCCTA TTGCCCAATA TGGAGCAGCG ACTATCTTTG GAAACGTCCC GGATGACGTC GAGCGCATAC AAGTGCAAAG TGCTGGCCTC TCGACACGCC GTCGAACGGT CCTGTGCTGC GTGGATTGGT GGTAGCATCC TCAGCAGTCT CGGTAGTTTT CAGCAATTAT GGCTGAGCCG GACCGAGTAC GAAGAATACG GCGCGACGCT GGCGATTCAG CGCTTTCCTT AA
|
Protein sequence | MYCGDETGSF VGDVGSHTSR FGYGGEDCPK YVVPSYVARN KSPDDRARRS PVPNAPHHPR WAEAELASAL RQARTDDNSH QPLVDPVAYL AQGDSVQDWD AYEQLWHRIV HPVLASTPGC TYSVGVGAKA MASARRRDLV HRVECLMESL DCPAAFLAPT PMLSAFAYGR QTALVVDVGA GGCVATPVVD GLLLTQAQRR NGRGGDWLGN VTWQALLEQR TIVRPRYQQH ASFKPDESAA KNGIFHRWAM QDLMYEFRTS GNVAVPAWWY DETVPFCKSP ATEAGDEIVI DPISPGGSES ITYELPDGTL VDLTNRVGRD LCRVPELLFT DQVPFVSADQ ISNSSVLMEH ESLTNLPLHK LVHESLAAVG DVDVRKDLAA NIVLTGASSL LPNMEQRLSL ETSRMTSSAY KCKVLASRHA VERSCAAWIG GSILSSLGSF QQLWLSRTEY EEYGATLAIQ RFP
|
| |