Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46821 |
Symbol | |
ID | 7204678 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 552284 |
End bp | 554083 |
Gene Length | 1800 bp |
Protein Length | 543 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185726 |
Protein GI | 219120989 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.24642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATATGC GACCGTCACC CAAGGCAAAT CGGTGCGTCT CCAATTCCCG TTCGACCTCT CCCCAAACCG CACACGCCCA TCCGCTCGCA TTGCTGACTA ACATGGTGCA AGGGATTGAG ACCATTTCGT TCCCTACCAA CGCCGCCGGC ATGGCAATCA ATCGCCGGTC TCCTCGTGCG GATCGTAGCT CGACTCGTAC GTCCCTACAA CGAGACGCGG CGTCGTCGTC GTCGTCGTCA GCAGCGGGGA ACCGTCGATT CCTGCCCGTC ACGTCCGCTT ATCTCAGATA TCAAACGATG CACACCTTTG TCGTGAGTCA ACCGATGAGC GCTGCCGTAC TTTTACTCCT CGTTGTGGCG GCACTCGACG GTTTTTACGA AAGTGTACGC GTACGGCACC CGTACAATTT CGCACACGGG TTGTCCCCCG CGTCGGGACT TTCCAAACTC CCCTCGTCGT CTCTCCGCCG GACGGCAACG ACCGGTGCGA ACGGATCCGA ACTACCGACA CCACTGCTAG CCAAGCATTC GACTACGGCT ACGACTGCAG CCAATGACAC GGCTTCGATT CCTACCGCCG ATAATGCACC CCCAACAACC CGAGTTCAAT CGGCTTTCGC GGACAACAGC GTCAGCGACA ATCCGTCGTG CCTCGGCAAG GAGCATTTGG TACAGATTCT GATCGCCGCG GGGAAAACTC CGGAGGAAGC CGAGGCACAG TGTCCCACCC TACCTCTCTG GCAAGAAGTG GTGGATCTGT ACGGCGACGC CCCCATCATA CTGGGGCAGG AGCGGTGTCA GGCCTACCGG GAAGCCGTGC GCAATCAGTT CGAAGACGCA CCTCCTCTGC ACAACATTCG CGTGGACGGA CTCTTCAACG TGGGTACCAA CGCTCTGGCA CAGAACAATT TGTTGAATCT TGAACACGGA CGGTACTTTC AACCCAATTT GTCCTTGGAC GATCCGGACT ATGCGGAAAA GCTGGGCGTG CTCCTCTTCG TGGGTTGGGG CAAACATTCC ATGATCAAGT ACAAGCCAAC TAATGCGAGG TTACAGTTAC CCGTAGTACT CGTCCGAGAT CCCTACCGGT GGATGAAGAG TATGGTGCGT AAGCATTTGT TTGATGAAAG AGAATTCTAT CGTTTGTGTT TGTGTATATA TATATACATG TGTGTGTGAT TGTGACCGCG GAAGTGTTCT ACTTTTTGTA CATGGTTCAA CAGAATAGGC CTACCGACTG ACACGAATGG CTCTGCCAAC CCTTCCATAC AGTGCAAAAC TCCGTACCGG GCTGTTTTTG ACCAACAACC CAATCATTGC CCCAACCTTG TCCCTACCGA GGACGAACAA CTCGCCTCCG GCAATTTAAC TACCTACAAA GTTTCGGTCA CTCAGAATGC CCACAGTACG GTGACGGACG AATTCGATTC CCTCGCCGAC TACTGGTCGG AATGGAACCG CATGTACCGG GACGTGGAGT TCCCGCGGCT GATTGTACGC TTTGAGGACA CCATTTTTCA CGCGGAAGCC GTCATGGACG CGATTGCCCG TTGCGCGGGC GTGGAACGGG CAAAACCTTA CCGTTACTAC GTAGAACAAG CCAAATCCCA CGGTCTCAGC TCCAATTTTG TGACAGCCTT GGCCAAGTAC GGGACAAGCC AAGGACGGTT CGACGGCATG ACTCCAGCCG ACCTCGCTTA CGCCCGGACC CATCTAGATC CGGCACTCAT GAATACGTTT GGTTACCAGT ACCAAGGCAG GTATACTGCG CCGCAAGTTT CATTGGCCAA AGAGTCTTAG
|
Protein sequence | MHMRPSPKAN RCVSNSRSTS PQTAHAHPLA LLTNMVQGIE TISFPTNAAG MAINRRSPRA DRSSTRTSLQ RDAASSSSSS AAGNRRFLPV TSAYLRYQTM HTFVVSQPMS AAVLLLLVVA ALDGFYESVR VRHPYNFAHG LSPASGLSKL PSSSLRRTAT TGANGSELPT PLLAKHSTTA TTAANDTASI PTADNAPPTT RVQSAFADNS VSDNPSCLGK EHLVQILIAA GKTPEEAEAQ CPTLPLWQEV VDLYGDAPII LGQERCQAYR EAVRNQFEDA PPLHNIRVDG LFNVGTNALA QNNLLNLEHG RYFQPNLSLD DPDYAEKLGV LLFVGWGKHS MIKYKPTNAR LQLPVVLVRD PYRWMKSMCK TPYRAVFDQQ PNHCPNLVPT EDEQLASGNL TTYKVSVTQN AHSTVTDEFD SLADYWSEWN RMYRDVEFPR LIVRFEDTIF HAEAVMDAIA RCAGVERAKP YRYYVEQAKS HGLSSNFVTA LAKYGTSQGR FDGMTPADLA YARTHLDPAL MNTFGYQYQG RYTAPQVSLA KES
|
| |