Gene Information |
Locus tag | PHATRDRAFT_47146 |
Symbol | |
ID | 7202047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 609442 |
End bp | 611081 |
Gene Length | 1640 bp |
Protein Length | 505 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181235 |
Protein GI | 219121775 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.460614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACCCGAACC TCCATGCTTT CCTTCATCCA CCGGAGCACC CGACCTTGAA ACTGCTGACG CAAGGAGATA TCAATTCCAT AATACTGCCA CTCTATCACT GTGGGGTAGA AAGTTTATCC TAATGGTTCA ACCATCCAAC GCCACGGTCA GAAAGCTGCT GACGAAATCA CAAACAACGC CACAACGAAG ACCACGCCAG TCGCCCTCTC CCATCTCAAT TGTTATCAAA ATTGTCACAG CGACTGTTAT CATCCTTGCC ATAGTGCACC CGATTTATCT CTATCGCCAC ATATCCCAAT CGAACGCAAT CCCTGGGGAC TGGTTCCAGA ACAAATCAAT GTCATCGCTG TCGGCGGAGT CAACCAACCG CAAGCCAAAG ACGCCAACCG AAGGCCAAGC AACAGGGCTT ACACGAGCGC AAGAAAATAA CGCTCACCGA CTACACCTTC GCACGACTCC TGGCGAACGA CCGTATCTGC GACAAGACAG TCCGGGTTGT TTGACGGATG CTTGTGTACA GCAACTGGCG TCCACGATTG CTCGCGCCTT TCCGGACCGC GTTAACCAGA GTTGGTGTTT TCGGGAGGAA CCTCCACTGG GTCATGAAGA CAACGCCAAC GCGCCCGCGT CAGTACACAG CAAAGACGGA AATGGGGTGT GGCGGGGCAT AATCCTGGTC AAAGTTCCCA AAGGCGCGTC TTCCACCTCG GCCGGCGTTG CCATTCGCAT CGGTCGCCGT GTCGGGTGTC AGGCAGTCCA GTGGAAACAC CGCGTCGCTT CCAAGTACCA ACGGCTCCAC CATCCCGACA CCTTTTTGTT TACGACCGTT CGGGACCCGG CAGCACGTGC GATCAGCACT ATTTTCTTCC ATACCATAAG TCGAAGCCCA AAAGCCAAAC CGACGGACGA TCTCATCAAA ACTTATTTAC AACAGAGCAA GGATATACAC TTTGGATCCA TTTCGGAGGG ACAGGGAGGT TTTCAATTGC GTTACACATC CCTAGACGAG ATTGCTAAGG GCTCGGCCTG GTCCGCGACC AACAGAACGC GCGTTTTGCA TCCGGAGCAG GTCGTCGCCA ACGTGCGGAG CGTAATTGAC TCTTACGGAT TCTTGCTTGT AACGGAGCGT ATGGAGGAAT CTCTCGTCGC CATGGCACTT GTCCTGGGCA TTGACGTGGC TGACGTTCTA GTCACGTCGT CCAAGGTCGC CGGTGGTAGT CGGTATCACT TTGCCCGTAT GCCTAAACAA GAGTACAAAT GCTTACCTAC CGTCAAAAGC TTTGTTTCTC CCGGAGTCGC TCAATACATA GATTCGGACG AGTGGCGGGC AATTAACTAT GGAGATTACT TGCTGCAAGC CACCGCCAAC CAGAGTTTAG ATTTGACCAT TGCACGTTTG GGGAGAACGC GGTTTGAGGC GGCTCTGTCC GAATTCCGTA CATTGCGCGC CAAGGAACAA GCTCTTTGCG CCCCTAATGT GTCATTCCCG TGTTCGAACG AGGGCGTTCC ACAACCACAG CTTGCCACAG AATCCTGTTA CCTACCATTT TTTGATTTCG GCTGTGGACA CAAATGCATT GATCAAATGA TCAAAATTGA TCGGGAAATG CGGGGAGTTA GTGGATGGCA GTACCAATAG
|
Protein sequence | MVQPSNATVR KLLTKSQTTP QRRPRQSPSP ISIVIKIVTA TVIILAIVHP IYLYRHISQS NAIPGDWFQN KSMSSLSAES TNRKPKTPTE GQATGLTRAQ ENNAHRLHLR TTPGERPYLR QDSPGCLTDA CVQQLASTIA RAFPDRVNQS WCFREEPPLG HEDNANAPAS VHSKDGNGVW RGIILVKVPK GASSTSAGVA IRIGRRVGCQ AVQWKHRVAS KYQRLHHPDT FLFTTVRDPA ARAISTIFFH TISRSPKAKP TDDLIKTYLQ QSKDIHFGSI SEGQGGFQLR YTSLDEIAKG SAWSATNRTR VLHPEQVVAN VRSVIDSYGF LLVTERMEES LVAMALVLGI DVADVLVTSS KVAGGSRYHF ARMPKQEYKC LPTVKSFVSP GVAQYIDSDE WRAINYGDYL LQATANQSLD LTIARLGRTR FEAALSEFRT LRAKEQALCA PNVSFPCSNE GVPQPQLATE SCYLPFFDFG CGHKCIDQMI KIDREMRGVS GWQYQ
|
| |
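The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, that the coding region ends in a single stop codon, and that the gene is unspliced (as the record states). The function names `gene_length` and `coding_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Bases needed to encode a protein, assuming one trailing stop codon."""
    return 3 * (protein_aa + stop_codons)


# Values copied from the record above.
start_bp, end_bp = 609442, 611081
protein_aa = 505

length = gene_length(start_bp, end_bp)   # matches "Gene Length | 1640 bp"
coding = coding_length(protein_aa)       # 505 codons plus one stop codon
flanking = length - coding               # bases outside the coding region

print(length, coding, flanking)  # 1640 1518 122
```

Under these assumptions, 122 of the 1640 bases lie outside the coding region. This agrees with the listed gene sequence, where the first codons of the protein (ATG GTT CAA, i.e. MVQ) begin at base 123 and the sequence ends on a TAG stop codon.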