Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31147 |
Symbol | |
ID | 7199156 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 203228 |
End bp | 205163 |
Gene Length | 1936 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185340 |
Protein GI | 219130370 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0700452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACTGCAAGA GCGGCTACCA GAAAAATTTC CAGACGATTC TCCAGCCTCT CTCTGCTCTC TCAAGGAAGC ATATTCTTTC AGTGGTTCCA TCATGCTTTC GAACGGCTCG CACCAGAACG TTTCCGCCTC TGGCGATACG TCAGTACCCG GCCAAGGTTG GTCCAACGGA GCCTTGGCAC CGCAACTTTT TCCCGTCACG CAGCAAGGCC GACTGCAGTT GTACGGGTCA CAACGTCGCT GGATTGGCCT TCAATCGGGC TGGACTCGGA TCGGGGCGGG ACAGACGGAA CTTGACGCTG TTCACTCGCG TTCGCAAGGT ACGGGAATCC TGGATAGGAA CGACGCGGAA CAAGCCCAAG CCGCACGTAG CTTGATTGCG CAGTTGTGTG AGACCTTCTA CAGACAGGGA TGGGCGACAG GAACAGGTGG GGGTGTTTCT ATTCGAGTGG GAGGTCCATC GCAGAATCGC CCTTGGAGAG TGTTTGTGGC CCCCTCGGGG ATTCAAAAGG AAGATATGAT TGGTGACGAC GTCTTTGAAC TGGATATGGA TCGGAAAGTT ATCGTTCCCC CGAGGACGCC GAATCTAAGA CAGTCGGCCT GCACCCCGCT CTGGTACGTG GTCTACAAGT ATAGACCAAC CGCAACTTGC GTCATTCACA CTCATTCAAT GCACGCGCAA ATGGCTACCT TGTTGGATCC GACCGAAACC GCTCAAACTC TTAACGTTAC CCACCTGGAA ATGCTCAAAG GCGTCGGCAA CCACGCCTAC GACGACGTTC TCGAGATCCC CATCATCGAC AATCGTCCCT CGGAAGACCA ACTGGCCACG CAGCTGCAAG CCGCCATTCA GGCGTACCCT AAGAGCAACG CGGTACTCGT GCGTCGTCAC GGTCTCTACG TTTGGGGCGA TAGCTGGGAG CAGGCCAAAA CGCAATGCGA AAGTTTTGAT TATCTCTTCC AATCGGCCGT GCAAATGAAA GCCATGGGCA TTGATTCCGG ACTTAAACCA TTGCAAGGGA CGTATCGCGA AGGCGAGGAC AAGGAGGACC TCGTCGAAAA GACTGTCGAC GAGCCTCCCC TCAAAAAACT CAAGACGACT GGGTTTCACG GGCTCAAGGC CGCCGACAAC CACCGCGACG TCGTCGCCAA CGCGGTACCA ATTCTGCCCC GCGATGCGAA GATTTTACTA CTGGACATTG AAGGATGTAC CACAAGTATT TCCTTTGTCA AGGACCGACT GTTTCCGTAC GTCCGGGAGC GTTTGGACTC TTATCTGAAA GGGCACGTGG CCGCAAGCGA CAAATATCAG CAGTTGGCTA AAGCGTTGGC CGGCGAAGCG GATGCCCACA GCGACTCGCC TGTTGCGGGT ACGATTCGAC AAGACGTCGC TGGGATGGTA CGATACATGA TGGATCGAGA CTTCAAATCT GCTACACTCA AAGCGCTTCA GGGGGACATT TGGAAGACTG GATACGCTCG CGGTGAGCTG AAGGGACACA TATACAGCGA CTTTGTTCCT ACTTGTCAAT GGATGCAACG ACACGGCGTC CGTGTCTACA TTTATTCTTC TGGGTCGGTG GCTGCTCAAA AGCTTTTGTT TGGCAACTCG ACCGAAGGCG ACTTGTTGCC GTATTTGTCC GGGCACTTTG ACATTCCCAC AGCTGGTCCT AAAAAGGAAG CAGGGTCGTA CACAGCCATT GCTCAAACGC TCCAAGTCGC ACCTTCCGCC ATTGTGTTTT GCAGTGACGC AGAAGCCGAG CTCGTTGCCG CACGGGAAGC GGGCATTGGT TATCCTGTCA TGAGTGTTCG GCCCGGCAAT GTTCCGCTAT CGGCCGAGGG ACGAGAGCTT CCAGCAATCT ACTCGCTTCT GCAACTTTGT GGAGAGTGAA TATACAATGA TTCTGTCTAG CTATGTATCA GACAATCACT TTTTTG
|
Protein sequence | MLSNGSHQNV SASGDTSVPG QGWSNGALAP QLFPVTQQGR LQLYGSQRRW IGLQSGWTRI GAGQTELDAV HSRSQGTGIL DRNDAEQAQA ARSLIAQLCE TFYRQGWATG TGGGVSIRVG GPSQNRPWRV FVAPSGIQKE DMIGDDVFEL DMDRKVIVPP RTPNLRQSAC TPLWYVVYKY RPTATCVIHT HSMHAQMATL LDPTETAQTL NVTHLEMLKG VGNHAYDDVL EIPIIDNRPS EDQLATQLQA AIQAYPKSNA VLVRRHGLYV WGDSWEQAKT QCESFDYLFQ SAVQMKAMGI DSGLKPLQGT YREGEDKEDL VEKTILLLDI EGCTTSISFV KDRLFPYVRE RLDSYLKGHV AASDKYQQLA KALAGEADAH SDSPVAGTIR QDVAGMVRYM MDRDFKSATL KALQGDIWKT GYARGELKGH IYSDFVPTCQ WMQRHGVRVY IYSSGSVAAQ KLLFGNSTEG DLLPYLSGHF DIPTAGPKKE AGSYTAIAQT LQVAPSAIVF CSDAEAELVA AREAGIGYPV MSVRPGNVPL SAEGRELPAI YSLLQLCGE
|
| |