Gene Information |
Locus tag | PHATRDRAFT_49170 |
Symbol | |
ID | 7195662 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 120468 |
End bp | 122149 |
Gene Length | 1682 bp |
Protein Length | 445 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183817 |
Protein GI | 219127177 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.105496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTGGG CTGGCGACAT TCCCATTTCC CTTCGCAAAG AACCTTTCCC AGTAAAGCTC AGCTACATGA TTGACGAAGC AGAGCGCAAT GGCAAGGAGA GCATAATTTC GTGGCGCCCC CACGGTCGGG CGTTCTTCGT GAGCAACCCG AAAGCCTTCT CAGATGTCGT ACTTCCAAGG CAAGATCAGG TGCCGTCGGT GATTTGCAAA GGAATGCAAC AACTTCTCAC GTTCAATTTT TGTTATTTTC AACAGATGGT TTCGGCATAC AAAGCTGCAG TCTTTTCAGA GGCAACTGAG TCTCTACAGT TTTAAGCGAT TCTCTACCGG TAAGACGTGT TAAGAATATC ATCAATCCCA ACCTGTTGCT AGCAAGCCTT ACTCACCATT CATCGGAAAT TTTTGCACCG GAAGGATCAG ACAAGGGAAG CTACTACCAC GAATTCTTTC TTCGTGGACA ACCGAAGCTT GCCTACGCGA TTTCACGAAT GGCAATCAAG GGAAATGGAT CCCGGAAGCC AGTGTCACAG GAATCGCAGC CCGATTTCTA CTTGCTTCCC GCAATGACGT CAGAAAAGTC AGCACCCGTG GTTAGCAACG ACCAACGGCA CAAGCATTTA GCTGCCACGG CCGCGGGCGC TCAGGAAAGC AAGTCTGGCG GCATCTATTC CGAATACGAT AGTTCAGCGA CTGTTGCACA AGACAAATCA AGTCAAAGAG AAGAGATCCG AAGTGGGGCC GGCACAAGTA TTTGCAACGG GGCTCGTCAG CATCCTTTGG TTGTTCCGAG AGATGGACAG AGCACATGGC CTGCGAGTCA TCATATGCAG GCAGCATCAC ATGCATTGCC AAACAAAGAA ATGCAAGCCT TGTATGCTCC TATTGTACCA CAACGTGATA CTCGAGCACA ACTCGATCCG GATCACGACG TACGCAAAGG CAGCATCATG TCAAATTTTG CTGTTGATAG CGATCATCTT CGCCGAGAAG ACATGGTAGA CCTTCGCAAT ACAATCGTAG ATCGCTTCGC TTTTTATCAG AATGTGGCTG GGCTGGATGC GAACACTCAT TCAATGGCAA GCATCCCTTT CACTCTGCCA CTCGATCCCT CTGTGACTGT CAATTCGTTT GGCCGGTACG AACCCGAGTC CGTTTCAAGG CAAAGGCTCG AAATAGCCGT TGCCGTCCAG ATTCTACAGA ATCAGTTTGG CCAAGCCGTG AACGGCTATG GTCCGGCACG GGATGCACAA ACCTTGGATG ATTGTCTGGG ACTCAATCGC GGTTTTATCC CACCCATTGC TAGTCTGCCT GGAAATTCTT CGTCTCAAGC CTTCCTGCTT CAATCTTTGT TGGCAGAGTC TCAACCGTTT CGGAACGGCG ATTCCGACGC TTCTGGTAGT GCGGGTGCAG TGCGGTTCTG GTGAATAGTA GGGCCCGTTC TTTTGAATCT TTTTTTTCGT CCAATTTACA CTTTATCCAA GAAAGTGACT GTGAGTATCT ATTTTGGCAA ATTTTGCTTG TTAGCCCACC TAAAATAATA GAATCTTTTT TGTTGCACAT CGTGGCCTTC GAGATTCCGT TCTTCGTTCA CCGGTAATCC CGTACGCAGT AGCATTCAAT GTCAGCGAGC TAGATCGATG CTGGTGAACT AACAGTAATG GATATGTCCT CCTTGCCTCG AG
|
Protein sequence | MDWAGDIPIS LRKEPFPVKL SYMIDEAERN GKESIISWRP HGRAFFVSNP KAFSDVVLPR QDQVPSVICK GMQQLLTDSL PVRRVKNIIN PNLLLASLTH HSSEIFAPEG SDKGSYYHEF FLRGQPKLAY AISRMAIKGN GSRKPVSQES QPDFYLLPAM TSEKSAPVVS NDQRHKHLAA TAAGAQESKS GGIYSEYDSS ATVAQDKSSQ REEIRSGAGT SICNGARQHP LVVPRDGQST WPASHHMQAA SHALPNKEMQ ALYAPIVPQR DTRAQLDPDH DVRKGSIMSN FAVDSDHLRR EDMVDLRNTI VDRFAFYQNV AGLDANTHSM ASIPFTLPLD PSVTVNSFGR YEPESVSRQR LEIAVAVQIL QNQFGQAVNG YGPARDAQTL DDCLGLNRGF IPPIASLPGN SSSQAFLLQS LLAESQPFRN GDSDASGSAG AVRFW
|
| |
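The length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database's own tooling. It assumes the Start bp and End bp values are 1-based and inclusive (which matches the stated gene length of 1682 bp) and that the coding sequence includes one stop codon after the 445 codons of the protein. The function names `span_length`, `coding_length`, and `gc_fraction` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 120468
END_BP = 122149
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue).

    Assumption: one stop codon is counted when include_stop is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between 10-base blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
# Bases in the gene span that are not accounted for by the coding sequence.
non_coding_bp = gene_bp - cds_bp

print(f"gene span: {gene_bp} bp")
print(f"coding sequence (with stop): {cds_bp} bp")
print(f"remaining non-coding: {non_coding_bp} bp")
```

Under these assumptions the gene span is 1682 bp and the coding sequence is 1338 bp, leaving 344 bp of non-coding sequence within the span. That difference fits the record marking the gene as spliced, although the record itself does not say how the remainder divides between introns and other non-coding regions. `gc_fraction` can be applied to the Gene sequence field for comparison with the reported GC content.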