Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44426 |
Symbol | |
ID | 7197667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 511487 |
End bp | 513403 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178527 |
Protein GI | 219115463 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGGCG GTTTGACTCT GCGTCTGGCA CGTGAGGGAA GTATCAAGAA TCTTACCATG GCGGTTTTGT TGGAGACGGA TGTGGGGGAC ATGGTGATAG ATCTCGATGT CGAAGGTTCC CCGGAGTTAA GTCGGAATCT CTTAAAGTTG TGCAAGGCAC GATACTATAC AAAAACTTTG ATTTACAACG TCAATTCGCG CTTTTGTCAA GCGGGTGATC CGATCGGTGA CGGCACCGGT GGTGGGTGCA TGTATGGTCT CCTGGATCAG CATTCTTCAT CACCCTTGGC GGATATTAGA CAATCGCAAC GACGATTTTT GAAAGGGAAG GGACGCAAGT TGAGTGCTTC GGAATGTCGA GAAAAAGGAA GGGTGGTTGC GACGGAATTA AATGGGGTGC CCGACACAAT CGGATCTCAG TTTCTGATCA CTCTCGAATC CGGAGAGGGC CGTGCACTTG ATGGATACGT TAGATCATCA ATCGTCTCAG ACGAGGCCTC GAAGGAGTCT GGAAAATCGG TCTCTTCGTT CCTTTCGCTT GGCACGGTTA CAGAGGACGT TGATGGTGCG CTGGACAAAA TCGCAGCTTC ATACTGCGAT CCCGACGGTC GTCCCTATGC CGATATTCGT ATCATTCGAG CCCTCGTAAT CCACGATCCC TTCGACGATC CCGAAGGTAT GGACGAGCTG CTGGCAAAAC GTGATGTTGT GGTTCATGCG GAAAGCGGAC GCGTAACGTC GTCCCCGTCG CCTGAACGAC CTGTGGAAGA AGCGGTTGCG GTTCGGATTC CCATCAGTCA GATAGAACCA GACGAGGAGG GTTTAACAGA AGTGGAGCTC CGTCGTCGTG AGGAACAAGC TCAAAAACAA GAAGACCGGG GCCGTGCGGT TGTCCTCGAA ATGCTAGGTG ATTTGCCCGA CGCCGACATT AAAGCCCCCG AAAATGTGTT GTTTATTTGC AAGCTCAACC CCATCACTCA AGACGAAGAT CTGGAACTCA TCTTCAGTCG ATTCGATCCT ACAGTGAAAG TAGAAATTAT TCGAGACCAA AGCACGGGTA AGTCTTTGCA ATATGCTTTC GCCGAGTTTG AGGAAAAACA GCAGGCCGTG GAGGCTTATT TCAAAATGAA CAACGCACTG GTGGATGATC GCCGTATAAA GGTTGATTTC AGTCAATCAG TGTCGAAGAT TTGGAACAAA TACACGCAAA AAATGCGCAT GCCGACTGGT GGTCCGGGCG GGACTTTTCA AAACAGTGCA AGCACGGGAC CCGGTGTCTC GAGAGGGCGG GGTGGAAGGC ATTCGTGCCA AGGTGGCAAC TGGCAAGTTC GCAATGATGA TCGACATCGT CCTTCGGAAA GGTCGGATAT GCGCAGCTAC TATGGAAGTC CCAATCGATC GAATGATGAC CGTCACCGGG ATAGCCGTCG GAGCGCCGAC GCACGCAAGA TGGACTGGCA ACGAGACCAC CGAGATAAAA GGGAGCGCCA GAGAAGCGAC GACCCTTCCA GTCATTCGAA AGGGGATAGG AACGGATCTA GGCACCGAGG CAGGGCGGGA GACGGTGATA GACGCAAGCG TGATGAACGG ACCCACGAAG ACCGAGAAAA TGATCGCGAA CGTCACTACC ATACGGAAAG AGATGCTGGT CGCATCGATG ATGGCCGGAG TCACAGTGTT AGGCGTGAAT ATGACCGTGT AGAGCGAAAA AGGAAGGATC GTACCGGTTA TGACGACAAT TATCACAGAC GAAGCAAGTC TAGCCACCGA CACCGAGACG AAGGACACAA GGAGAGGAGG CGGGATAAGG AAAGAAGCTA CTCTGGTGAG GACTCGCATA GAAAAAGCGA GCATCGGGAT CGAGACCATG AGAGGAGCCG TCGCGATAGC AAAAGAAAGA GAAGGAGTCG GAGCTAA
|
Protein sequence | MNGGLTLRLA REGSIKNLTM AVLLETDVGD MVIDLDVEGS PELSRNLLKL CKARYYTKTL IYNVNSRFCQ AGDPIGDGTG GGCMYGLLDQ HSSSPLADIR QSQRRFLKGK GRKLSASECR EKGRVVATEL NGVPDTIGSQ FLITLESGEG RALDGYVRSS IVSDEASKES GKSVSSFLSL GTVTEDVDGA LDKIAASYCD PDGRPYADIR IIRALVIHDP FDDPEGMDEL LAKRDVVVHA ESGRVTSSPS PERPVEEAVA VRIPISQIEP DEEGLTEVEL RRREEQAQKQ EDRGRAVVLE MLGDLPDADI KAPENVLFIC KLNPITQDED LELIFSRFDP TVKVEIIRDQ STGKSLQYAF AEFEEKQQAV EAYFKMNNAL VDDRRIKVDF SQSVSKIWNK YTQKMRMPTG GPGGTFQNSA STGPGVSRGR GGRHSCQGGN WQVRNDDRHR PSERSDMRSY YGSPNRSNDD RHRDSRRSAD ARKMDWQRDH RDKRERQRSD DPSSHSKGDR NGSRHRGRAG DGDRRKRDER THEDRENDRE RHYHTERDAG RIDDGRSHSV RREYDRVERK RKDRTGYDDN YHRRSKSSHR HRDEGHKERR RDKERSYSGE DSHRKSEHRD RDHERSRRDS KRKRRSRS
|
| |