Gene Information |
Locus tag | PHATRDRAFT_38668 |
Symbol | |
ID | 7203360 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 653307 |
End bp | 655304 |
Gene Length | 1998 bp |
Protein Length | 626 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182730 |
Protein GI | 219124897 |
COG category | |
COG ID | |
TIGRFAM ID | |
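The length fields in this table can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length of 1998 bp. It also assumes the gene span includes the stop codon, as the terminal TAG in the gene sequence suggests. The function names are illustrative and are not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_len = span_length(653307, 655304)   # coordinates from the record
cds_len = coding_length(626)             # protein length from the record

print(gene_len)            # 1998
print(cds_len)             # 1881
print(gene_len - cds_len)  # 117
```

Under these assumptions, the 117 bp difference is the total non-coding sequence inside the gene span, which agrees with the record flagging the gene as spliced.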
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0793283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCATC AGCAACTACA GGGAGAAACT CCGTTTTCAT TTCCCGCCTT GGATGGGGAA ACAAGAATTG ATCTCGGCGC GTACCAAAGA CTAATCGTGA CGAATTTGGA CGTCAATGAG GCAGGCCGGG CTTATTTTAA CTTAGGTCTA CGGCTCATGC TTTCATACCA GCACGAAATG GCCTCCAAGT GTTTCCTGGC ATCACTAGAA AACAGCCCAG ACTGTGCTTT GGCCCACGGT CTTTTGGCAC TATGTCATTC GCCGAACTAC AACTTTAAGG GTGAAGCCTA CTACGAGTCA GCCTGTCACT ATGAAGACAC AGACAAGCCT GATCTGCTCT GCGTCTTTCC TTCTCAGCAA GTCGCCGATC GACACAGTCG AATGGCTGTG GAGAAAATTG AGGAGTTGCG CAAGGCACAC CGTAAACGCA AGGGGAAAAA GAAACAGAGG ACGGTTCCTT CCAATAACGG CGAAAAGCTA CCTTCTGTAA TATCGGATGT AGAATGTCAG TGGCTTGCGG CGATTCGTGT ATTGACGAGT TCTCCGGGTG TCGACCCAGA CTTGAGCCAC GATATTGTCG GTCGACCCTA CTCCGACGCC ATGCGAAAAG TATACGAAAA GTTCGACAAC GATCCAGAAA TCGCCTACGG TTTCGCGGAG TCATTGATGG TTTTGAATGC CTGGCAGCTA TACGAGTATC CATGTGAGTC ATATGCGCAG TCTAAGCGAT TGTGATAACT ACGCGTCGGT TCTTCGCCGT TACATACATT TGGAAATAAT GTCACTCACA ATGCTACTGG TTTCTTGGTT GGTTTCTCAG CCGGCAAGCC GCTCAGCCCG GATGTAGTGG AAACCCGAGC TGTGCTGGAG CGTTCGCTAA AAATTCATCC GCATCACGCC GGTCTGTGCC ACATGTACGT GCACCTTTCC GAAATGTCAG CGCATCCCGA AAAGGCCTTG GCTGCCTGTC AGCCGCTCCG CGGAGAATTC CCCCATGCTG GACATCTGGT GCACATGGCA ACGCACATCG ACGTCTTGCT GGGTGACTAC GAGTCCTGTG TGCACTTCAA CTGTCAAGCC ATCCGGGCCG ATCGACATGT CATGGCGAGT AGTCCGGCAA CGGCTGGTAA GGAAAGTTTT TACTTTGGAT ACATTGTACA CAATTATCAC ATGGCCGTAT ATGGGGCCAT TCTCGGAGGG ATGCAAGGGA AAGCTATGGA ATTGGCGGAC GAGTTGAACG AACTTATCAA CGAAGATATG TTCCGAGAGT TTCCCGATTT GACGTCATAT TTGGAAAGCT ATGCAGCTCT GGAAGTGCAC ATTATGGTTC GTTTTGGGCG CTGGAAGGAG ATCTTGGAGT TAGAATTGCC GAAGGATCAG CGCCTGATGT TGTTTCGGGC CTGTACTCTG CGGTACGCCC GAGGCTTGGC GCTAGCTGCT CTAGGCCGCG TCGAGGAAGC CAACAAGGAG ATGATGACGT TGGATGCGTT GCGGGTTGAT CCCGAAGCGA CGATGCGAAT TTTGCACAAC AATACCATTT TTGATTTGCT CGCGGTAGAT TCTGTAATGC TGCACGGGGA AATTGCCTAT CGAGAAGGAC AATACGAAAA GGCGTTTGCA CTGTTGCGGC AGTCCGTACA AATGCAGGAT GACTTGGTGT TTGACGAACC GTGGGGTAAG ATGCAACCAA TTCGCCATGC CTTGGGTGGA TTATTATTGG AACAGGGACT CTTGGAAGAG GCTATAGCGG TGTTTCGAAA AGATTTACAT TTTCATCCCA AGAATCCTTG GGCCTTGGTT 
GGTTTGATTG AATGCTTGAA ATGTCAACAG CCATGTTGCT GCGAAGCGAC CGATCGAAAT GCCGAGATTG CTATGCTGCA ATCACAGCTT GCAATATGTC GCAGTGGTGA GCTGGCTGAT TTTGATATAG AAGTACCGTG CGAGTGCTGT CAACGTTCAC CGGGGCAAAA TACAAACGAA ACGCAAATCT TGGAATAG
|
Protein sequence | MRHQQLQGET PFSFPALDGE TRIDLGAYQR LIVTNLDVNE AGRAYFNLGL RLMLSYQHEM ASKCFLASLE NSPDCALAHG LLALCHSPNY NFKGEAYYES ACHYEDTDKP DLLCVFPSQQ VADRHSRMAV EKIEELRKAH RKRKGKKKQR TVPSNNGEKL PSVISDVECQ WLAAIRVLTS SPGVDPDLSH DIVGRPYSDA MRKVYEKFDN DPEIAYGFAE SLMVLNAWQL YEYPSGKPLS PDVVETRAVL ERSLKIHPHH AGLCHMYVHL SEMSAHPEKA LAACQPLRGE FPHAGHLVHM ATHIDVLLGD YESCVHFNCQ AIRADRHVMA SSPATAGKES FYFGYIVHNY HMAVYGAILG GMQGKAMELA DELNELINED MFREFPDLTS YLESYAALEV HIMVRFGRWK EILELELPKD QRLMLFRACT LRYARGLALA ALGRVEEANK EMMTLDALRV DPEATMRILH NNTIFDLLAV DSVMLHGEIA YREGQYEKAF ALLRQSVQMQ DDLVFDEPWG KMQPIRHALG GLLLEQGLLE EAIAVFRKDL HFHPKNPWAL VGLIECLKCQ QPCCCEATDR NAEIAMLQSQ LAICRSGELA DFDIEVPCEC CQRSPGQNTN ETQILE
|
| |
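The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for normalizing such a block-formatted string and computing its GC percentage follows. The helper names are hypothetical, and the short input is a toy example built from the first ten bases of the gene sequence, not the full record.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = normalize_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


toy = "ATGCG GCATC"                 # first ten bases, split arbitrarily
print(normalize_sequence(toy))      # ATGCGGCATC
print(gc_percent(toy))              # 60.0
```

Applying `gc_percent` to the full gene sequence, pasted with its spaces and line breaks intact, should give a value comparable to the GC content field in the Gene Information table.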