Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44141 |
Symbol | |
ID | 7203893 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1105852 |
End bp | 1108643 |
Gene Length | 2792 bp |
Protein Length | 715 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186470 |
Protein GI | 219113773 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.609078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACTGTTGTCG CCACGACCAA GGAAGAGACT TGCCGTCCAA CTCCTCAGAA ATCATTCGCC AACACCACAT CGAAAGGAGC CTCGTAACCT ACCTGCCCCT CTCCATTGCA GCTTCCCTTC CTTTGGAATA TGTTCTCTCG TGGCCTTTGC ACGATTTTAT TTGCGGCACC CGCTTTGGTC TGGGCGACGA CCAAGGGCGA CCTGTCGAAT CCTGACGCCC AAAAGATCCT GCTCGTCGAG TCGGAACGGA TCGAAGAGTA CCGCAGTCGT AACTACACTT GGCCTTTGAA TAACTACAGC CCCAACACGC CTGGATGGGC TTCCTTGATG ATGGATCGCT TCCATCAGGT CGAAGAGATC GACGACCGTG GCCGACGTTA CGAAGCGTAC ATTCAAACAA TCCATTCTGC TTTCCTCGTG CCCAACTTTA CCGAACACGG TTTTGGCCTA GCACGGTGCC CCGAAGACCT TATAGAGGCT CTGCGTGCGG GAATTCGCGA CGGCCTTCCT AATGCCCGTT ACGAACAGAA TGTCGATGTG ATTAGCGGCC CGACGCCTCT CTTTATCGAC CGACCCGACC TGAGCCAACG GGTTTTGAAC GAGCTGCGCC ACTACGCCGA AGAATGGAGT GGGGTCGAGC TGACGCCTTA TCGTGCGTAT GGCTTCCGAT TGTACCGAAA CGATTCGCAA CTGATGATGC ATGTGGATAA GATGCAAACG CACATTATTA GTTTTATTTT GCACATCGAT TCCTCAGACG ACGCCGAACC CTGGCCAATC TTTATCGAAG TAAGTCAAAC TGAAGTGCGG CGCCGGAAAT CGTATCAAAG GACGTGTAGT CTTGAACGAG TATTTTCATT AACCGGTTTT TGTCTTCATT CTACAGGACT TTAACGGGAA CACTCACGAA GTAATCTTGA CTCCCGGCGA CATACTGTTT TATGAATCTT CCAAGTGCTT TCACGGCCGA CCACGTCGTT TCAACGGCGC CTGGTATTCC TCCATCTTTG TCCATTACTA TCCCAAATAC GGTTGGTACG AACAAAATCA CGAATTAGAA GCGCACTATG CCGTTCCACC ACGTTGGTCT AGTGATCCGA CGGGTGTCAA AAAGTACAAT CGACTCGAGA TGGTGGGCAC CTCTATGAAA GAACCCGACT GCGATGCCGA TTGGTGTCGA GCAGCCAATA GCGTCAAGTG GAGTGGACCG GGGGAAGTTG GTAAGTGGAT TGCACCCAGC TTCGAACGAT TTCCTTTCAA TCCGAAAGAA GGCGCCTTCG AGAATGAATT GTAATATTTA CAGTATGTTA CAGATAGTGA ATGTAAAATC AAAGGGTGTT ACAGATGCTG ATGCGGTTCC GAACACACCC TTCATCTAAT ACTCCAGATT TTAGGTTTGC ATACGGAGGC TAGCAACGAT ATTTAGGTAT CCAACATACC GTGGGGTATA GGCGTCGTAG CACGGCGCTA ACACTAAATG TCTAACAATG TCTCGATGGC GGTTGGGCAC ATACACATAT ACATGCATAC ATACACACAG GCAAAGCGCA CTCGTCAATT AGTGGCCTTT GCCTGGACAA CACCGAAGCC TCACGTTCTC GTATTCTATT TCCGTCCAGA CTTCTAACCA CTACGCAGTC GGAGAAATAG AAGTGATGTA TTCGCCTACT GACCGAAAGC ATGGGCCCAT CAAACGAAAA CGAGCCCCAG GTGGAGGTGG CGCAATTCTT CTCCTTGCTG GAGGAGCCGC ATTCACAACA CTTCTTTCCA TTTTTGGACC GACAATTTTT CTGGTCGAAC AACATCACAG CCAATCGAAG GATGTCCGTG TTGGTCTGGA GAGTCTGAGC AAAGTGGTTC GCGTGGCGGA TCTTAAGCAC GAAGAAATTT TCCGGTTAAC GATCAAAAGC TGTCTACCTA CACATAATCC AAAGTGCAAA CAATACATAC CTCCAACCAG TGGAAAGGAT GAGGATCCCA TCCAAAGAGT AGCGTTGGTC GCTCCCCCTG GAGATATCTC TAGTATACTG ATGAATCAGT TGGAACGAAT ACAACATCAA CACACTAATC TACAAAACAA AACAGAATCG GATATAGAAG TATTCGCCAC ATCGCATGTT CCACCTTATG GTTATGGCAA GACTCACGGT TGGACGAAAA TTGTGCGGCT CGTGCCAAGG GCATTGGTCT TGGAAGTGGT GGACGCACTA CAAAGCTCAT TGCTGTCTTC CGATTCTCAC ATGGATTTAA CTCTAGATGA TCTCAAGGCT GCTCTGAGGC AAATCTTACG CTTCCATTGC CGACTTTCTC ACGTTGCCGC TCATACAGCT CTCTTATCGA TTCCTTTGGT GGATCTGATT GCCAACTCGG CTAATGTGAG CAGACACATC CAAGATTTTC TTGTTTCCAG GGATGTAGGT CGTATGAAGG GAGGGGATGA CGACGGGGTG GAGAACGCCG ACGATGACAG TGGATCAGTA TCTGCGACAC AAGAATTTTA CGGTTCCCAA ATGCTTACTT ATATTCAATC AGTTGCGCAT GTGGATGTCC TGAAGGTGCT GGACGAGGTA CTGATAGAGG AGATGAATCT AAGCAGAAAC ATGACTGTTT GGCCGTGTCT ATCATTTTGG GCGGCTGGAG AAAGAGAAGA CCCATCGAAA CTCACTAGAT TCACTCAAAA CATAGCCAAG GAGCTCTCAC CGGAGTGCAG CGATCCTTTC GTCTCTTGCT TTGTGCCAAG GGACATATGC GAAGCCTCTG GGAATGGAGT GTGCAAGGGA CACAAGCGAT AG
|
Protein sequence | MFSRGLCTIL FAAPALVWAT TKGDLSNPDA QKILLVESER IEEYRSRNYT WPLNNYSPNT PGWASLMMDR FHQVEEIDDR GRRYEAYIQT IHSAFLVPNF TEHGFGLARC PEDLIEALRA GIRDGLPNAR YEQNVDVISG PTPLFIDRPD LSQRVLNELR HYAEEWSGVE LTPYRAYGFR LYRNDSQLMM HVDKMQTHII SFILHIDSSD DAEPWPIFIE DFNGNTHEVI LTPGDILFYE SSKCFHGRPR RFNGAWYSSI FVHYYPKYGW YEQNHELEAH YAVPPRWSSD PTGVKKYNRL EMVGTSMKEP DCDADWCRAA NSVKWSGPGE VVGEIEVMYS PTDRKHGPIK RKRAPGGGGA ILLLAGGAAF TTLLSIFGPT IFLVEQHHSQ SKDVRVGLES LSKVVRVADL KHEEIFRLTI KSCLPTHNPK CKQYIPPTSG KDEDPIQRVA LVAPPGDISS ILMNQLERIQ HQHTNLQNKT ESDIEVFATS HVPPYGYGKT HGWTKIVRLV PRALVLEVVD ALQSSLLSSD SHMDLTLDDL KAALRQILRF HCRLSHVAAH TALLSIPLVD LIANSANVSR HIQDFLVSRD VGRMKGGDDD GVENADDDSG SVSATQEFYG SQMLTYIQSV AHVDVLKVLD EVLIEEMNLS RNMTVWPCLS FWAAGEREDP SKLTRFTQNI AKELSPECSD PFVSCFVPRD ICEASGNGVC KGHKR
|
| |