Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49093 |
Symbol | |
ID | 7195323 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 561498 |
End bp | 566522 |
Gene Length | 5025 bp |
Protein Length | 1260 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183770 |
Protein GI | 219127077 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0396405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGATTCCAAA CTCACATATT CCTACGGAAC TTATCACGAA GCCCATACAA AGCTCGCAAC CTTGGATATT TCTTCCGCAA ATTGTAACTT ATCCAGTGGA CCTACACTTG ACACTGCGCG TTGTCGCTTA CAAAGTTTAC CCTGCTACTA GCTGTCGGTC AACTGGCTTC TTATCGTATT CTCCACTTCT TCCACAACAC CTCTGTCATG GTTCCTCTTC CGCAAGAAGA CGGTGAGCGA CGGCCTCTCA AGCAGGGGGA CTACGGCGCT ATTCGTGTCA CAAATGACCA CGACAATTCT CTCGAAATGG AAGAAGCGGA TGACGGCCAG GGGAAAGATC AAGCTTCTTC CGTGCCTCCT CCTTCCTCCA CATCCAGCGA CGCTGCTCAC CGCGTTCTCA TCAAAACTCA AGGATCGTTT TGGGAGGATG CCCGGGATTT TGCTCCGGGA AGTATTCCCC ACTCTGCTGT GCTTGCTTTG ACTATTGGTT GCGTCTGCGG CGTCGCCGCC TGGTTGTACT ATATGTTTCT GGAATGGGCG CTAGATTTCT TGTGGCACGA CTTGCCGGAA CGATTCGTCA TTGACTACTG GCCCGAATAC TTGTATGTGC TGTGGATTCC TCTCATCGGT TTCGGTATGG CTGTTGGATT GGGTATAGTG GTGAAGGTTA TGGGCGAACC GGGAGATCTA CCGTACACGG TCAAGTGTGT GCACGAGAAG GGCTACGTCG CAATGAACCA CGTAATGCCC ATGGTTATGG CGTCACAATG CAGTATTTTA GCGGGAGGAT CGCTCGGCCC GGAAGCGCCG CTCGTGGCGA TTTGTGCCGC ACTCGGTGGC TTCGTCAGTC GCAGCGTTTT CGGAATGACG GACCGGAATC TAGTGCGCAA ACACACTCTC ATGGTACGGA CGATTTGTCA GGACAGGTGG TCTTGTCACG GGTGGCTCCG GTGGTTTGAA ACAACATTCT CACTGCGACT TTCGTTTGTT TCGTAGGGTA TGGCGGGAGC CTTGGCCGCA TTCTTTGGCT GTCCGCTTGG AGGAAGCTTG TTTGCCATGG AGGTAAACAG CCGATTCGGG ATCGAGTACT TTGAACATAC TCTCGAAGCA ATTTTATGCG GTGAAATCTG CCTAGTCGTA TTTCGCGCCT TGGCCGGTCT GCCTATTCAA GCAATCTGGG ATATTTCGGA CGACAAATTG GAAGCCACTT CCGTAGTGGA TATCCTATAC GGATGCGTGC TGGGGCTCTT GGGGGCTGGA GTAGCCTGGG GGTTTGCCAA CTTTCACTGG AAGGTCATGC ATATGTTTCA GCACTTCAAT CTACTGAAGA ACGAACGCGC AATTCACCGT GCCATGGCGG GTGCTGTCGT GGTAGTGCTG ATGGGGATGC TCGTTCCGCA GACCATGTTC TGGGGCGAAT TTGAATTCGA AACCATTGCC ACAATGGCTC CAACCAAAGA TCTGATGCAC ATATGGCCCA CATCAGGATT GTTGGGGTTC GAAATGGATT CTTTCGGATC AGCAATGATT GTTGGCGTCG CAAAGCTGAT TGCAATTTCT TTTACGGTAT CGGGTGGATA CCGTGGAGGG TATATTTTCC CAGCGTTTGC TGCGGCAGCC GCCTTGGGAC GCGCCATTTC TATTCTTTGC CCCTTTATTC CCGTCCAGCT CTGCGTTTTG TGCATGGCTG CTGCACTGAA TGTAGCCTTG ACTCGAACCG CATTGGCCAC GACACTCATT CTAGCGTATT TGTCTGGCGA GCAAAACTCG ATCTCTGCCA TTCTAGCCGC GTCACTGGTG TCTCTCTTTG CGACCGGCTA CATGCCTTTC ATTCGGTCGC AAATGGTGCG AGCCGATTTG GACGCGAGCT TGTACTATTC GGAAGATTAT CCCGAAGAAG CCTTTTCTAT GACACCCATG GCCGATGTCC ATAAGATGTC CAATACTACT ACCACTCCGG GAGCAGTACA AGGACGCAAA AGCAACGGTG GTGTGCAAGT AGTGTAGTCG GACAATCGTT TTCCGGAAAC TGGATAATGG TCCTAGCTGA CGATGCACGC ATGATTTTCC GGCAGCATGT TAGTTCTCTT TACGTACCGA ACTTTGACTG CTCTCTCTGT CAGTCCGGCA CGTGGGCGAG TTTTGATTGT TCACATCAGG CGTCTGACTG ACTGTGAACA AATTTTGCAA TAGTTTTGGT TACACATTCT AATATATTTT GATTTTTTAT TCGGCATTTT GTGTATTTCT TATTATCGTG ATACTACTTT GGGTTTTTCT CCCTTTTTTC TCTTTTTCCC GTCAGATTCT CCACTTGGTT CACACACAGT CAGAAGAGAG ACCAAACGGA GAGGAATGTG AATTCGAATC AAGCAACGAA CGTCCGACTA CTGGCAGTGA GCTATCGGCC TAGCAAACTT TATCTACTTT CTACATTACC TATTTATGCC AAGATCCTAC CAGAATGATA CCGAATGGCC TTTGCATTAC GCCTGTCGAG TAGGCAATCT AGAACAGGTC CAATACTTGG TAAAACAATC AAAAGCGAGT ATCGGAAGCA AGGACGATAT AGTGAACCAG CACGACGACA ACGATGCAAC TCCTTTGTAT TACGCTGCAC TGACGGGAAA TACTGACATT TGTCGTTTTC TGCTAGAAAA GGCCGGCGCA AGATGTCAAG GTCACGATGG TGCTCGAGTG TTTTACGTGG CCTTGACTTC AGACCTACGG AAATTACTAC GCGTTTGGAG TTTATCAGCG GCAACGCGCG ATCCATATCT AGATCGTTGG CAAGATGCCT TTGTGCAGGC ATCCGAATCA GATCAACAAG GGGATTGTGT GTACCATCGC CGTAGCAGCT TCCAGCATAA CTCAAAACCT TTCTGTCGGT CGAAATCGAT TTATTTCCAT CGAATTGTTG TCCAAGCTCG GTGCCCCGCA TTGGCTGACT TCTTACACGA GCACTACCCG GAGTCTGCGG AGGATAGTAA AGCTGCAAGC GAACTACGTT TGGATGAATA TCCTGAGGCC GTCGTTTGCG GATTTCTGGA ATATCTGTAC ACTGGGGTTT GGTATATTGA GAATCTTTTG GAGCTCCGAT CACAAGCACT TGCTCTCGCA TCCAAGTTGG AACAACAAGA TTTAGTGCAA ATTATTCATA CAAAGTGGCC GGACATGGCA AAACGCATTG AAGAGGATGG CAAAGGTTTT CAAACTTCTT TGACTATGGA AGTTTCGAGC ATTCCACGGC TGCGGAAGGA CTTGCGGAAA TTGGCACACT GGGTGTCAAC ACCCCATCAC GAAATTTCTT CCTTTCATCG ATTGAACCAT CTGTTAACCT GGAGTGATGC GACCATCCAA TGCCGGGATT GTTCCTGGTC AGTGCACCGC TTTGTCACGC AATCAGTATC CGAATTTTTG GACCGCGCCT GGTCGGGGTC CTTTCTAGAA GCACAGGAAG GCTCTTTGAA TGTAAGCGAA ATGCCCTGCA CACCCGAAGC CTGGTCGCTT GCAATTCAAT GGATGTACGC GGATCAATTC CTGACGTCTG TCGATTCGGA CACTGCTCTG GATGTCCTGG AATTGGCGAC GACTATCTTG TGTCCACAAC TGGCCTCGTA CGTGGCGAAC ATTGTGCTGG TTTCCCTAGT CGAATTGGAA AACGTTCTGG ATCTTTTGCT TCTCGCTCGA TCGTACGGGT TTGACAAATT GGAAGATTGC TGCATTGGGG TTGTGGCGTT GCACATAGAT AAGCTGGTGG GCTCAGACGA CCTCCGCTCC ATCTTGACTG AGGAGGCTGC CGAAATTAGT CAAGGTGGCG ATGTGCGGGT GCCGGATGTT CCCGTTGCTG CTGAAATACG TCGCGCTATC AAGGAAGCCG ATTTTGACGA CGAGGATCGA AATGCCCGGC TCGAAGTCGT TCAATTCCTT GTGGATGAAG CCTTGCAGCA TCTTACACAA TAGTGTATAT ACATCAATTG CAAGATATTT GGCCGGAAAC AATTTTGGGA ACAAGTTGTA TCTGTTGACA GTGAACCAAC CCTTTTGGGT CACTTCTTTC GCGTTCAACT CCTGAATAGC AAATGCTATG CCAATGAAGC CACAGATTTT ATTTTTTGGA GCTTGTCAGC CGTTCGCACC TCCATGGCAA AATAATAATG GAGAAATTGC TTTGGGTAGC GTTGGCAATC CCAGTCGTGC GTGATCGAGC GGTTTTACTG TAAAGTTCAC ATCGTCACGA TTGCTCCTAC TCGTTGGATC GCCTTCCTTG AAGCGTCTCT ACTATTGGGA AAGCGAAGTT GATCGAGTGA GCTCATTGAG TTGTTCCTCG CCACAAACCG AAGTACTCGA AATCTCCAGC AACTCTCCTT TACAGACAAA TTGACTGAAG ACATGAACTT CTCGATTGTC CTCACAGGGA GAGCTACTGC AATTCTAGTA CTAGCCTGGG CGCAAACGTC GGTGACGGCC TCCAGCTGTG ACGATTCTGT CCATTGTTCC GCTGGATATT ACCTTGAAAG CGCTTCTTTA CGCAGCGCTT CTGCTCCATG TATTGCGATC GGAAAAGGGT TCTGGAGCCC GTCGTTTGAT AACAATCGCC ATTCGTGCCA AATCCTACCG CTCTACGTGG CAGCTAGCGT TGCCTTGCCT TTTGCCGGTA CAACCGACGA TGTAACTGGA TCGGAGCGTT ATCGTTTGCG TGAGGAAACC GAAAGGTTGT CTGCTTTCCG GGACGCGTGC GAGCGGCCTG GCGGGAGTTC ACTAACTTTC ATGGAAACTC CACCTGCTAT GACGTCGAGA AGATCCACTT CTCATAACAG CGTCTACATA CCGCTAGAGC CTTTTCTTTC CATTTTCATA GTGGTCATGC TAGCCATCTT TTTTACGGCC ATTCTCGATA GAAGAGCGAA CGACAACGAG CTAGAACTTG AAACGCTTCG AGCTCAGACG GCACAAGAGA TTGCAGCCTT ACGAAAGATG CTTTTGGATT TCCAGTTGAA GGCAGACCGT AATTGTATAG ACTAA
|
Protein sequence | MVPLPQEDGE RRPLKQGDYG AIRVTNDHDN SLEMEEADDG QGKDQASSVP PPSSTSSDAA HRVLIKTQGS FWEDARDFAP GSIPHSAVLA LTIGCVCGVA AWLYYMFLEW ALDFLWHDLP ERFVIDYWPE YLYVLWIPLI GFGMAVGLGI VVKVMGEPGD LPYTVKCVHE KGYVAMNHVM PMVMASQCSI LAGGSLGPEA PLVAICAALG GFVSRSVFGM TDRNLVRKHT LMGMAGALAA FFGCPLGGSL FAMEVNSRFG IEYFEHTLEA ILCGEICLVV FRALAGLPIQ AIWDISDDKL EATSVVDILY GCVLGLLGAG VAWGFANFHW KVMHMFQHFN LLKNERAIHR AMAGAVVVVL MGMLVPQTMF WGEFEFETIA TMAPTKDLMH IWPTSGLLGF EMDSFGSAMI VGVAKLIAIS FTVSGGYRGG YIFPAFAAAA ALGRAISILC PFIPVQLCVL CMAAALNVAL TRTALATTLI LAYLSGEQNS ISAILAASLV SLFATGYMPF IRSQMVRADL DASLYYSEDY PEEAFSMTPM ADVHKMSNTT TTPGAVQGRK SNGGVQVVSY QNDTEWPLHY ACRVGNLEQV QYLVKQSKAS IGSKDDIVNQ HDDNDATPLY YAALTGNTDI CRFLLEKAGA RCQGHDGARV FYVALTSDLR KLLRVWSLSA ATRDPYLDRW QDAFVQASES DQQGDCVYHR RSSFQHNSKP FCRSKSIYFH RIVVQARCPA LADFLHEHYP ESAEDSKAAS ELRLDEYPEA VVCGFLEYLY TGVWYIENLL ELRSQALALA SKLEQQDLVQ IIHTKWPDMA KRIEEDGKGF QTSLTMEVSS IPRLRKDLRK LAHWVSTPHH EISSFHRLNH LLTWSDATIQ CRDCSWSVHR FVTQSVSEFL DRAWSGSFLE AQEGSLNVSE MPCTPEAWSL AIQWMYADQF LTSVDSDTAL DVLELATTIL CPQLASYVAN IVLVSLVELE NVLDLLLLAR SYGFDKLEDC CIGVVALHID KLVGSDDLRS ILTEEAAEIS QGGDVRPQIL FFGACQPFAP PWQNNNGEIA LGSVGNPSRR ATAILVLAWA QTSVTASSCD DSVHCSAGYY LESASLRSAS APCIAIGKGF WSPSFDNNRH SCQILPLYVA ASVALPFAGT TDDVTGSERY RLREETERLS AFRDACERPG GSSLTFMETP PAMTSRRSTS HNSVYIPLEP FLSIFIVVML AIFFTAILDR RANDNELELE TLRAQTAQEI AALRKMLLDF QLKADRNCID
|
| |