Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42940 |
Symbol | |
ID | 7196773 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1578932 |
End bp | 1583368 |
Gene Length | 4437 bp |
Protein Length | 1337 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177324 |
Protein GI | 219111145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.544378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAAGTCGGC TGATTAAAGT GAATGGGCTG CTCTGTTTCT GAAAGCTAGA TCGTACAAGT AGTGCTACTC ATATAGCGAG GTAGTATTGC GTGTTTCATT ATCTTGCCTG ACTAGAAAAC ATGCCACCGG AGGCAATCCC AGCGCCTGTC GCTTCGCAAA ATTGGAAAAA CATGAACGAA TCCGACAAAG AACAGATTGA ATGGGGGTAC GTTTGGATGG AACACTGCCG CTTATGCTTC GTAGATGGAG AAAGTGGAAA TACTGGCGGA AATGTGGCGC TCAAAGAGTG GATTGACGTT GATGTACTGT ATGATGATGA CTATGATGAA ATGAAATTGT CACACGCCAA TCGTACAGGG CGTCAGCGAC CTCTTGAAGA TCTTCCAGAA GTCAAAGGTG ACATTGACGA TTCACTGTCA GCAAAATTTT GTTCCAGATG CAATTTGGTT CTGTATCAGT CGGGACCCAT GGGGAGCAAT TTGAAGGTAC GACTGAATGA AATACCTCCA TGTCAAGGTA CCACTAAGGA CTTCCTGGGC ACGGACGAGT GCTCTTGTAT AGTGTCAAGG TCTGACGCGT GCTTGATAAT GGGTGTAATG AGAACATGTG CTAGTGTAGC GCGTGTCCAG CATTGCTCAA TATTGCGGGA AAGTGATAGT CAGGAACCGA CTTTTACCTT GGTCATATCG GTAGCCTTTC CGCATTTACT ACATGGACCA GGTGGCCGTC GAAACATTAT GGCTCTGAAA GATTACAGAC GGGCTTACAA ACCCCTTCAC ACGACAACAC AACTGATACT ATCGATACTC CGCTCCGACT GGAAAAACTT GATATCGATG ATGGAAAAAA TGCGACTCGC CCCCACAATC AAAGTAGAAA ATCGTCGAGT ACCACGGTTC TTTCCCAACA GGCTTGGTCT GGACGAGTTA TACGAACGAA TTCAGGGTAC TGCCGCAACC CACATGATGC TCGAAACTCG TCGAGAAGAT CAGCTGCGAA ATCAACGTAA CGACTGTTTG ATAGTCCAGT TGCCAAAGGA TGTATTCACT GATTACCTTG CCCCGTTTTT GAAGCCCCGT TCGTTAGATG CCCTTCGTTG CTCTTGCTCA TACCTACACA ACACATTACG AGCGGTCGTC CCAGGTCTAA AATTACGATT ATACAGCCAC CAAGTGTCTT CTCTCATTTG GATGCGAAAT CGTGAGACCA ATTTACTTTC GGAGAAGGAT TGCCTTAGTA CCACACCCTT GCCGGACTCT CCTGATAGAG ACTTGCACCG AATGGTCACT GGTGGATACA CGACATTGTT ACGATCACGT TGCCCTTTGA GTGATTTAAG CATCCGTATT GATCAAAAGA CAGGCTTCGA AATTGTCCCG AAGGACCTTA CCACTTTACC CAGAACTGTG GCGAGAGGTG GTCTCTTATG TGACGACCCA GGACTTGGGA AAACCGTTAC AGTTTTATCA CTGATTATGC AAACAGCGGG ACTCTCGACG AAAGCGAATG ACACTGTAGA TACATCCGTT AGTAAGCAGT TGATGCCGAT GGAGGAAGCA ATATTCCGAA CGTATTGGAG CGAACAAATG ACCCCTGAAT TTCGTCGTCC ACTCTTGCAC CGCTTTCTGA ATGCTTTCAT ACGGCAAAGC CCCGGAGGTT TCATTGGTCA AGCTGATCAA GTCAAGAGAA ACATCGATCT AGACAAGTAC GATTTGAATT TTTCATTGTT TGAAAGAGAT ATGACGTAAG CTTTGTCGGA CCGCTTTTGA ATTCGAAAGC TGTGAGCTGG ATCTTACTGC TTGTTTCCTC TACTTTCTCA GGGAAGCCAT TTTCCCACAA ACTTGGGAAG GTAAACATGA AGATAACGAT AGCGATACTC ATGTATACCG AAACGCGGCA CGCCAGCTTA AAAGCGAGTT TCTGGCTAAT GTTGAGGAAT TTAAGCAAAC TCAACTTCAG TCAGCGCGCA AATCCTTCTC TTCAGTTTCG GCAAAGCCAA ATTCACGAGT CGCGGCTTTG CTGGATCGGT CGGAACAAAT GAAATTTGTG GATTCGCTGG TCTCGTCCTC AGCAACACTG TTGGTAGTCC CGGCAGTGTT GTTGGACCAT TGGCAGGTAA GGTTGTTTGG GCACAGCATT TCTTTTTCAC AAGGCTGAAG TCTCATGTTT CATGTTTTGA CTCCCGTTAC ATCAGGCTCA AATCGATTTG AACTTGGACC TTAGCTATTG CACTGATAAG ATACCATTAA TTTTTGAGTT TGGGAGAAAG CACAAAGGTT TAACAATGGA AGCCGTATGC GCCATTTGCA AAGACAATGG AAGTCACTTT CCCATGGTTT TTATAGATAG AGGTGGTACA AAGAAGTTGC CGGCGCCGGA GTTTCTTGCA ATGTTTCAAA TTGTGATAAC AACGACACAG CGCTTTTCGC AGGAATGGAG GAATGGTTCT TTTCAAGCGG AACTCAAGAG CAGCGGTTGC AAGGAAGTAT CAAAGTTGTA CCTTGATTCA GCTTTTGATC GATCTGAGAG TGCGTGTCCG CTACTAAAGA TCCACTGGCT TCGAATGATC GTTGACGAAG GACATTCAAT GGCGAAGAAC CAGAACTCGA CGATTCAGTT TGCATCGTGG ATTTCTGCTG AAAGAAGATG GGCGATGACC GGGACTCCAA CAAAACAATC CGCGACCCAG ATTCAGCAGA TCTACGCTAT GCTCCGCTTC CTTGGTCATG GTTTTTTCAC TCCTAGACTC GATGGAAACG CGGTTTGGAC GTCAAACGTT GCCCGGTGTT GGAAAGAGGG ATCGTTCGCA GCGTTTTTCC GACTCAGATC GTTGTTGGGT TTGCTTATGA AACGTCATAC AAAACGTGAT ATTGCAGAGC TGGAGTTGCC GTGTTGCTCA GCAGAGGTGA TTCCGATGAG TTTTGTTGAA GTAACGACTT ACAATACGTT GGTATGTGGT GTCCAATCAA ACATTTTGTT AACGTCAATG AGCGGGAAGA CGTCTGGACT GCAGGATTCT CTTCTTCACC GCTCTCAGGT TCAGCATGCT CGTGCTGCAC TTAGTAACCT ACGTCGTGTT TGTGTCGGCT ACTCCAGAGT ATTACCGACG CTTGAGACAA GGTTTTACAT TGAGACTATG GTCCTGCTGA AAGAGCATGG AAGGGATGAC AGACAAATTC AAAATGTGAA GGAATACCTC CACCGCGCCG AAGCTCAAGA GCTCTCAGAA TGCGATTGCT GCCAGATCAA GCTCAGCACG CTTCTCCTCT TTCCATGCTG TGGCGGATTC TTGTGCCCAG AATGCATGGA CGAGAAATCA AATATCTGCG TCCTTTGTGA CCAAGATTTC GACGTGGACG AATTTCAACG ATTGCAGCCA GGTTTCTCGC TAAAGTGGCT CGAAACGATG ATTGAAAGTG AACAACGCAA ACCGAAACCA TCTATTTCCA ATGCCAACGT GCAAGTCGAT CCTCCAGGTG GCGCGGATCC AATCGTGCGT GCCGAGATTG AACTCCCAAA CGGAGTTTTG GTTCGGCCGA ATATGGAGCT CCGTAGAACT CGAAGATTAG GGGATGGTCA CGAATGTCAA TATGATCGGT ATACTGTCCA TGGAAAATGC ATTCTGTGCT TGTCCGAACA CAGCTTCTGT AACCTATTTA ATGACAATGC ACAATGTGCG ATCTGCTTCC GGGCAGCTGA GGAGTGCTCG GAGGAGGAAT CAAAATCCTT TTATCTTGTG AAGAAGCTCT CTGAATTACA TCAACAATTA CGCAACAACG AGCAGCGACG TCCTCTGAAA ATTATCGTCT TTTCGCAATT TCGCCAAGCT TTGAACATGG CTGGGAACCG TCTTTTGCGT AAGTTTGGAA CTGCATGCAT TGCTGAATAT TGGGGCAGCT TTCGCACGAC TGAACTGCGG AAATTTACGT ACGACCGAGA TTGTTTCTGC ATGCTTCTAG GAAGAGACGG CAGCGAAGGA TTGGACTTGA GCTTTGTCAC GCATATATTC TTTCTCGAGG AAATCATGGA CCAGTCGCTT CGTGACCAAG CAATTGCTCG AGCCTGGCGG ATGGGCGCAA AAGGACGTGT GCGGGTGGTA ACTTTGACCG CGGCAAAAAC AGTAGAGGAG ACCATGCAAG AGATTGAGTC AGCAGCTCAA TACCGTTTCC AGTACTCGCA TCAAACTGTC ACACGCCCTA TTGTTGCAGC AGCCGAGTCA AACAATTTAG AGGAGTACGC CACGGCAAAG ACACATGCAT TGCTTCGTTC CTTACGACTC ATTACTGACT ACCACCATTT TTCGGCGGAG CCGCGAACAT CGACCGCAGA AAAAGCCACC TGTTTGAATA ACAAGCTACC CGGGATTTCG AAAGAAAGCG ACAAAACTGT TGACAACCCT CCAGTTAAGA GACGAAAAGT AACTTTCACG TAGCGAT
|
Protein sequence | MPPEAIPAPV ASQNWKNMNE SDKEQIEWGY VWMEHCRLCF VDGESGNTGG NVALKEWIDV DVLYDDDYDE MKLSHANRTG RQRPLEDLPE VKGDIDDSLS AKFCSRCNLV LYQSGPMGSN LKHCSILRES DSQEPTFTLV ISVAFPHLLH GPGGRRNIMA LKDYRRAYKP LHTTTQLILS ILRSDWKNLI SMMEKMRLAP TIKVENRRVP RFFPNRLGLD ELYERIQGTA ATHMMLETRR EDQLRNQRND CLIVQLPKDV FTDYLAPFLK PRSLDALRCS CSYLHNTLRA VVPGLKLRLY SHQVSSLIWM RNRETNLLSE KDCLSTTPLP DSPDRDLHRM VTGGYTTLLR SRCPLSDLSI RIDQKTGFEI VPKDLTTLPR TVARGGLLCD DPGLGKTVTV LSLIMQTAGL STKANDTVDT SVSKQLMPME EAIFRTYWSE QMTPEFRRPL LHRFLNAFIR QSPGGFIGQA DQVKRNIDLD KYDLNFSLFE RDMTEAIFPQ TWEGKHEDND SDTHVYRNAA RQLKSEFLAN VEEFKQTQLQ SARKSFSSVS AKPNSRVAAL LDRSEQMKFV DSLVSSSATL LVVPAVLLDH WQAQIDLNLD LSYCTDKIPL IFEFGRKHKG LTMEAVCAIC KDNGSHFPMV FIDRGGTKKL PAPEFLAMFQ IVITTTQRFS QEWRNGSFQA ELKSSGCKEV SKLYLDSAFD RSESACPLLK IHWLRMIVDE GHSMAKNQNS TIQFASWISA ERRWAMTGTP TKQSATQIQQ IYAMLRFLGH GFFTPRLDGN AVWTSNVARC WKEGSFAAFF RLRSLLGLLM KRHTKRDIAE LELPCCSAEV IPMSFVEVTT YNTLVCGVQS NILLTSMSGK TSGLQDSLLH RSQVQHARAA LSNLRRVCVG YSRVLPTLET RFYIETMVLL KEHGRDDRQI QNVKEYLHRA EAQELSECDC CQIKLSTLLL FPCCGGFLCP ECMDEKSNIC VLCDQDFDVD EFQRLQPGFS LKWLETMIES EQRKPKPSIS NANVQVDPPG GADPIVRAEI ELPNGVLVRP NMELRRTRRL GDGHECQYDR YTVHGKCILC LSEHSFCNLF NDNAQCAICF RAAEECSEEE SKSFYLVKKL SELHQQLRNN EQRRPLKIIV FSQFRQALNM AGNRLLRKFG TACIAEYWGS FRTTELRKFT YDRDCFCMLL GRDGSEGLDL SFVTHIFFLE EIMDQSLRDQ AIARAWRMGA KGRVRVVTLT AAKTVEETMQ EIESAAQYRF QYSHQTVTRP IVAAAESNNL EEYATAKTHA LLRSLRLITD YHHFSAEPRT STAEKATCLN NKLPGISKES DKTVDNPPVK RRKVTFT
|
| |