Gene Information |
Locus tag | PHATR_44062 |
Symbol | |
ID | 7204013 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 844734 |
End bp | 852613 |
Gene Length | 7880 bp |
Protein Length | 1943 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186422 |
Protein GI | 219113677 |
COG category | |
COG ID | |
TIGRFAM ID | |
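The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the coding sequence ends with a single stop codon. The function names `span_length` and `coding_length` are hypothetical helpers introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information table.
gene_len = span_length(844734, 852613)   # genomic span of the gene
cds_len = coding_length(1943)            # coding bases for 1943 aa plus stop (assumption)
noncoding = gene_len - cds_len           # bases in the span that are not coding

print(gene_len, cds_len, noncoding)
```

Under these assumptions the computed span matches the listed Gene Length of 7880 bp. The coding portion is shorter than the span, which is consistent with the record marking the gene as spliced. The leftover bases would correspond to introns and any other non-coding sequence included in the span.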
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.284084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TATCTGTACC GAACAACACT CGGTGGCCAC AAAAGCTCGA TTCGGCGCCA TCGAAAAAGT GGTTCTGAAG TCTATGACTG GAGACACATC GATTGTACCG TGTAGAGATG GGCGTTCACG GCTGGAATTG GTTTCTGAAG GAGGAAGGAT GGTTGCCGGA TACGCGGCAA TCGTGTTGCT CACAGTCATT GTGGAGGAAT CCCTCCAGCT TGAGTGAATC GGATGCGTTC GCATCGCGGA TCGTTCCCAT TCCGGACCGC TCCGTGTTGC ATGTTGATGG AAACGGCTTG GCATTTTTCT TGCATCGTGC TGCGTACGCG CGCCACCTCG CGAAGGTGAT CAAAGGCGGA AATAACAAGC AATTGCGGGC GCACTCTAAA GCTTGCACAG TGCCCAAACA ATCCTTGTCG ACCGAGCAAG TGCGTCGGCT TTTGCCCAAC TTTATGCCTC TCACGCTACT ATCGGATGCT GTTCGGGAGT TTGCGTCTCA ACTGCAGGAT AAGCACGGCA TGAAGTTGTT GGTGTACTGG GATGGTGAAC AACGGCGAGT TTTCAAGCAC GAGACGGATA AACATCGTCA GACGCGTCGA CCCGAGGAAT GGTCCAACGT TCAGCACTAC TGCCAGTACG GCATGATGCC AGCGTCGAAA ACAATTTGTG AATGGGAGTA CGTGTTTCCA AAGAGCCGAC TATTCACCAC GCAGATTATG AACGTACTAC AAACCTGTGG TGCAGAAACG ATTCATTGCG AAGAAGAGGC CGATCACATT ATCGCCGAGC GGGCTTCCAA CAACCCGAAC CACTATGTTC TCGGCAACGA CTCAGACTTT TGCTTTTTTC CGAACGTGAA CTATATTCCG TTCGCGACGC TGGATGCACA ACACAACACG ACCACGGCCT GTGTTATTCG TCGAGCAGAG ATTGCGGAAT CCATTGGTCT TCCGGAGGAG CTCATGGTTG AGTTAGCCAT CCTAATGGGG AACGATTACG TGGACCCTCG AGGCGCCAAG CTGAATTTCC ATTCTCGCAA TGTTTACGAT CTAGTGGAGC ATCTTCGGGA TCAGGTGGAG AGTTATCGAG TCACTAGCCA GGTGGCCGAG ATTGAGGAGG CTCTGCGCTT TATCCGGATT CTCTACAACT TTGGTGATTT GTCGGAATTT CCGTTCGACG AACGTGACTC GGCTGGCCGG GTCATTGATG AGAACGATAT GGCCTGTGAT AGCATTTTGG ATAACATAGC TCTGCGTCCG ACGATTCCAG ACGAGGTTCC TCTCGAATTG GCAAAACTTC GTCCTTTCTC AGATATCTCC ATAAAGGACG CCGCTCTCCC TGTTTACAGA CTTTTGTTGA TCAAAGTGCT GGTAACGAAG CAAATTCCGC GATGACAATG ATTCGACAGG AGCATCTGGA TGTTTTTCGG GATATGGCTC TGACAAAATG CGCGAGTCCC GGGATTCAAA AAGGAGCCTG GCGTCCTCTA TGGGAAGATG TACCGGCGGC ATATTTGATT GAAAAGATTG TTGCTCATTC CTTTGAGGAA AGTGGCGAGT CTCCTGTAGT GCGGGTCTAC GATCCCTTTC AGATATTTGA TCAACATATA TTTCATGATT GTTTACGCCA GCGAAGGGCA CATGAAAGGA CGCGAAAGAA AGACCCTGTG TCTAAAGCGG TCGAACCTTC GATCAAGCCA AGAACATCAA CAGGGGAAGA GCGATTAGTT CTTCCGGTCG ATGAACACGA GGAAATCATT CTTGCATCGG TGCGCACGAA TCGAGTAACT ATTATTCAAG GTGAAACTGG 
ATGTGGGAAG TCATCTCGCA TCCCGATTAT GCTTCTGAAA GCACCGTGCC CGATCCCGAC GATGAATCAG TCGAAAATGT ACATTTCCCA GCCTCGTCGC ATCGCAGCGA AAGCACTAGT CGAGCGAGTT CGATCTGTCG AGCCGGAGTT AAAGAATGCC ATCGCTTTGC GAATGGGGCA TGGAGTTTGC GAATACGAGT CAAAGAAAAC AAGGGCATGG TTCGTAACAA CTGGATACCT TGTACGAGTG TTGGCGAATC ATCCTGAAAA ATTTGATCGC ATCTCGTATT TGGTAATTGA TGAGGTACAT GAGCGTTCGG TTGATACCGA CATACTTTGC CTTCTCTGTC GTCGTCTGCT AAAAGTTAAC CAAAACATTC GTCTGGTTTT AATGTCAGCA ACACTGGCTG CGTCTCTCTA TCAGGAGTAC TTCAATACCA CTGAGCCCCC AATTAAAGTA GGGGCTCGTC GGTTTCCCGT CAAGGAGGTT TTCCTGGAAG ACCTGCAAGA ACAGGTGGCA TTTTCGCAAA AGGAGGAGAA GACAGTGAAT CAACTCGCAA GCGAGTGCAA CAAAATGAAA TGTGCGAAGC CACCATCACT GAGCTACATG GAAAAAATTT TCTCTCTTGT TTCGCATATC GCCATGTTTG TAGGTCGACC TGGATCTTCA GTACTCATAT TTGTTCCTGG AATGAACGAA ATCGTCGCCA TTACCGAGCT TGTGGAACAG CTATTTAAAC CGGGCGTCAG GTTCACTTGC TTTCCGATCC ATAGTGATAT TCCGTTTGAA GACCAGATGA TTGTTTTCGA TGCTACTGCA GAAGATGAGG TACGTGCTTC GAGGCAGTTC TTGATGGTGT CATTAATGCC GAGTCCTCAT TAAACCACAT ACAGGTCAAA ATCATTATCG CAACAAATGC TGCAGAGAGC TCGGTAACGC TTCCTGATGT TGATCATGTC ATCTGTCTAG GGCTTTGTAA ACAAATCGTG TACAACGAAG CTTCGCACAG ACAGATTTTG ATGCCTACAT GGATATCGAA AGCGAGTGCA ACGCAAAGGG CTGGACGAAC AGGCCGACTC CGACCTGGAA CTGTTTATCG TATGTACACT CGAGAAGTTT ACCGTCATTA CATGGATGAT TTCGAACCAG GGGAGATGGT CAGAATTCCT CTTGATTCCG TCATTCTCAT GTTGAAAGAA ATGCTGTCCG ACGAGGACGT CACAAAAGTG CTCCTGGATT GTCTTGAACC GCCCAACATT GCAACAATCG ACCGCTCATT TCAAAGCCTC TACAAGTGTC GATTTATCAC TTCGGCAGAA GGCGACTGCG AGATTACCTC ACTTGGATCA TTCGTTTTGG CTATGGGAGT TGATCTTTTG CTCGGGTCGC TTGTTGGTCT TGGTATACAG TTTGGAGTAG GCGCTGAAGC TGTTCAGATG GCCGCCGTTC TTTCCTTTCC CAAAACTCCG TGGATTATGT CAAACCCTCT GATTCATGAA AGTAAGGATT ACAACGAAAT GACCACAAAA ACGTACGTGT CCCGGTGCCA TTTCGACGCA AACCTTTTCT CTGAGCCCTT GTCGGCGATG AATCTAATTT GGGAATTTGA ACAAGCAAAA AATCAAGCAA ATTGGTGTTG GAAATTCGGT ATTTCCTATC CTCGGATCAA ACGGCTAGCT GCCACCAGTT CCAATTTTAG ACGCCGTGTG GCTGATTTTC TCAGTATCTC GACGAATGAC TTGGAATTAG ACGCTCCACC ACTTTTCATG CCTCATTCGA AAGTGACGCT TTTGCGTGTG ATCCAAGTTT 
GGGTATTCAA CGATTCAATT ATTGAATGTC GCCCCAGACC TTTCACGAAA GGAGTCTCAG GCGACAAGAT GAGAATAGAT CTGGGGAAGA ATAGCAAAAG AATTGATGAC GAGCATCTCC ATCAAATTCT TAGCAAGGGT CGCCATCCTT TCGCTCTTCA TGGATATCAA GAGATCGAAC AAAGGGGTTC TTTCGAAATA CAAGGTGCGA GCTCTGATAG TGGTATGGTG AGCGGAGATT TTGAGGAACG GTTTCTATCC TATGCCGTCG AGAAAGAAAT AAACGTCGCC TGGATTACTT CGCCTTCATC TTTTGTCCTA TATGTCCGTG CGGAGTCCTC TAAATCCAAG CGCTTTGCCG AAATCTTGAC AACCATGCGA CAAAGCCAAT TCAACGAATC TTCAAAATTT GCCTTTGAAA GCTCGAACAA GAAAGGAAGG GGAGAGAGAC CAGCAGGGAT GTGGAGCTTT GCCGATGTAA CCGATGGAAG GAAAGAAACA GGGACAATTT TTAAACGCTT TATGACAGAA TCTTTGAATC AGTCTCAGAG AAGATTTTTT ACAAAAGCCC TTTGCGATTA CCTGGAATCA CAACCTGCGC TCAGTGGTGT TTCCTGTGAT TTGCTGAAAA ATACAGCCAA ACCGATTCAA AAGTTCTCGA TTGTTTCTCG GGGGAGGAGC AACGCAATCT CGCCTGTTGA TCTGAACGAT TTGTTTGCAG CATCCGGGAT AATTACAAGC AAGGTAGTCA AGCAAGGTAA CCAAGCGATC ATTTTCTCCA ACTCTCCTAG CATGCCACTC GAATGCGGAA CAAAAGGAAA GCACTCGGCG AGCGTTGAAT CTAGCTGGGA CCGCCCCATC TTTCATCCCA TTCCAGAAGG TGCTAGAATT TTATCCATTC TGGCCTCTGG ACGGAGGAAG GAGCATGTAG TGCGCCTAAG CTCTTTGCCA TCTAAGGACA CACTGGCGAA TGGAACCTCC CCGGATAGCC TCGATATCTA CATGGAGCCT GGCACCACGA ATGTTTCTAA GCGCTGGAAA CGTTTTAACA CGGCCATCTC TGTTTACGTT CAGGAAAATT CCGTGCCTGC CTCAGCACTT CCAATGAATA CAGAGGCTAT TATGTATTGT TGCTGTGCTA ACACCCTTGA GGTCAGAGGG GGCGGTTTAC GTGTAGAAGG CCTTACTTTG TTGCCACCTG GACGCCCCTT TCTCCTGCTA TGTCGTTTGG CATTTGGGCT TTTCCATCAG AAGCATTCAG TAGAGGGAAC ACTGGAGGAT ACGTGCTTGC AGTGGGTATT GCAGGATCTT GGTAATGATT CGACTTTAGG CCTTTCGCAG GACGAGATGA AAATTCGGAT TAGAAAAGCT ATGTCATTTC ACCGCTCAGT TGGTGACCTA GGAGAACAAC TTGTGTGTTT TGACACGAAA GTCGCACCTC TTTTGGAAGT TTTTGATGGG GTCGATGGGT ATGAATCAAA GGTGTGGAAA GATCTGAAGA ATAATCCTAT GACAAGCGAC AACCTTAAGG CGTCTGCAAC AAAGTCGATC GATTGGAAGA GCCAGTCAGC TGAGCTCGTC GAAATGCGAA CGCTACCGAA GCAGCATGCC AGTGATCGGC TGGTTTCCAA AAAATGTGGT GAATTCGATT TAAAAAAACG AAGTCCCAAG AAGGACGAGT CTTCTGGTGG GGACACGGAT GATGATCTCC AAAGGATACT CGCAGCGACT CAACACTTAT CCGACGAAAA ATGTTTGAAC CATCAAGCAG AGAAAGTAAA GGCCAAAAAC CGGAAGAAGC GCGCATCTAA 
GGGCAATAGG GTGGTCCCTT CCATTGAACC TGTCATCTTC GAGCAGATCA CAAATCCGTT CGAACTTGTT CCGCAAAGCT TATTCCTGAC GGAGCTACCT GACGGAGTTG TTCCTATTGC CAACGAGCTT CACTCTACAA ACATCTTGGC GCTTGTTATC CAGCATTATA TGATGCATCT CAAAGACACA AGGGACCCTC CTTATTGTTT CGAAGGAGAT TGGTCACTTT ACCGTGGGGA GCTGGAGGGT AAACAGTACT TCTATGCAAG GTTTCTAAAT CGAAGTATAC TCTACAGGAA ACAGGCGAAC GGTATACAAG CCAAGCAATT TATAGTTCCT AAATGGATAC GAGAAAATGA ACCGCGCCCC TCATCGGTAA AGGACGCTAT CAATTGTCTC CCTCCAAGGT TCTCTGAGTT GCTAGGAATA TCTGTAGCTG CTATTGATGG TTGCAACATA CTGCTTTTCT CAGATGTGAC TTCCGCGATG CGAATGGAAT CGGCATTTTG GCTTGAACGA CATTTTCGTT CGTCTGCGAT TCATTGGTAC GAGCAGTCTC CTGTCACAAT GGTAGAACAC CTCATCCGAG GATATCAATG GTAGCGCTTA TTCGGAAAAC TCGTCAAGCG ACAGCCTAGC CGCTGAATCT TGTATTTGTT ATAGATAGGT GCAACAACAG CCAATTCTTT GTATTTCGAA AGTAAGTTTG ACTCTTCTAC GCATGCGGCG ACATTTCATT TGATGGAAAT ATGTTTCTGT CATCCTTCAT TGAAAAGACA AAGCCAATCT GCTTGTCGAA TCCCTCAAAT GACGAGCGGA AGGAGGGAAT TGTTTTATAA AGGGCTACGT GATTTTAGAT CACCTAATAC TTACAAGATT TTCGCTGTCA CCGTCAATTC TTAGGTTGTA CCGTCAAAGA TGCCTCACCA GGGGTAGCGT ATCAGGAGTC CGATTGACTT GGGTCACGCC ATGGCATAAG GAGCGACGGA GGCACGACCA TACGCACTCT AGAGTCATCA TCTGTCCTTA GAGCAAGAAA CGAAATCGGA GTCTCCTTCG AAGTGACGGA CGTTTCCGTC AAGTTGAATA TACGGATCGA AAGCATCGTC ATACATGCGT GAATTGAATG CCAAATTATA ACCAAAAGAC TACATCCCAC TATGTAACTC TTCGCAACTT CCAACGGATC CTCTTTTTAT TAGTCGTTGA AATCTGACTG GGAAGGCGAT GCAGAGATAT CCTACTAGGA TTGCCGTTGT CCTTGAATTG AGGTGTCAAT CCCTTCGTAC TCTTGCCATT CGTGCTTATA TCGACTAATG TACGCCAAAT ATTCTTCGTG CTCATCCAGC ATCATGTTTT CGGCAACTTC GTCTTCATGG TAAATTTTGC CTTCTTGAAA ATTTTGTATC CACGAAGCCC GAATTGACAC CAAAGCCATT GTCACAACTC CAACGCTGAG GAACAAGACG AAAGCCCAGC TGGAAGCAGC CATCCCCGTT GTACACATAG AGTCGTGTAT TGCTTCGTGA TAGATAGGCT GGACTCGGCT ACATCCGAGC GAACTTGACA TGCTATGGAA CGCTGTTCGA ATAAGTGTCA TGAGCCTTGC AGTGTCGCGA GTATCCACGA GAAGCTGTTC TATTTTTTCC CCACCGCACA TTTCAACGAG GTTGAAACGA CCCGCTGCAT CTACTGACGA AAGATGACGC CACGTGATGT CAACGGCTAC TTGAACGTTG GCTTCCAATC CCAAGAGTAT GCTAGAAGGA TCACTTGCAT CGCATCCTTG CGTAATTGCC GCAACAAACG 
CACCGACGAG ACCACCTTGT TCTACCCCTT TCGCCTGCAT AATGTGATGA ATGGTTTGTC CAGGATAGCC ACTAGCTTCA CCTTGAGTAC AAGCGTCATT TCCCATGGCT GCAACCAGTG CCGATATAAC GGCTACGATC CAGCAGCAGC AAGAAGAAAA AATAAACATG GGAAGTACGC CATACGACAG GAAAAGCTGA AAGCGAAAGT TCGATTCTTT TTTCCAAGAA AGAACGACAC CAAACAGTGC AAAGGCAATA AGAGTACTGA CTCCAAACAA AAGAGTAGGT ACAACCCAAA AAAATGCGCG AGCTCCATCT GATACCACCT CGAGAAAGTT GATTGATTCT TCGATACCAA AAAGTACTCT GTCAATATCA TATAGGGATC CTTTTATCGT CTCTTGGAGG ACCACATACT CCCCCCCAAA TAGATTAAGC ATGCCGCCAA TATCAACCCC CAGCTTAGTT CTCAATTCTT GTTCGGACGA AAGTGGGCAA AGAAGGTCGA CGCTTGCAGG TATCGTCTGG ATGAAAAAGA GAGTACCATT GATTGCTCTT TCGGCAGTAC TCGCCCATAC TGTTGCTTGA TCCAATACTT CTTGAGCATC TGATAGATAA GCACCCACAT CATTTGTTGT CTTGTTGAAA
Protein sequence | MGVHGWNWFL KEEGWLPDTR QSCCSQSLWR NPSSLSESDA FASRIVPIPD RSVLHVDGNG LAFFLHRAAY ARHLAKVIKG GNNKQLRAHS KACTVPKQSL STEQVRRLLP NFMPLTLLSD AVREFASQLQ DKHGMKLLVY WDGEQRRVFK HETDKHRQTR RPEEWSNVQH YCQYGMMPAS KTICEWEYVF PKSRLFTTQI MNVLQTCGAE TIHCEEEADH IIAERASNNP NHYVLGNDSD FCFFPNVNYI PFATLDAQHN TTTACVIRRA EIAESIGLPE ELMVELAILM GNDYVDPRGA KLNFHSRNVY DLVEHLRDQV ESYRVTSQVA EIEEALRFIR ILYNFGDLSE FPFDERDSAG RVIDENDMAC DSILDNIALR PTIPDEVPLE LAKLRPFSDI SIKDAALPEH LDVFRDMALT KCASPGIQKG AWRPLWEDVP AAYLIEKIVA HSFEESGESP VVRVYDPFQI FDQHIFHDCL RQRRAHERTR KKDPVSKAVE PSIKPRTSTG EERLVLPVDE HEEIILASVR TNRVTIIQGE TGCGKSSRIP IMLLKAPCPI PTMNQSKMYI SQPRRIAAKA LVERVRSVEP ELKNAIALRM GHGVCEYESK KTRAWFVTTG YLVRVLANHP EKFDRISYLV IDEVHERSVD TDILCLLCRR LLKVNQNIRL VLMSATLAAS LYQEYFNTTE PPIKVGARRF PVKEVFLEDL QEQVAFSQKE EKTVNQLASE CNKMKCAKPP SLSYMEKIFS LVSHIAMFVG RPGSSVLIFV PGMNEIVAIT ELVEQLFKPG VRFTCFPIHS DIPFEDQMIV FDATAEDEVK IIIATNAAES SVTLPDVDHV ICLGLCKQIV YNEASHRQIL MPTWISKASA TQRAGRTGRL RPGTVYRMYT REVYRHYMDD FEPGEMVRIP LDSVILMLKE MLSDEDVTKV LLDCLEPPNI ATIDRSFQSL YKCRFITSAE GDCEITSLGS FVLAMGVDLL LGSLVGLGIQ FGVGAEAVQM AAVLSFPKTP WIMSNPLIHE SKDYNEMTTK TYVSRCHFDA NLFSEPLSAM NLIWEFEQAK NQANWCWKFG ISYPRIKRLA ATSSNFRRRV ADFLSISTND LELDAPPLFM PHSKVTLLRV IQVWVFNDSI IECRPRPFTK GVSGDKMRID LGKNSKRIDD EHLHQILSKG RHPFALHGYQ EIEQRGSFEI QGASSDSGMV SGDFEERFLS YAVEKEINVA WITSPSSFVL YVRAESSKSK RFAEILTTMR QSQFNESSKF AFESSNKKGR GERPAGMWSF ADVTDGRKET GTIFKRFMTE SLNQSQRRFF TKALCDYLES QPALSGVSCD LLKNTAKPIQ KFSIVSRGRS NAISPVDLND LFAASGIITS KVVKQGNQAI IFSNSPSMPL ECGTKGKHSA SVESSWDRPI FHPIPEGARI LSILASGRRK EHVVRLSSLP SKDTLANGTS PDSLDIYMEP GTTNVSKRWK RFNTAISVYV QENSVPASAL PMNTEAIMYC CCANTLEVRG GGLRVEGLTL LPPGRPFLLL CRLAFGLFHQ KHSVEGTLED TCLQWVLQDL GNDSTLGLSQ DEMKIRIRKA MSFHRSVGDL GEQLVCFDTK VAPLLEVFDG VDGYESKVWK DLKNNPMTSD NLKASATKSI DWKSQSAELV EMRTLPKQHA SDRLVSKKCG EFDLKKRSPK KDESSGGDTD DDLQRILAAT QHLSDEKCLN HQAEKVKAKN RKKRASKGNR VVPSIEPVIF EQITNPFELV PQSLFLTELP DGVVPIANEL HSTNILALVI QHYMMHLKDT RDPPYCFEGD 
WSLYRGELEG KQYFYARFLN RSILYRKQAN GIQAKQFIVP KWIRENEPRP SSVKDAINCL PPRFSELLGI SVAAIDGCNI LLFSDVTSAM RMESAFWLER HFRSSAIHWY EQSPIGATTA NSLYFESCTV KDASPGVAYQ ESD