Gene Information |
Locus tag | PHATRDRAFT_40612 |
Symbol | |
ID | 7198476 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 455047 |
End bp | 460692 |
Gene Length | 5646 bp |
Protein Length | 1097 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184617 |
Protein GI | 219128852 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACCCA CCACTACACC TAAAAGCCTC ATGGATTCGT TTCCTCACAC CACGCTAACA CCCATTGCAA CTACAACTTC GTATCCAACG TACGAAAACC TACGTAAAAT GCAGTGGGAG CTTAACGACA ATGCTGAATC TATCGAATCC GAATTCGGCG ACGGCAATCA CGGCCACATC TTTCTCGTCA TTCCCGAAGC CGAGTATCTC GAACTCACCG ACGGCATTCC TTGTGTGCCA CCCGAAAAAC CTCCTATCAA CGTTGACCAT CCAAATGGAG CCACCGCTCC TCAGATCACC GAAGCGAACC GTCGCAACAC CAATGAAAAA TTTGCGTACA AACAGTACCA CGACGCCACC AAGGCTATTC GCAACCAGCT CATTGCGGCC ATTCCCCTCA GCTACATTGA GTCCCTCAGT CACCCAACTC GCGGTTTCAA CAAAGTTCCT CCCATCGACA TCATTACCCA CCTTTGGGCA CGCTTTGGAA AAATCCGTTC CAGCGACTTG CGTGCCAACG AAAAACGCAT GAAAGCAGCC TGGCACCCCC CGACTCCCTT CCAGGACCTC ATCAAACAAC TCGACGATGG TAAAAATTTG CGGCGGCTGG TAAAGAAATC ATTGATGACC ATGCCCTTGC CCGAATGGGT TACGACATCA TTGACGACAC TGGCTTGTTT GACCTCGCTT GTCGTGAATA TCGTTTCAAA GACGAAATCG ACAAAACCAT GGCCACCTTT GAAGAGCATT TTCGTCTCGC TGACCTTGAT CGTACCCTCA CCGTTACCAC CAAATCCGCC GGCTTCCACG GAGCAAACCA GATGTCCGCT ACCACTACCC CCACTCCATT GAACACCGGT AAGCAGTCGT ACTGCTGGAC TCACGGCATT CTGAAGAATC ACAAACATAC CAGCTTAACA TGCGAGAAAA AAGCCGACGG CCACCAAGAC GCCGCCACCC TCCAGAACAA ACTCGGCGGA TCCACCAAGG TATATCAGTA CACGCCTCCC AAATAGGAAA GAGGGACGGC CAATTGCACG AGTGTGCCAC TGACAACTTA TCATAACAAA AATAAGTTTT CAGTTGACAC CAACACTCCT GCAATAGCTT CCTCCCCGCC TCATTTCCCC ACCATTGCCA TTGCCGATAC CGGATGCACC GGACATTATC TGAGCACCAA CATTGCCCAT ATAAATTCCA CTCCTGCCAA TCCTGGCATC ACCGTCACCT TGCCGGATGG CAGCACCATT GTTTCCAGCC ACGTCACAGA GCTCGACATC CCAGACCTTC CTCTTGAAGC CCGTATTGCC CACATTTTTC CTAAATTATC GTCGGGTTCC CTCATCTCAA TTGGACAACT ATGTGATCAT GGCTGCACCG CCACCTTTAC CTCTAGTGCA GTCACCATTT CTCTCAACGA AAAAATCATT CTTCGCGGCA CTCGATCTGC CCCAAACCGC CTGTGGAATT TGAATGCGCC CAGCGTGACT CCCACTGCCG CACCAATTCC TCCCCCTGGC TTTCCCGTCG CCAACCACCT TGAACATACC TCTTCCCTCT CTGATCGGAT TGCATTTGTA CACGCCTCAT TATTCTCGCC ACAATTATCC ACTTGGTGCA AGGCCATCGA CGAGGGCCGC CTGACAACTT TTCCCGACAT TTCTTCTGCC CAAGTGAAAC GACACCCTCC TCAGTCTGCG CCCATGCACA AAGGCCACTT AGATCAACAA CGAGCCAACA TAAAGTCCAC TCAGCTCAAG CCCTCTGCTC TACTTGCTTC GGCACCCCAC GGTACCGAAC 
ACGACGAAAA TCCAGTCCCC GACAACCCAC CGGCCCTTCG ATCGAACTTC TTGTATGCCG ATGCCTACGA AGCCACCGGA AAAATCTTTT CGGACCTCAC AGGACGCTTT GTGACCTCCT CCAGCTCCGG CAACGCATAC ATGCTTGTAG TCTATGACTA CGATAGCAAC TTCATCCACG TCGAACCCAT GAAAAATCGT ACCGGTCCCG AGATCCTGGC GGCCTACCGC CGCGCCTTCG ACCTTTTTTC CTCTCGAGGA CTCCGTCCCC AGCTCCAACG ACTTGACAAT GAAGCCTCCG CTGCCCTGCA ACAATTCATG ACTGACTCCA AGGTTGATTT TCAGTTAGTG CCACCTCATT TACATCGGCG CAACGCTGCC GAACGAGCCA TACGCACCTT TAAAAATCAT TTCATCGCCG GGCTATGCAG CACCGACAAA GATTTCCCAC TTCACCTCTG GGACCGCCTT CTCCCACAAG CAATCATGAC CTTGAACCTT CTCCGCGGCT CTCGAATTAA CCCTAGACTC TCTGCTTGGG CCCAAGTTCA CGGCGCGTTC GACTTTAACC GCACTCCACT GGCGCCCCCT GGCGTAAAAG TTCTCGTACA CGAAAAGCCG TCTGTACGCA AAACATGGGC TCCTCATGCC GTCGACGGAT GGTACATTGG CCCTGCCATG CACCACTACC GATGCCACCG AGTCTGGATC CACGGCACCA CAAGCGAACG AATCGCTGAC ACCCTCACTT GGTTCCCGTC GAAAGTGAAA ATGCCAACCA CGTCATCACA CGACACCGTC GTAGCGGCAG CCCGCGACCT TGCCAAGGCC CTTTCCAATC CCACTCCTGC CTCTCCACTT GCACCATTAG GCACCCAAGA GCGCGCTGCT CTCGAGCAAT TATCGAACAT TTTTTCGAAT TTTTCGGACC CGTCCATCAC TCTCGAAACG AAATCTCCTG CTGCTCCCTC GGTACCCCGA CCAGCCCCTG CAACAACCCC ACTGCGAGTC CAATTTCAGG ACCTGCCCAC GGAACCACTT CCGAGGGTGC CTCCCGTTCC CACTGCACCA GTTACACCGC ACACACTTCC GAGGGTGCCA ATTCTGGTCC CCGACACAGA AACTTACAAG CTCGTAACCT GTAACCCTCG TCAGGCCCGC CGTAGGGCCG CGCGCTTGGC AAAACAAGTC CTTGCCGACG CTAAGTCGTC TCCTCTCGAT CAGACTCCCC TTTTAACCAA CCCGGCAGCA CCAACGGCTC CCCATCACGG TCACGGTACG AGACTACAAA CCTCTCGCTT CCCAGCGCAC GCATTCTGGA CTGCCAACGC CGTCGTCGAC CCCAACTCCG GGGCCGCCTT AGAATACTCA AAATTGAAAA TTTCTACCGA AGGCGCCGAA TGGATCCAAG CAGCCGCCAA CGAAATGGGA CGCCTCTCGC AAGGCGTCCA ACCTCACATG CCCACCGGAA CAAACACAAT TCACTTCATT CCTCACACGG AGAAGCCCCA CGACCGCAAG GCAACGTACC TCAAGATCGT CGCCGCCATC AAGCCCCACA AGGCCGAGAA GTATCGCATC CGCTTTACTG TCGGAGGCGA CCGCATCGAA TACAGCGGCC CCACCAGCAC TCCAACTGCC GCATTACCCG CCATCAAAAT TTTAGTCAAC AGCGTCATCT CTACAGACGG TGCCCATTTC ATGACATGTG ACCTCAAGGA TTTTTATTTG GGCACCCCTC TCCCGGTGTA CAAATACATG CGTATACCGG CCAAGCACAT TCCTGCATGC ATCATGGAAC AATACAAGTT 
AGCCCCATTA GTGCATAACG ACAATGTCCT CGTCGAAATT CGCAAGGGCA TGTACGGCCT ACCCCACGCT GGACGAATTG CGAACGATCG CCTTCTCCAA CATTTGGCTT TGGACGGTTA TCATCAAGCC AAGCACACAC CCGGCTTCTT CACTCACGAA AGCCGTCCCA TTTCCTTTTC CTTGGTCGTC GACGATTTTG GAGTAAAATA CGTCGGCAAA GAGCACGCTG AACATCTCGT CCAGTGTCTC GAAAAATTGT ACACGGTAAC CACGGACTGG ACCGGTTCCT TGTACTGCGG TCTCACTTTC ACCTGGGATT ACAATGCACG ACACGTTGAC ATGGCCATGC CGGGATACAT TGAAAAAGCT CTCCAGCAAT TTCAACACAC GGAACCGACT CGACCGCAAC ATTCGCCGCA CGCTTGGGAA CCGCCATCGT ACGGCGCCAA GATTCAACTC ACCAGCGAAA CCGTTGTTTC TCCCCCGCTG GACAAAGCCG GCATCACTCG CCTTCAAGAA ATCATAGGGA CTCTTTTGTA CTACGCTCGC GCCGTGGATT CAACTATGCT TGTGGCCCTT GGCACCCTCG CGTCCGCGCA GACACAAGGT ACCGAAGCGA CTGCACAAGC AATTACCCAA TTGCTCAATT ATTGCGCAAC GCATCCGGAT GCAACGGTAC GATTCAATGC TAGCGATATG TTCTTACACG TTCACAGCGA TGCTTCTTAT TTGTCAGAAA CCAAGGCCCG GTCCCGATCT GGCGGAATTT TCTTTCTAAG TTCCAAACCC ATCAAGGACC CTAAACCGAA TTCGGAGCCA CCGATTTTCA ACGGTGCTAT TCATGTTCAC TGTTCTATCA TGAAATCTGT TCTTTCCTCC GCCACCGAAG CTGAACTTGG TGCACTGTTT TACAATGCCA AAGACGCTAT TGAATTACGT ACTACTCTAG AAGCCATGGG CCATCCTCAG CTGGCCACTC CTATCCAAAC CGACAACGAA TGCGCTTCCG GCATAGTAAA TGAGACCGTC AAACAAAGAC AATCAAAAGC TATTGACATG CGATTTTATT GGATTAAAGA CCGAGTCAAG CAAGGCCAAT TCAATGTTCA TTGGCGAAAA GGAGTTGATA ATCTTGCAGA TTATTTCACG AAACATCACT CCCCTTCTCA TCATCGACTC ATGAGATCTC GTTATTTGTT GGATTTGCAC AAACCTGCCT CCAAGCCAAG TTTGCCTGAG TCAAATTCAA GTTTGAAACG AGGGTGTGTT GATATGAAGA TCGAGCCCAA TCATCCTATC CCTATCAGTT ACGGTACTGA TGACAGTTAC CCTATCAGTT ACGGCTCTAC TGATAGTATC CCGATTCAAA TCACAAGTAC TTACAAGACA AGAACGCCAA CCGGGCATTC TAGTCTTCCC GCTGACATTT CAAACATGTC ATTGCCATTC AAACCAGTCA TTGAGAGTGT CTCCGAGCCA TACAAACCAA ATAGCGATTC TCAATTGACT CATAATTCTT ATTAGTTCAT CGATTTCCAT CATTCGCGTT CTTGCGTTTG TAGAGCTCAT CAATAAGTGG ATCGATAAAT AAATTAAACG GATCGCCCAT ACTCTTATCA GCTTTTTGTC AAACGATGTG GTATTGGTTA CGGCAAGGTC AAGAAAAAAA GAATTTTCAT GTGCTATAGT TTGGGCCAAA ATTCTTCCAA GAGATTCCAA GACGCTCGTC TTTTGTTCCA CTCCGATCGA ATGCATTTCC GTGTCGAGGA CCCTCGAGCC ATTCTTGCGA GTTGAAGATA AGGCAAGACG 
ATGGAGCCCC GTAGGCTGCC AAAGAACTGT TGATTCGGTT AAGAGAAATA AGCGAAGACT CCGTTCCAAG CCTCCCGACG AAACCGATTT CGTCCCAATC CGTCTGCCTC AGAGCTTTTT GGGATGCATG AATCGACAAG AACGACAGCA GAATAACAGC AGCTCCTGCA AAATAGACAA TGAAATAATT CGAGAAATGG CGATTCATTT CCCCAACTTT TGGTGA
|
Protein sequence | MTPTTTPKSL MDSFPHTTLT PIATTTSYPT YENLRKMQWE LNDNAESIES EFGDGNHGHI FLVIPEAEYL ELTDGIPCVP PEKPPINVDH PNGATAPQIT EANRRNTNEK FAYKQYHDAT KAIRNQLIAA IPLSYIESLS HPTRGFNKVP PIDIITHLWA RFGKIRSSDL RANEKRMKAA WHPPTPFQDL IKQLDDEIID DHALARMGYD IIDDTGLFDL ACREYRFKDE IDKTMATFEE HFRLADLDRT LTVTTKSAGF HGANQMSATT TPTPLNTGKQ SYCWTHGILK NHKHTSLTCE KKADGHQDAA TLQNKLGGST KFSVDTNTPA IASSPPHFPT IAIADTGCTG HYLSTNIAHI NSTPANPGIT VTLPDGSTIV SSHVTELDIP DLPLEARIAH IFPKLSSGSL ISIGQLCDHG CTATFTSSAV TISLNEKIIL RGTRSAPNRL WNLNAPSVTP TAAPIPPPGF PVANHLEHTS SLSDRIAFVH ASLFSPQLST WCKAIDEGRL TTFPDISSAQ VKRHPPQSAP MHKGHLDQQR ANIKSTQLKP SALLASAPHG TEHDENPVPD NPPALRSNFL YADAYEATGK IFSDLTGRFV TSSSSGNAYM LVVYDYDSNF IHVEPMKNRT GPEILAAYRR AFDLFSSRGL RPQLQRLDNE ASAALQQFMT DSKVDFQLVP PHLHRRNAAE RAIRTFKNHF IAGLCSTDKD FPLHLWDRLL PQAIMTLNLL RGSRINPRLS AWAQVHGAFD FNRTPLAPPG VKVLVHEKPS VRKTWAPHAV DGWYIGPAMH HYRCHRVWIH GTTSERIADT LTWFPSKVKM PTTSSHDTVV AAARDLAKAL SNPTPASPLA PLGTQERAAL EQLSNIFSNF SDPSITLETK SPAAPSVPRP APATTPLRVQ FQDLPTEPLP RVPPVPTAPV TPHTLPRVPI LVPDTETYKL VTYSPFNQPG STNGSPSRSR FLSNDVVLVT ARSRKKEFSC AIVWAKILPR DSKTLVFCST PIECISVSRT LEPFLRVEDK ARRWSPVGCQ RTVDSVKRNK RRLRSKPPDE TDFVPIRLPQ SFLGCMNRQE RQQNNSSSCK IDNEIIREMA IHFPNFW
|