Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42942 |
Symbol | |
ID | 7196775 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1585903 |
End bp | 1591659 |
Gene Length | 5757 bp |
Protein Length | 1474 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176812 |
Protein GI | 219110121 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTTT CTTTGGGAGT AGCTTCCCCA GAACAGCCTG TGGCCTCTGT GACGAAAACG CTTGCTAAGT TGTGTTCAAT CGCTTCGAAT CGCTCGTGTG CCGACTGCCG TGTTACGTTA ATAGATTCCT CCCAAGTGCA CGCTTCCTTT AATCCCAAAA TCGAACTGCC TTCAAGCCAA TACAATAATT TTCGGCTCAA TCATGTCAAT TTCGCCCCTC CTAACTATTC TCCTTCAAAA ATACCGGAAA CTCTCGACCC ACCGGTTGAC CCGAGCCTTG TCGCGGCTTC CTACGTCGGC GGACACGGCG TATTCGTATG TGCCCTCTGC GGAGCAGCGC ACAAGCTCCT CGGTAGAAGC ATTACCGTTG TTTACGCTGT CCAAGATTCT TCCACTTGGA GTGTTGAAGA GGTTCAGCTC TTGGTCAAAG GCGGCGGAAA TAATCGGGCA CGATCTGTTT TAGAAGCGCA TATACCTCAC TCGTGGAAGC AGAAACGACC AAATGCCGAC AGTACAATAG CGGATCGTCT AACATTCGTG CGAGCGAAAT ACGAGGCGCT CGCATTTGTG TTACCCCCAT CGGGTCCGTT TGCGAACCGG GCTTGGAGGG CGATTCTGGA TCGCCACCAG GATGAATGGG AAGGCAATTG GGGTGCTGAT TTACATTCGT TCTCAGAGCT GCAGTTGGGA GAGTCGCATT CTCAGCGTGC CTCGCGCAAT TTGATGCAGT CAGTCGAAAA CGGGAATCGT GAATCGATCC TTCCTAACAG ATTGGTAGAC TATTTTTGTG TGGTGACCCC TTCTGAATTC GTAATTCCTG ATATGTTGGG CTTGGACCTT TCGGAGCTAT CATCTCCGGA GGATGTATTG CTGGTTCCCG AGGTAACTGA CTGCTTTCCG AATCAGGAAG CTCACGCAGA TACTGAATTT CCCAAACACA TTGGCTCACT TGTCGTGCCC GATGGCTGCC GACCGAGTCT TACCCCACGG CCACCATCCT TCTTCTCCTT TGTATTGACG TGTGGCGACG GCGTACGCCT TTATGGCGGC GCCCTTTGCC TGTACGATAA TGATTCTGAC GTAGAAAATC TTAAGGAAAA GTTTCGAATC TCTGGCTATG AAGGACAGCT GCCAGGATGG ATGGGATCAG AAAGGTCGCG TTTGGTGCAT GATCAAGATA CCAGGAGCTC TTCTTTCCAG TCCGAATCTG ATGTCTATGT GGATGTCGTT TACTTTCCCA AGGTGCTAGT TGTCGTATCG CATTATCCGT TTTTCGACGT CTGGAGAAAA TTTTTGTTAC AAATTTACCG AATTGCGTTG GTCGGAGCCC CGTTGCCTAT TGAACGCTTC ATCGCCAACT TTGTAGGCGA AGTGCCGCTG CCCCCGCCGG GCAAAATTCT GGTCAAATTT GGTTTAACAG TGAACGACAC CTGGAGTATT GGACGACCTC CAGAGAATCA GCTACCACTT GCTGATTTTT CTTTTCAGCC ACTTTTTGCT GCGCTGTCGG TTTCGAACGT CATGGTGGTG ATAGGATGCT TGCTGGAAGA GGGACGAGTC GCTTTACTGT CGAGGCATTA CGCCATGTTA GCACCTACGG CAGAAGCGCT TGTGAGTCTT TTATTTCCGT TCCACTGGCA AGGCATGTAC CTTCCAGTCT TACCGTACAA TATGATTGAC ATCTTGGGCG CCCCTGTTCC ATTCTTGGTT GGTTTGCATT CACGATATCT CGTCGATGTA CCAGCCGGGT CAAGACCACG CGGTGTCGTT TTTGTGGATC TCGACCGTGA TGTCATTCAC TTGGGCTATG AATATGACGA AGTAACTCCA CGCTCCAGTC CAGCCCTTCC TGAACGACAG GCTTTGAAGC TTAAGTCAAA GTTGGAAACG CATGCGTCTG TAGCGTACGT TGAATGCGAT TCTGTGTTCT CTGCAGCAGC AACGAAGGCG GCAACAATTC TTACTGGCAA TGAAGAAGAG CTCCACAATA GCTACAGGGA TCCATACGCT CGAGCTTCGA AATCCGAATC ATGCCCGACT TCAGTGCGTA GGAAAGATAT TTTTGGAAGA CTGGACAGAG CCTACGCAGA CAACGAGTTA CAAGTTCCAA TCTCCGGCTT TCTTTCGGAG CATGGACAGT TCTATGAGCA AGATGCGTCG TCGACTCCGG ATTCCAAAGG ACAGCGCTTC AAATTTTTGC GCGCCGTGCG TCCGAGAGGA GGACTGAAAT CAAAGATATC AGCGGATTCC TCGGGTAGCG AAAGGTTTCA TGTCACTGGT TCACCTGGGA CAAAGTTGTT GGACAAGGAT GATCCATCTG GGTTTGACTC TTCCGAGATT CGAAATGCAT TTCTCCGGCT TTTCGTTTCG ATATTCCAAT CTTACCGCCA CTATCTCGAC AAAAATGGTT TTCGTTCCGA AGCGTTTATT GATTCGTTAA ACTGTTCAGA GAGGAGCTCA GAGTTTCTTC ATTGTGTGGT GAAAACACAG TTTTTCTCAA GGTTTTTGGA TGAACGGATC CAAAATCCTA GCGATCCAGA AATCCGTTTC TTCGACGAAT CCATTATCGC AAAGATAAAT CGGTCAAAAA AACAGACTTT CTCCAAAATT GGACGGGGCG GCGGGAAAAG GGAAACGTCC TTTCTAAAGG ATGATTCAAA TATGGTATGT AAGGAAAGAA TTCCTATTCT TTCGGTCTAT TGAAGCTAAC AAGCACACGT TCCTTATTCT TCGTAGGTCA ATGAGACGTT CACACCCCCT CCTCCCTCAA ACTGGGGTTT ACCCGATGAT GGTCGCACCT ATCACTACGG TTCCTTTCCT GCGTTGAACC CTGAGCTATT CGGTAAGGTT CGCCGTCCGA TGAAATGGCC CCGTTATGAT CAAAGATCAT CGGTTCGAAC GTCTCGTAGA ATCTCCGCAC TTGCGCAGGC AAATGAGAAG GCTTTTGTTG CGAGAGCATT AAAGCCCGTG TCGAAAACCC CAAAAATGTT GATGGCAGAG ACAAAGCGGG GTGTGAAAAA CCTCGATACT GCATTATCTG CTTTGTCAAG CCCATTTGTT CCATCTTCCC CAATGCGAGA GACCAATCGC GAAAGGAGAG CGACTGGTAG CTCTGACTTC TCAATGAGTG ATATTTCACT GTCGAGTGCA CTTGTTATGA GGGATGATAT TGTATCTACT GCAGAAGGAG TAGTTCTCAA CGCACGACGG AAGCAGGCAA TTCTACTTGG CATTATCATA AAACTGCAAA CACATTGCAG ATTGCATCTT GTCCTTAAAA AGGATCATCA GAGGGAAAGT TTCAGAGTGA GCGCTGAAGG TGATCGTAGC AATTTGGAGA CCAATGCAAT TATTTGTCTG CAGAAGTGGT ACCGAGTGCT ATCGGTGGGT CGTTCCACCA GAGAATGGTT CCGTAAACTT CGTTTTGCTA CATGCAGAAT GCAATCCCTT ACGCGTGACC GAATGGTGCG TGCCGGCTTC TTGTTATTGG TTACAGCCAT TTGTAGGGCG CAAGCACGTA TCCGAGGTTT TCAAACTCGA AAAATCGTTG CATCCATTGT TGAGCACCGG CTTCGTACCT ATAGAGAACA CATCTTCTTG CTTTGGAAGG CTTCGCATAC ACCTTTGAGC TATCGATGCA AGTTTTGGCC AATACTGAAA GTCGAAAGCT ATCTGCGCGT GCAGATGGCT GAGGCGGAGG TACATCGACT GTGGTCAAAG TTGAATATAC CTTTGCCAAC GAGCATGAAG AATTTGAGTA CAGAGCAGCC AAAGATCGTA AAGTCCGCTT CTTTGTTGCG CATGAACTTT GAAACGCATT GGGCTTGCAT TATTGTAAGT TTCCTTTCTT CAGAAAGATT TGCAAAACGA TAAGTTAGAA CTAACTCGTT TTTCCTTTGT CCCTATCAAG TTGAACATTG AGAAGCCAAT ACTAATTTTA GATTTTTCCC AAAAGGGCAA CAGTAAAGTA ACAGCTGGAC CAGAGGCCCA ACGCATCAAT TCAGAACGGG TTCAAATTTA CGACCGAATT TCGGCTTTGA AGCTCGAGCA GCAGCTGGCC CTATTTGATA GATTCAGCGT CCCGTTGAAG GGCAAGAGAA AGAAGTTAGC GCTGGCGGTG TCGATTTGTA GGTTTATTGA TGTCGAATGG TTTTCTTCAG TTTCATTACC TAAATGTGGA TCCCTCTCAC TCACTCACTC TCTCTTACAA CTAGGGCACG CCACTAAGCT TGCAGACGAA TCGACGGCTC TCATGCTGAC TCTCTTTACA GAATTGAATG ACTCCACAAA TATTAACTTT AAAGCGCCCA CAAAGAAGAG TTTGCGACGA TTTCCTCCAT CTCCCGCAAT CACTCCTTTG AAAAGGGCAT TGTGGGCCCA CGTGGCTTTG GAAGACCGTA TCCGTTTGAA CACTTTAGAG GTTGCTAGGA TAGGACTTCA TGATATCCCT CGCCTCTCAC ATAAGGTTGT CGGAAACAAT GCAGTGCAAA CGTCGGGGAA AAACCGCTTG GAGCTCATGA GGCGTGTCCT GAATTGTGGG GCTTCAGGAG TCAATACGGG CGAAACAAAA TCACTGCAGG CTAGGCACCT TTTCAAACCT CCCGCAATGT TCAATAGGAG CCACGCATCA AAATCTGATA GTATCCGCGA AGTTGAGTAG TATCTAGGAC TTCGATTCCA TAATATCATC AATATAACTC AAATAAAACC TTTGGATGTT AGTGAATGTA GAATAAGCTA GGAGCTTCAC TCCCTGGATA GGGGCTCATT CATTCCAATG TATTCTTTCG GAGCGAAATC ACTGACGTTT GATCAAAACC AGCGCGTTCT ACGGCGCTCA CCAACGCGGC AATCTGCTTC TCGATCTCGT CCTCAGAATC AACTCGGAGA CGCACGGTTG CTTGTCCACC TTTAGTACCA AGAGGCTTGA GCGCCGAAAA AGCATCCACA ACGCTGGTAG TAGATAGCAA ACTCCGATCG ATCGTATTGA TGCAAGCGAC ACACCCCATG GTAGGAATAT CAAGCTCGAC TTCGATATCT ATTTTACCCA AAACGTTGCT TTTCTGTGAC TCTTCTTTGT ATGTGTTCCA AAACTGTAGC CCTTCGGGAA ACAAGGCCAA CGACCAACGC ATGGTGGATA TCACAGCCCA ATTCGGAGCA TAGCGAGTAG TTGCTGTCAA GTAAACCAAC AAGGAAAGAA AGAACGGACG AATAGGCCCC AGGATTGTAT TAAATCCTGC GCACCCACCA ACCACTATGT TTATAAGCAA TTGAAGCATA CAACAAGAGC TTGCAAGGAG CGGTAAAACC ATGTCGCTCG CCACAGACCG AAAACTGGGA TGCATAGAAG AAAGAAGATG CAACAGCGGC TCCATGGTCG CTCCAGACAA CTGGATTAGG AAACGGGCCG AAAGAAGCGT ACCCGACACG AGGGCGACTT GCTTTAGGAC GCCGGGGGAC CAAACCAGAT GGTAGGAGTT GTCAACCGCC AAAGCCCGAG CCCGAATCTT CTGAAACAAC GTTTTGTGAG AATTTGGAGT CGCTCGAGAA GAGTGGTTGT CACGGGAAGC GCGAAGAGTA ATCAAATTAT CATTTGATCG ACATACTTGG CTTTGCTGGC TGGTTGGGAA AGCTACATCA GCGTGCACTT GGTGTTTTTG GCGGTCACAC CAGTATCCCC GAGAGACAGC GAAACAAATG AGCCACAACG ACCGACGCAT TTGGAAAAAA TTATTGAAAT TTTTTTGTTG TTAGGCG
|
Protein sequence | MSLSLGVASP EQPVASVTKT LAKLCSIASN RSCADCRVTL IDSSQVHASF NPKIELPSSQ YNNFRLNHVN FAPPNYSPSK IPETLDPPVD PSLVAASYVG GHGVFVCALC GAAHKLLGRS ITVVYAVQDS STWSVEEVQL LVKGGGNNRA RSVLEAHIPH SWKQKRPNAD STIADRLTFV RAKYEALAFV LPPSGPFANR AWRAILDRHQ DEWEGNWGAD LHSFSELQLG ESHSQRASRN LMQSVENGNR ESILPNRLVD YFCVVTPSEF VIPDMLGLDL SELSSPEDVL LVPEVTDCFP NQEAHADTEF PKHIGSLVVP DGCRPSLTPR PPSFFSFVLT CGDGVRLYGG ALCLYDNDSD VENLKEKFRI SGYEGQLPGW MGSERSRLVH DQDTRSSSFQ SESDVYVDVV YFPKVLVVVS HYPFFDVWRK FLLQIYRIAL VGAPLPIERF IANFVGEVPL PPPGKILVKF GLTVNDTWSI GRPPENQLPL ADFSFQPLFA ALSVSNVMVV IGCLLEEGRV ALLSRHYAML APTAEALVSL LFPFHWQGMY LPVLPYNMID ILGAPVPFLV GLHSRYLVDV PAGSRPRGVV FVDLDRDVIH LGYEYDEVTP RSSPALPERQ ALKLKSKLET HASVAYVECD SVFSAAATKA ATILTGNEEE LHNSYRDPYA RASKSESCPT SVRRKDIFGR LDRAYADNEL QVPISGFLSE HGQFYEQDAS STPDSKGQRF KFLRAVRPRG GLKSKISADS SGSERFHVTG SPGTKLLDKD DPSGFDSSEI RNAFLRLFVS IFQSYRHYLD KNGFRSEAFI DSLNCSERSS EFLHCVVKTQ FFSRFLDERI QNPSDPEIRF FDESIIAKIN RSKKQTFSKI GRGGGKRETS FLKDDSNMVN ETFTPPPPSN WGLPDDGRTY HYGSFPALNP ELFGKVRRPM KWPRYDQRSS VRTSRRISAL AQANEKAFVA RALKPVSKTP KMLMAETKRG VKNLDTALSA LSSPFVPSSP MRETNRERRA TGSSDFSMSD ISLSSALVMR DDIVSTAEGV VLNARRKQAI LLGIIIKLQT HCRLHLVLKK DHQRESFRVS AEGDRSNLET NAIICLQKWY RVLSVGRSTR EWFRKLRFAT CRMQSLTRDR MVRAGFLLLV TAICRAQARI RGFQTRKIVA SIVEHRLRTY REHIFLLWKA SHTPLSYRCK FWPILKVESY LRVQMAEAEV HRLWSKLNIP LPTSMKNLST EQPKIVKSAS LLRMNFETHW ACIILNIEKP ILILDFSQKG NSKVTAGPEA QRINSERVQI YDRISALKLE QQLALFDRFS VPLKGKRKKL ALAVSIWHAT KLADESTALM LTLFTELNDS TNINFKAPTK KSLRRFPPSP AITPLKRALW AHVALEDRIR LNTLEVARIG LHDIPRLSHK VVGNNAVQTS GKNRLELMRR VLNCGASGVN TGETKSLQAR HLFKPPAMFN RSHASKSDSI REVE
|
| |