Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44298 |
Symbol | |
ID | 7197961 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 149577 |
End bp | 155576 |
Gene Length | 6000 bp |
Protein Length | 1736 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178175 |
Protein GI | 219114759 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCAAG TGACGGACGA TTCGGTGGTA TTTGTTCCAA TTGATAACGA CATTGACGTG ACGTCCACGC AGCGGGAAAC GACAGAGGCT CATCCGTCGG AGGTCGAGTC GAATGCAGCC TGCAGGGTCT CATCGGGAAC GACCCTTTCC GACCTTAGTT CTAACGCTGG TGAAGCCGGT ACGTCGCAAT CACAATCACA GCAACCTCCG TCACGTACAG CGGGTGCTAT GCACATGTCA CGGCATTCCG ACTTTACGTT ACCAACTTCG AACATCTTTT TGGGCTCTCT ACCAAACTCG ATCGCTAGTT CGTTCGACGA CAAGCACAGC GCGTCTTTGG TTCCGCGGCG TAATATTTTC GGAAGCACAG TCCAACAGCA ATACGGAAGG AATCACGTCA GCGACGGTAT GGGTAGTAGC GGTTTCGGCC CACCAGCAGC GGGCTCGCAA TACTCAACCA CTGCTGCGGC AGCAAATCCT GCTCTGACTC TTACAACAAT GCTACCAGCT TATTTGACAA CCGGTAACCC TTACAGTGCA CGCTCTGCTG CAACGCTTGC AGCAGAGAAC CACAGGCTTT TACCAGATGG GTTTTCTGGA CAGAATACGG ACCATGAGCA AACTATAAGC TCTCAGTCAT ATCCTCTGTC ACTGGAATCT GCGTACCAGA TTACGGCACA TCATCATCAT GACTACGACC GCAACGACTC GCTCATGTCT TCGTTGACCC TTCCCGATTC TCTACATTGT CGCAACGGCA TATCCTTTTC GGCGGCCACG CAACTACCGA TAGCACCGTC GCCTCTTCTG GAGCATGAAA GAATAAAGCA TCATAGTCAC CGGGTCCACG AAACCCTTCA CGCACAATCA CTACTACTGG GCCTGGCTTT TTGCATGGTC TGGAGTGCGT CAAACCTTAT GGCCCCAAAC TTGACCGAAA TGGCCTTATT CTACGATTTT GATCACGTAC AACGCGATTT ATATCTTGGT TCATACTGTG CACTTGCCAC AGGAGTATTA AGCTTCCCCA TTGGAGCTGG CATTGGCATA TTGGCGGATG TGGTGTCGTC TCGTCAACGT CTCTTTGTGG CAACAGTCAC GGGAGCGGGC TTGTCCAGTA TAGCCACGGC GTTCTATACC GACGCCTACT GGCAGCTCTT TCTGTGTCGC CTAATCAACG GTGGGTTCAT GTCTGGCTCC GTGCCCGTCG TGTTTTCGTT TTTGGGGGAT TTGTTCGCGA CGGAAGAACG CAATACAGCC AGCAGTGGCT TGACCGCCAT GATGGGGCTG GGAATAATAC TGGGACAAGT GTATGCTGGA ATGATTGGTG GAGGGATTTT GTCAGGGACT GTCTTGGACG TAAACGAAGC AGAGACCGAA GCACACACAG CGTGGCAACA CGCGTTTGTT GTGTCGGGTA TTTTAACATT AATATCGGCA GTCTTGTGCG GCTGTTTGGT TCAAGAACCC GTGCGGGGCG GCAAGGAAAA GGTGCTACAA GATATGCTGC AGGCGGGAAC GCGGTACGAA CGGAAATTGT CTTGGCAAGG GTTCTGGCAT GCTATGCGAC ACAACCAAAG CAATCTGATA CTGCTATGGC AGGGGTTCTT TTCTTCGCTA CCGTGGGGTA TCATATTTGT TTTTTTGAAC GACTATCTGT CGCAAGAACG CGGTTTTAGT GTGCCGGAGG CTACCTATTT GGTGCTGTTG TTTGGTTTGG GCTGTGCCGC GGGTGGAATT CTGGGCGGCT ACTGGGGACA GAAAGTGCAA TCTTACCGTC GGTCCTATTT ACCACTATTT ATGGCCGGAA CGACGGCTGC CGGCATTTTG CCGTTTGTGG GTTTACTCAA TACGCACTTC ACCAATGCGC ACGGCGTTTA CGGGTTTGTA TTTAGCTTTG GTGGTGGTCT CGTAGCGTCA TTGCCGGCGG TCAACGTTCG TCCCTGTCTC ATTAACGTGA ATCCACCGGA AACGAGGGGA GCCTCGTTGA CGGCGGCTAA CCTACTCATT AACTTGGGTC GCGGATTCGG ACCGAGTTGC ATTACCCTAA TGGGATCGAT TTTTAATGTG GATCGGCAGT ATGCGTTTAA TGTCACGGTA CGTTTCGAGG TGATATGTGT GTGTGTGTGT GTGTGTTTTC GGTGCAAATG GCCGGAGATC ACGGGTCGCT TCTTGATTGT TTTGTCTCAT TCGCCTACTT CTGCTTTGGT CGTTGTCAGT TGATTGCGTT TTGGTTGATT TCGGCTCTCC AATTATCGTT ATTGGCCCGT ACATTACCAA ATGATCAGGA TGCAATGGAG GCCGAGTTGG CCTCGTACGC GCAGAAAGCC ATCCGGGAGC AACAAGGCCA ACACGAACCC CCCGCGTGGG ACGATACGGA CGGTTCGGAC CGTGACGAAA AGGAGGAGGA TGACGACAAT GACGGTGCTG GTGATGCGCC CAATCTGGTG AGTTTGGAAG AACGCATGAC TTCGTGGGAC GGCAAGGCGG CCCGGGAGAC GTTGGAATAC GTGCGTCGGG GTGTCCGTGA GTTCCGGGAG GAAGTAATGT CCGGCGGTTC CAAGGCTCTG TTGGGTCCGT TCGGATGTGC GGACGGTATC AGTAGTGAAG ACGAAGAGGA GGAGATGACC ACCCCCCGCA CCATGCTCCG TAAACGGGAA CAATGGAAAC GCGCCCAAGT GATTCGCAAC CAAATTGCCA TGGACGGTGG TACGAGCGAC GCGCCCGCGT ATCCCACGGA ACGCACCCCC TTGGTGGTTT GACGCCACGC GTGTGTTGGC GTTGCCCTTG GAGTGGATAG AATTGAATTC CTAGAAATAT TAGAGTTTCT GGACTGGCTT GTTACATTCC TGCTAGTACC CCGTAGACCA CCGGGACGGA AGGGAAACCA ATGCGGGATG ATTGACTGTG CGTTTGGGAT TCGTGTGGTA CTTTTGGTTA GGGTAGTAGT CTCCTTTCCT ATTTCCTCAT AGGGGAAGAA TAGTGTGGGA TTGGATCCGA ATCGGTAGCG GACGCCGTAC GAACGGGAAA AGCTTCCGTA CGCGGTCTCA CGAGAAAAAA GTCCGACGGT ACACGCACTA CCGTACCAGG TAGTAGGTAT AACGCTCGAC TGCGCACGTT GTCAGGGTGC CGTACGAACT GTAAAAAAAA CGTACGGAGA CCCCCACTCC AGCAACGACC GAGAGAGGAG GATCCGGTTC TTGCCGTACA ACGACACTTT TTGATTTCCC GGAACTCGCA CCTATCCTGG GTAACCCTCG TGTGTTGTTC CAGTAGTCGT TCGACAGTGA CGTGTAGAAT AACACCAAGT CAACGAACGA GACAATCAAT CATGAGTTCC AAACCAACGG TACGTCTAGT CTAGCGTGAG GAATGACACG GTCTTCTCGT GCGGCGACAT GGGGGGTCTT GATTTGTGCC GGTATGGTGG GGGTGCATTA TCCTGTTGGT TCGCAAAGGT GTCAAGTGAT GCGGGAATCA ACGGATCGTC GTTTGGTTGG GTAGATATTC CTGGACTTTT GCTGTAAGAT TCGTCTCACA AACGTTCGGA ACTTGTTTGG TTCTCTCGTG TTGCGCGATT GTGTAGACTT CCAGTGCTGG TAGTAGCGGT ACTGGTAGTG CTGGTGGATG GGCCAACAAG CCTTCGTCTA TCCTGCAAGC CACTGCGGCC GACGTTGGGA CGCCTCCCGC CGTGGGGACG CGGTGGGCCA CGGTCTCCGC AACACCCCGT CGCACCAACA GTGCCGACGT TGGTCGCGCC CCGCGTCGCA CCCACAACAA CAACAACAAC AACAACAACA ACAAACCGCA AGGAGAAGGC CACACGAGTC GTTGGAACAA TAACAACACC AAAACGGGAC GTGGAGGCGG ACGAGGAACG GCGGGTGGCC ACCACTCCCA CCGTACGGGG GGAAACGTGA GTCACACACC CCACCACCAC CACCAGCACA ACAAAGAACG AAAAGCGCCG GCCAAGACGT CGTCCAAAAC TCCCATGGTG AATTTGAAAG ATGTGACGCT CTTTACCGAA ACTGGAACAG GGAACACGGC ACAGGAAAAG GTCGTGGTTC GTCTCTCGAT GGAGCATTTT CTGGCGACAC GACTGAACTA CCTGGACCCA CCGACGGTGT CGGCCGACGG TGAATCATCC CACTGGACTC CGCACGCGCG TTGTATTTGG ACGAGTCAAG ACCGGGGCAA CCAAATTCAA GAAGAAATGA AAGCCTTGTG GGATTACAAG CCCCTCGAAG TCAACGACGA CACCCGCTGG AAAGCTCGAG TCATGGAGGG CCAGAATGAA ACCGCAACGT CGGACTCGCC CGAAGAAATT CTTCGCAAAG CCACCGCCAT TCTCAACAAA CTTTCCTGGA CGACCTTGGA CAAGCTGACG GTCCGCTTCG TGGAAACGCT TAGTGGCGGT GCCAAGAGTA CCCACGGCGA CGCCGACCCG TCCCTCTCCA AAGAGACGGT GCGCGGCACC ATGCAGATGA TTGTCGACAA AGCCATGGCG GAACCCCACT TTGCCGAACT CTACGCACGC TTCTCCGGCA AGCTAGCGGC CGTGCACAAA ACTTTCAAAA AGATGCTGTT GAGTATTTGT CAGGAACAGT TCGAGATATC CGACAAGGAA CCTGAGTTTC CGGCAAGTAT GGACCCCGCC GAAAAGGCGT ACGAGCTGTT GCAATCACGT AAGCAATCGA TCGGGCTCAT GGCCTTTATC GGTGAGCTCT ACAAGCTGAA ACTCATTAAG GGTGCCATTA TGATTGGTTG CCTGCAACGC CTAATGGTAA TTGATGACGA AGAAAAGCTC GAATGCTTTA CCAATCTCAT GGCGACCATT GGGGCTCGCT TGCACGAGCA CGAGAACGAA CCCGAGGTGC ACGAAATTTG GGAAAAGCTA TATTCCATGG CCGGTAAGAC GAACAAGACG ACCGGACCGA AGGCGCCCAG CACGCGCATC AAATTTTTGT TGCAAGACTT GATCGAACTG AAAGAGAATA ATTGGGTCAA ACGCCGAGAA GAGGAAAAGG CCAAAACTAT CGCACAAATC CATAAAGAAG CGGCGGAAGC GGAACGCTCG GCGTCGAGAA ATGGTCCCGT TATCAACACT AACAGCAGCA GTCATAAGAA AGTAGCTCGC TCCCATTCGG CTTCTCTGCC CAACACGCGG TCGTCTTCTT CTTTACAAGG CCAGGAATCC TTTCCGGATG GTGAAGGCTT TATGCAAGTA CCCAACAAGC CAAAGAAAAA CTCCCTGCGA CGTGCACAAT CAGATTCTAT TCCGGACGCA ACGTCCGGTA TGAGCAGTTT GCAGCTCGCA ATGTCGGGGA AAAATAGCAA AACCTCGGCA CGGAAAAGCA GTGGTGGAAG CAGTCAACGT CCAGCGAGTC AAGCCCCGAA AATTGCGGAA TACTTGGATC CGAAGCAAGT GGGTGAAAAG ACCAAAACGT TGCTCAAAGA ATACTTTGTC TCTGGTGACA CAGCAGACGC TGTATTATCC TTCGATGATT TGATTGGAAA ATCGCACAAT GGCGACGTTA TTCGTGGAGG CGCGGTTGTT GAGGCTGGTA TATTGCTAGT GATGGAAATG AAAGAGGAAG ACGTAAAGAA GTTTTTGATG GTCACGGCGG CGCTTCTGAA ACAGGGGAAA ATCCCCCTTG CATCTTTTGC TAAAGGTATG AACAACCCGT TGGAGTCCCT GCGGGACATT GAGATTGATG CTCCAATGGC TGCTAAGCAT CTGGCCAGGA TTATAGCATC GTGGCTTTCA TGCAACGCTT TGTCGATCGA TTTTCTGCTC GGAGCGCCCG AATATTTCTT GTCCGATGGC CGACCCGCAG CGTTGGCCAA GCAAGTCTTG CAAATTCGTG GTGGCAACGT GTCGGACGAA GAAGTTAAGG TGGTGACGCA ACTCATGAGT GAGGAGGAGA AAAAGAATAT TGCATCTGTC AAAGAGTGGC TCCAGTAAAG TTGACGAGTA GTAGTAAACA TTTCGTGTTG
|
Protein sequence | MDQVTDDSVV FVPIDNDIDV TSTQRETTEA HPSEVESNAA CRVSSGTTLS DLSSNAGEAA GAMHMSRHSD FTLPTSNIFL GSLPNSIASS FDDKHSASLV PRRNIFGSTV QQQYGRNHVS DGMGSSGFGP PAAGSQYSTT AAAANPALTL TTMLPAYLTT GNPYSARSAA TLAAENHRLL PDGFSGQNTD HEQTISSQSY PLSLESAYQI TAHHHHDYDR NDSLMSSLTL PDSLHCRNGI SFSAATQLPI APSPLLEHER IKHHSHRVHE TLHAQSLLLG LAFCMVWSAS NLMAPNLTEM ALFYDFDHVQ RDLYLGSYCA LATGVLSFPI GAGIGILADV VSSRQRLFVA TVTGAGLSSI ATAFYTDAYW QLFLCRLING GFMSGSVPVV FSFLGDLFAT EERNTASSGL TAMMGLGIIL GQVYAGMIGG GILSGTVLDV NEAETEAHTA WQHAFVVSGI LTLISAVLCG CLVQEPVRGG KEKVLQDMLQ AGTRYERKLS WQGFWHAMRH NQSNLILLWQ GFFSSLPWGI IFVFLNDYLS QERGFSVPEA TYLVLLFGLG CAAGGILGGY WGQKVQSYRR SYLPLFMAGT TAAGILPFVG LLNTHFTNAH GVYGFVFSFG GGLVASLPAV NVRPCLINVN PPETRGASLT AANLLINLGR GFGPSCITLM GSIFNVDRQY AFNVTLIAFW LISALQLSLL ARTLPNDQDA MEAELASYAQ KAIREQQGQH EPPAWDDTDG SDRDEKEEDD DNDGAGDAPN LVSLEERMTS WDGKAARETL EYVRRGVREF REEVMSGGSK ALLGPFGCAD GISSEDEEEE MTTPRTMLRK REQWKRAQVI RNQIAMDGGT SDAPAYPTER TPLRTPYERE KLPYAVSREK SPTVHALPYQ VSSVRNDTVF SCGDMGGLDL CRYGGGALSC WFAKVSSDAG INGSSFGWVD IPGLLLAGSS GTGSAGGWAN KPSSILQATA ADVGTPPAVG TRWATVSATP RRTNSADVGR APRRTHNNNN NNNNNKPQGE GHTSRWNNNN TKTGRGGGRG TAGGHHSHRT GGNVSHTPHH HHQHNKERKA PAKTSSKTPM VNLKDVTLFT ETGTGNTAQE KVVVRLSMEH FLATRLNYLD PPTVSADGES SHWTPHARCI WTSQDRGNQI QEEMKALWDY KPLEVNDDTR WKARVMEGQN ETATSDSPEE ILRKATAILN KLSWTTLDKL TVRFVETLSG GAKSTHGDAD PSLSKETVRG TMQMIVDKAM AEPHFAELYA RFSGKLAAVH KTFKKMLLSI CQEQFEISDK EPEFPASMDP AEKAYELLQS RKQSIGLMAF IGELYKLKLI KGAIMIGCLQ RLMVIDDEEK LECFTNLMAT IGARLHEHEN EPEVHEIWEK LYSMAGKTNK TTGPKAPSTR IKFLLQDLIE LKENNWVKRR EEEKAKTIAQ IHKEAAEAER SASRNGPVIN TNSSSHKKVA RSHSASLPNT RSSSSLQGQE SFPDGEGFMQ VPNKPKKNSL RRAQSDSIPD ATSGMSSLQL AMSGKNSKTS ARKSSGGSSQ RPASQAPKIA EYLDPKQVGE KTKTLLKEYF VSGDTADAVL SFDDLIGKSH NGDVIRGGAV VEAGILLVME MKEEDVKKFL MVTAALLKQG KIPLASFAKG MNNPLESLRD IEIDAPMAAK HLARIIASWL SCNALSIDFL LGAPEYFLSD GRPAALAKQV LQIRGGNVSD EEVKVVTQLM SEEEKKNIAS VKEWLQ
|
| |