Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46726 |
Symbol | |
ID | 7204631 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 288246 |
End bp | 292533 |
Gene Length | 4288 bp |
Protein Length | 1327 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185855 |
Protein GI | 219121256 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCTT TCCAATTGGA CGATGCCGTC CACGTCTCAC GCGAGTCGCG TCAACTGGAA GGTGTCGTCG CCTACCTCGG TCCCGTCGAC TTTGGCGACG GCGACGATTG GGTCGGTGTC CGTTTGACGG GAGCGGCCGT CGGTCTCGGC AAGAACGACG GAACCGTCCA GGGTCGATCC TACTTTGTCT GTCCGCCGCA GTGCGGTGTC TTTGTCCGAC ACGCAGCACT CACCAAGCGC CCCTTGTCGG CATTGGAGCG ATTGCGTCTC AAGCGCGAAC TCGCCGGTGT CGCCAAACCC CCCGCTCCGC ATCCATCGAC CCCGCCCCTC CGTCGGACCC CCACGCGTCG TTCCGCTACC GCTCCAACGA CTAGCCCTAC TAGCACGAGC ACTACCGGCA GTCCTACCGC AAACAACGAC TCGGTACCGT CTCCGCGGGC GTCCGCGCTT CCACGTCGCG ACCGGACTCC GACTACCGCT GCATCGACCC GGAGCAAATT GGAAGAATTG CGACAGCGGC GTGCCGCTCT GCAGGAAAAG AAGAACGAGG ACGCCACTCC CGCCGCGATG ACCAGTTCCA CCACCGGACC GATGGATAGG TCTATTGCGT CGGACGCCGG GACGGCGCAA ATTGCTCAGA TTCAGGCTCA ATTGGATCAA AAGACGACGG AAGCTACGCG GTTGCAACAA ACAATCGATG CAATGCGAAA CCAAGCGCAA GCATCGCAAG TCAAAATTGA GCAACTGGAA ATCGCCGTGG CCAACGCCAA CGCGGCAGCG GCCGAAGCGA CCGCCTCCGC TCCACAGACC GACACCAATA CTAACCCCGT CGTTCGAGAA GTGGTAACTC CCGCCGTACA GGAGCAAATT CTGCATTTAC AGCGCGACAA GGATACTTTA CAGGATCAGG TAGACAACGC ATTGCGGGAA CTCTCTAATA TTCGGACGGA CTTGGATCGG GAAAAGTCCG CCCACGCCAC CGCGGTGGAT CAGTTGACGG CAGTCCGGTC CCAAGCCACG GCGCTGGAAC ACACACTACA GACTCAATCG GCACAAACCA CACAGCGTAA TACATCCGAT GCCACACATT TTAAGGAACG CGCCAAACTG CAAACGGAAG TGAGTGGCTG GAAACGGAAA GTGGCCCAAC TCGAATTGGA AAAACAGGAA TTAGACAATA CGATTGAAGA CTTGACGCTG GATAAGGAAG GATTGTTGGA AGAAAAGGAA GCCTTGCAGG ACAAGTTGGA AGAATACAGG TTGGATACGG AAACAGCCCA AATGGAAGTC GAAGAACTCA AGATGGAATT GGAAGATGCC CAGACGGCCA CTGAACGAGC CGTGCAAGGT GACAGTATCC CCGTCGCAAC TGCCGAGACC TCTGCGGAAG CTGACGCCAA CGAGCAAGCC GAACGCAAGG CGCACGCTCT GGCAACACAA AATGCTCGTC TTCGGGAGGC CTTGATCCGT TTACGCGAAC AGACAACCAT GGAAAAGATG GAAATTACCA AAACCCTCCG CGCGGCGGAA AAGCAAGCAC TCGAAGGAAA ATCTTTGATG GAAGAAATCG AAGCTTTGCG CGCGACCAAG TCAAAGAACG ATGAGGAAAT TAATGACTTG AAAGACATGG TAGAAGAAGG AGCCGCCTTT GAAAGCATGG TGGAAGATTT GAGCGACCGT ATTTTGGCGC TGGAAGAAGA CAACATTGCC ATGCAGGGAA CGATCCGGGA GCTGGAAGAA GCGTCGGAAT TGACGGCTGA GATGGAAGAA GTACAGAACG ACGAGCTTAA AGCCCTATAC CGCGATTTGG AAGGCCGCGA CACCATTATT CGCAACTTGG AGGAAGCCAT CAAAATGTGA GTGAATGTCC CCTATCCCTT CGCTGTCGGA AATGAGACTC TGACTCTTTT TTCATCGCCA CTGTTTTCCC CAACAGGCAG CGACGTCGAG AAGAAGATTC TCGACGATCG GTTGCAAACT ATCGTACGAC TGTGGATACC CTCAAACAGG AAAAACAGGC GCTGCTAGAA CTGCAGCAGG GTGGAGAAGG CGAGAAGAGC GATTTAATGG TTTCGTCCCA AAGGGCATTG TCTCGTGCAG CGCAGTTGGT TTCGGATGCC GCAGAGATGC GCAAACGCGA GGCGCAGGCC GTTTTTGAAA AGATTGATCG ACAGCTGTAT TTTAATCTAT CAAGTCGTTT GGAGTCATTA CTGCCTCCGT CTGTCGTCGC ACCCGAACTA TCGGCTATGA AAGGCGAGCT GCTAACATCT AAGGTTATTG GCAAGGGATC CCGGACGCTT GAAGGAATTG ACGCTTCTTT TAGGAAGGTA ATCAAACCTG CATTGGGAGA ATTAGATGAA GTGCGCGCTG GTTACGTTCC TGGCATGTTG CAGTTGTCCG ATGAGGTGAA GCAAGACGTG GCTACGATGA TCCACCAGAC TGATTTCGCT CACTCGATTG TGAACGCTTC ATCCTATTTG TTGCGTCTAC TAGCCGCTGG CCAATGGCCA GACTTGCTGT CGCAGAACGC ATCGGTAGAG CTTGGTTCTG TTCTAGGAAA TTGCGTCGCT GACCTTGACA ATTCACTTGG AGTTGTCTTG AAAACCCTCA AAGAAGAAGG CACCCTGGCG CCGGACCAGT CAGATGTAGC CGGCTTCCGC CAGTCTGCGG ATATGGTAAT ACAAAGCATC CAGACCGAGA TGGATCGAGA AGATACCCCA CTCGTTCCGG TTGGCTGGAT GCCGCCGGCA TGGAAGCTTT TGACAGAAGC GACCACCGCC AAATACTCGT GCATTGGGGC TGCCGCAGCT TTATCAACCG TCGTCAACCA GAGTGATACA ATTGTGCTTC CTCAGGCACT CGCTTCACTG TACAATAAAC TCGAACAAGT CTCAATCCAA GCGCGAAGCG TTTGCCTTCG CCTTGCGAAT GTCGATGTGA CCAATACGGA AGTGGTGACG GATTTATCTG CTTCCATGTC CCACTGGGTC GAGTCATCGG ACAAAATTAT CAAAGAAGTT CAGAGCTTGA TTACTTCAGA GGGCAGCCAC TTAGAAGAAT GCCAGGCTGC TTGCGACAGT ACTCTAGGTC ATTTGACTAA AATTTCATCA TCCCTTCGGT CGGCCAATCT TAACCCAAAC GACGACGAAA GTTTCCATGC ATTGTCACCA GAGGTGGAGG ACTCTTGGTT CCGTCTCACG ACGCTTATCC GATCTGTTCG AAGTAAAGAC GGAGACGATG AAGATGTGAA CTTCTTGCTG CGTACACGCG CCATCGAAAC TCAGTTCGAC GAAGCTGTGG CAGATGTTCC CAAACTCTCC CTCGCTAGCG CGAAAGCTGC GAACTTGGAA AAGGTCAGTC AATGCGCTAT CAACAGAGAC TTCGCTTCCA TAGACCGACT CATCTGTTTT TTTCTTTTTT CAATAGAGTA TTTCCGTAAG ATCCAAGGAG GTTGCAATGC TCAACGCCCG CTTATCGGAA TTGGAAAGAC TTTTAGCAAA GTCGAATGTC AGCCCGTCAA AAGCGAAAAC AGCTGACCTA AACTCTGTAG ATGAGTACAG TAGCATGAAA GAAGAAAACA GAGTGGTACG TGTTCGAGGT GTTGGTGAAA GTTTTGTAGG CCAAGCCATT CTGACGCAAT CAAATTTTTG TTTGCTTGCA GTTAATGGAA GCGATGGACG TTTTACAGCG TCAGGTCGAC GAATATGAGA ATGAAATACG TGCTCTGAAA GATTTCAAAT CACCAAAGAG GGGAGCTGTG AATAACAGAA CACCACGACG GTCTTTAACA TCAGTAAATG ACATGAGTTC TTCTCAACGT AACCTTGGAG ATGATAGCCA AGGCAGTGCG TATGTGCTAG AGGCCGCTCT TTTTAGACCC GCTTTGCAAC AAGCCATTCG TGAAGCCGCG CGCTGGAAGA CGTCATCAAC TTTGACAATG CTTTCAAGCC TGCCGCCTCT GCCAGCGCTT TCATCGAATC AGTTTCAAAC AAATCTGAGC AGTTCTCGGT TTCAAGAATT GAATGAGATT TCCCATTTGT CTTTGGCTCT CTCCGCTTTC CGTCTCGAAA AGGCATCGGT TTCGCTTGTG GACTTGACGA AACAAGGAGT GCCCCCACGA ATGCAACTAC GAAATCTGAA CGCACGGAAG GCTGCCGCAT CGGAGCGATT AGAAACGATC ATACTTCGCT GTCGCGGTCA GTTGTGTACG TAAAGAAAGA AAGAACAACC GTGAAACATC TCACAGATGG CAACACCAGT TAAATTAAGT CGTCTAGCTA GTAGCCTA
|
Protein sequence | MTSFQLDDAV HVSRESRQLE GVVAYLGPVD FGDGDDWVGV RLTGAAVGLG KNDGTVQGRS YFVCPPQCGV FVRHAALTKR PLSALERLRL KRELAGVAKP PAPHPSTPPL RRTPTRRSAT APTTSPTSTS TTGSPTANND SVPSPRASAL PRRDRTPTTA ASTRSKLEEL RQRRAALQEK KNEDATPAAM TSSTTGPMDR SIASDAGTAQ IAQIQAQLDQ KTTEATRLQQ TIDAMRNQAQ ASQVKIEQLE IAVANANAAA AEATASAPQT DTNTNPVVRE VVTPAVQEQI LHLQRDKDTL QDQVDNALRE LSNIRTDLDR EKSAHATAVD QLTAVRSQAT ALEHTLQTQS AQTTQRNTSD ATHFKERAKL QTEVSGWKRK VAQLELEKQE LDNTIEDLTL DKEGLLEEKE ALQDKLEEYR LDTETAQMEV EELKMELEDA QTATERAVQG DSIPVATAET SAEADANEQA ERKAHALATQ NARLREALIR LREQTTMEKM EITKTLRAAE KQALEGKSLM EEIEALRATK SKNDEEINDL KDMVEEGAAF ESMVEDLSDR ILALEEDNIA MQGTIRELEE ASELTAEMEE VQNDELKALY RDLEGRDTII RNLEEAIKMQ RRREEDSRRS VANYRTTVDT LKQEKQALLE LQQGGEGEKS DLMVSSQRAL SRAAQLVSDA AEMRKREAQA VFEKIDRQLY FNLSSRLESL LPPSVVAPEL SAMKGELLTS KVIGKGSRTL EGIDASFRKV IKPALGELDE VRAGYVPGML QLSDEVKQDV ATMIHQTDFA HSIVNASSYL LRLLAAGQWP DLLSQNASVE LGSVLGNCVA DLDNSLGVVL KTLKEEGTLA PDQSDVAGFR QSADMVIQSI QTEMDREDTP LVPVGWMPPA WKLLTEATTA KYSCIGAAAA LSTVVNQSDT IVLPQALASL YNKLEQVSIQ ARSVCLRLAN VDVTNTEVVT DLSASMSHWV ESSDKIIKEV QSLITSEGSH LEECQAACDS TLGHLTKISS SLRSANLNPN DDESFHALSP EVEDSWFRLT TLIRSVRSKD GDDEDVNFLL RTRAIETQFD EAVADVPKLS LASAKAANLE KSISVRSKEV AMLNARLSEL ERLLAKSNVS PSKAKTADLN SVDEYSSMKE ENRVLMEAMD VLQRQVDEYE NEIRALKDFK SPKRGAVNNR TPRRSLTSVN DMSSSQRNLG DDSQGSAYVL EAALFRPALQ QAIREAARWK TSSTLTMLSS LPPLPALSSN QFQTNLSSSR FQELNEISHL SLALSAFRLE KASVSLVDLT KQGVPPRMQL RNLNARKAAA SERLETIILR CRGQLCT
|
| |