Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44446 |
Symbol | |
ID | 7197685 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 582478 |
End bp | 586819 |
Gene Length | 4342 bp |
Protein Length | 1354 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178257 |
Protein GI | 219114923 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGAAACAG TGAGACGACT CATTGACAGT GAACAAGCAC CATGCTGGCA CGTTCTATAG GAAGTTCTAC AGCTCGTCTT CGGCTTACAA ATAGCAAAGA CTTTGCGGGA GTTGACCGAG CGCGTCAAGT TGTACGAGAT TTTTCCTCTT GGCAACAACG TCAACAAGTC CGTTTTCATT CGTTTGCTAG AATCACACCC GGATTAGTAT CCAGCAGCGA TCTAGTTGAA AACTCTTGCT GCAGAAGAAG TACCGAAAAT CCCCTATTTC AACAGGTGCG GTGGAAAAGG AAAGACGGAC ACAAGGCTTT CGGTCGCGAG CGCCCGCCAA CGAAGGCACA ACGCAAACGC TATAACCGGT CCTTGCAAGC AAAAGAAAGC AAGCATGGAG TCCCGGGGAG TAAGGCAGGA ATACGGCGAG AGTGGGAAGC GGAACGAAGA GAAGATATAA TGAACCCCCC ACCAGATCTC CAGCTAGATG ACATGGAGTA TGGAATGGGA GACGCTATGC TAGACGATCT CATGGGCAAC ACAGCATATT TGACGGCTCA AGAAACTCCC GAGCCGGCAT ACCTTGGACA TCTCCACAAA AAGTTCTTTA GCCAAGCTGC TAACCGTATG CAAACGTATA AAGATCAGCT GGAGCGATTA AGCGACCCCA ATAGCGACAC GAGCCAACTG GTGGCCGCGG ACCGCGTGGA AATGCCAAAC GACAAAAACA TTTCTCTCGC GCTTCGTGCT TTTCGCGATC GGTACGGCAC CAAACGACAA CCGATAGGAA TGGTGAAAGC GTTGGAGCAT CTGGTGAAAG ATATCGGTGT TCCTATTACC GCTTTTGGCG AAAATACCTA CACAACTCTA ATGACGTGTA GCCGCAGTCC AGCCGAAGCT CGACGTATCA TTGACTTGAT GAGAAAAGAG CAGCATCCTG TCTCGTCGTA CAGCTGGTCC ATTTTAGTAG ATACGTACGC AAAGGCTGGC GACTTTAAAG GCTGTGCAAA TGCACAACTC GAAATGATTA ACGACGGTAT TCCTCCAACC TTGGTATCCT TTACCTCCCT TCTCGCGGCT TGTGCCAAAG TTTGCAATGA CGGTCGCGTT GCGCATAGTA TTCGAGCTCA GGCGGGAAAA CTGGGCTGGG AAAAGTGGCA AGAAATGCGT GTGGTGGGTG TTCATCCGGA TGTGATGGCC TACGGTGCTA TGCTAAAGCT CACGGCAGCT CGGGGACATC CGGAACGTGC AATTAATATT CTAGAAGAAA TGCAACAAAT GGATGTCAAG CCCACAACCC TTTGTTTCAG CTATGCTCTT CGAGCAGTCG CTAAGAGTCA TTCAACATCC ATTCGCTACG AGCGAGGTGC ATCGCGAAAG ATGAAAAGAC GGGAGTTTCT GACGCATCAC CACGGTAAGC TCGCGCGGTC GATTGTACTT CTTGCCGAAA ACGCTGAAGT CAAACAGGAC CAGGGATTTG TATCAGCACT TATAATGTGT GCAGCTGCTG CCGGTGATGT TGCTACTGCC AAGGCGATTT ATATTGCTAG CCAAATCCGC CAATTGGATG AGCTTCGAAC TATTGGATCT GATCAGCATC TGGCAAGACT GAGAGGAGAG GACATCCATC AATATGAGGG TGGCCTTTTG GAAGATCCTT CTCTTTCGAA TACAAACTCT TCTCTCAGCT TGCCTCCTCA AAACATGGGT CGTAAGTATC CAAGTTTCGA AGAACGAGAA TATGGGAAAG ACTCGAGAGT TCTCTCTGCC ATTCTTCACG CCTGTGCATG TGCTGTTGAT AAAAATGGTA TCGGTGATAT GTGGCAAGGT CGCGAAAACC AAGGGTATTT GTGCGAAAAT TCTCTTCGGC TACTCAAGGC TCGACAGGTG CCAAAGTTTG TCGATAATTC TATTCCTGGC CAAACGAGGA CGGATGCACT CAAGTGGGAA GGAGAGTATC GGGATGAGGA CTATCGCGGA AAAAAGCGGA GTCCTCGTAA ATTCCGAGGA GTCGAAGAAG ACATCGAAGC CGGAACGACT CTTGATCAGG TTGACGGGGA CATTGCTCGT ATGTATGAAG ACAAAGAGGG CCGTTTGAAA AAGGAATACC GAAGCACGTC ATTCGAGGAC GTTTGGAAAT TGAAATATGG CGAGGATTGG GCCAACGACA GCGGTACAAA AAGGATCGAA GAAAGGCCGA TGTTCAACCT CGAGTCAGGT CTGACGTCAA CAGGAACAGG AGACGCTTCT GTACGGGATG ACGGCACTAA TGAGATCGAA GAACAAATGT TCTTTGACAG AGAATCCATG CGATGGACGA CAATACCATT AAGCGAACGG ACGAAAACGC GACCGTCTAG TGTAGAGATT AATTTACCTG CAGTTGAAGG TCCGCCGCCG GAGCATGAAG AGATGTATTT TCATAAAGAG ACATCTCGTT GGATGACTCG TGCGGTTACG CCATCGAACA TGACACCAGA AACTGAAAAA AACATGTCGC AGGCTGTTAG CCCCTCAGAA TCGCGTGACG CACAGGAAGC TGTCGACTTC GACGACAGCA ACGAGGACCT ATACTTCGAC AAAGACGAGA GGCGTTGGAA AACGAGAAGC AGAGCAGACG CAGAGAACGC ACGTCGAACC GCGTTTGAAA ACCAGATCCT TCAAGCGAAG AGCCAGCCAA ACACGAAAAT GGCAGACAGA GAGAAGGTTA GTTTTTGTAG CATCGACAAG AGGCCGTACT TTTTGCACGT TACCAGTGAA GCAGTCTCTT TATAAGGAGC GACGTTGTTG CTTTGTCTGG ATCGACGTAC GGCAAAAGAA ACCGCTTGAG AGCCTTCGGA GAGTGTGTTG CTAGACATCT ATGAAGCAGC GCTTTCGGAG TTTTTTTGCT ACAACTCAAA TTGTTTTCCC TTTTTGCGTA GGTCGACACC GAAAAGACAG TTAAAAAAGG ACACGAGTTT CAGATCTTCT ACAGCGAACT TCGAAAGGAA ATGCAAGAAA GTGGAGAGAC GATTGACGAT TTGACGGAGG ATGAAGCCTA TGAACTATTC TTGTCTGCAG AAGAAGAATA CAACGAAATG ATGGACATGG ATCCAGACGA GTTTTCGACT TTGATCGACG AGCTCGATGA CGAAGAATTG GCTACCCTAT CTTCTAAATC TTTGCCTGAG CCGGAAAGTG CTCTTTCTTC CAATCTTGGA GCGGCTGAGA TCAAGTCAAA ACAAGAGTTA CATGCTAAGA TTGAAGAACT AAAGGATGAA TGGGACGATA GCGATGATGC CCCGGATGAT ATGTTCGAGC CATCTTTCTC CCCCAGCTCA TATAAGAGCG TCCATAGCCA ATCTTCTCCT GTTCGCGAGA AGACATTTCT GGAGGAACCA ACGGTGACAA CCACTCACGC TGCACAGGAA CCTACCATTT CTTCCGTGAG AGTGTTGGAT GCAGAATATG TTGAAAAGCC ACTCACCGGA CAATCTCCTT CACCGAAAAA TGTTGGCCTT GAGCAAATTG CTGGTGGTCG AGACGTTTTC CCCGGTGCGT TTCAGGAGCA TGCTGCAGCC ACGGACGCGG AGAGTTTTAC CAAGATCGAC GAGCAACTAG AGTTGCTCCG TGGCATGCTT CCAGGGTTTT CCGACAAGAG ACTAAGTAAG ATTCGAAAAG CCTTCCGCAG CTCGCTTGGC GACCCTTCTC TACTCGAGCT CACTTTAATT GCACGGGAGC AAATGCCTGA TTACATCACC AATAATTGGC TCAAGCATAT GAGCAGTTTG ACCTCTCGCT ACATCATGCA TAAGGCTGTT GAGGAAGAGA AAATTGATGT TCACGTGCTC AATGCTGTCC TCGAGCTCGA TACGGCAGCT GGTAGCCTAG ACCGGGCGTT GCAATTTTAT GAGACCGAGT TTGCTATCCG AAATTTGGAG CCAACGGCAT ACAGCGCCCG GCTTGTGATA CAAATGTTTG TCAAAAATAA AAGACTGCAA CGTGCTTTGA CGTTCAAGAA CTCGATTGAA ATGAGGGGAA GGACGCTTGA CATACTGTCA TACGGCGCCT TGATAGACTA TTGCAGTCGC CATGAACAAC TGGGATCGGC CCTACTGCTT TTAAAGGAGT GCCTTAGTAA GCACGAGGCG CCACCTGGCG AAGCCTACCT CAAGCATGTC CGACTGCTGT GTCGTCAAGC CGACCTTATT GAAGAAATTG GCCTGGAAGA TATGATTGGC AAAGATCCGA TCGAATGGCT GAGGCACGGC GAGGCAAATC TCAAGCGAGA TATGTCTAAA AAAGGCCGCC GCGATATTAA TCTTGGGCGG AATCGACTCA TGCAGCTTTA AACTAAAAAA AGTTACAGCT TT
|
Protein sequence | MLARSIGSST ARLRLTNSKD FAGVDRARQV VRDFSSWQQR QQVRFHSFAR ITPGLVSSSD LVENSCCRRS TENPLFQQVR WKRKDGHKAF GRERPPTKAQ RKRYNRSLQA KESKHGVPGS KAGIRREWEA ERREDIMNPP PDLQLDDMEY GMGDAMLDDL MGNTAYLTAQ ETPEPAYLGH LHKKFFSQAA NRMQTYKDQL ERLSDPNSDT SQLVAADRVE MPNDKNISLA LRAFRDRYGT KRQPIGMVKA LEHLVKDIGV PITAFGENTY TTLMTCSRSP AEARRIIDLM RKEQHPVSSY SWSILVDTYA KAGDFKGCAN AQLEMINDGI PPTLVSFTSL LAACAKVCND GRVAHSIRAQ AGKLGWEKWQ EMRVVGVHPD VMAYGAMLKL TAARGHPERA INILEEMQQM DVKPTTLCFS YALRAVAKSH STSIRYERGA SRKMKRREFL THHHGKLARS IVLLAENAEV KQDQGFVSAL IMCAAAAGDV ATAKAIYIAS QIRQLDELRT IGSDQHLARL RGEDIHQYEG GLLEDPSLSN TNSSLSLPPQ NMGRKYPSFE EREYGKDSRV LSAILHACAC AVDKNGIGDM WQGRENQGYL CENSLRLLKA RQVPKFVDNS IPGQTRTDAL KWEGEYRDED YRGKKRSPRK FRGVEEDIEA GTTLDQVDGD IARMYEDKEG RLKKEYRSTS FEDVWKLKYG EDWANDSGTK RIEERPMFNL ESGLTSTGTG DASVRDDGTN EIEEQMFFDR ESMRWTTIPL SERTKTRPSS VEINLPAVEG PPPEHEEMYF HKETSRWMTR AVTPSNMTPE TEKNMSQAVS PSESRDAQEA VDFDDSNEDL YFDKDERRWK TRSRADAENA RRTAFENQIL QAKSQPNTKM ADREKVDTEK TVKKGHEFQI FYSELRKEMQ ESGETIDDLT EDEAYELFLS AEEEYNEMMD MDPDEFSTLI DELDDEELAT LSSKSLPEPE SALSSNLGAA EIKSKQELHA KIEELKDEWD DSDDAPDDMF EPSFSPSSYK SVHSQSSPVR EKTFLEEPTV TTTHAAQEPT ISSVRVLDAE YVEKPLTGQS PSPKNVGLEQ IAGGRDVFPG AFQEHAAATD AESFTKIDEQ LELLRGMLPG FSDKRLSKIR KAFRSSLGDP SLLELTLIAR EQMPDYITNN WLKHMSSLTS RYIMHKAVEE EKIDVHVLNA VLELDTAAGS LDRALQFYET EFAIRNLEPT AYSARLVIQM FVKNKRLQRA LTFKNSIEMR GRTLDILSYG ALIDYCSRHE QLGSALLLLK ECLSKHEAPP GEAYLKHVRL LCRQADLIEE IGLEDMIGKD PIEWLRHGEA NLKRDMSKKG RRDINLGRNR LMQL
|
| |