Gene Information |
Locus tag | PHATRDRAFT_47371 |
Symbol | |
ID | 7202521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 431275 |
End bp | 435915 |
Gene Length | 4641 bp |
Protein Length | 1546 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181554 |
Protein GI | 219122443 |
COG category | |
COG ID | |
TIGRFAM ID | |
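The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene is unspliced (as the record states), and that the gene length includes a terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Protein length in aa for an unspliced CDS (assumption: no introns)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(431275, 435915)
print(length_bp)                  # 4641
print(protein_length(length_bp))  # 1546
```

Under these assumptions the computed values agree with the Gene Length (4641 bp) and Protein Length (1546 aa) fields in the record.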
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAAAC GCCGTCGCAG TAAATCGGCC GAAGGAACGG CGAGCACGAA AAGTGCTACG TCGACGAGAG CCTCCCCGAA AATGAGCCTG GATGGTACGC CTCCATCACA GACACGAGCG CCTACGCCGG CGCGCTCCAA CCCGTTTACG GACCTGTTTG TGGAGTCCGA AAATGTTGTA AACGATGGCA TTCTGGACAG CACACTGGTG TTTCTGGAAA AGGGAATCCT CACCGAGCCT GGCTTTCGTA CAGCGGCTAC GGAGTTGATG GACTACGTCG AGGCTTGCCA AGATGACAAC ACGTCCACAA CGCCCACAAC AGTGACATCG TGTTCCACCT TGACGACGCC GCAAGTTTCC CTTGTGCTCC TGCCATGGGC AATTCGTAAT ATTTTGGGCA CGGAAGTATC CACGGACGAA TTGGTGTGGC GTGCACTTTC CGTTTGTCTC CAATTGGTCG CAGCTACCTC TCACCAGAAC GATTTGCAGA GAGATGATCA ATGGGAAAGT ATTTTGACCA AGTCGACTTT ATTCAAGCTT GTTCCCAGAA TTGCTGTTTT CTGTGTACGC AATGAATGCA TCCTGGAAAC CGACGATACG TCGGAAATAG AGAAGGCCCG AGCATTTGGT TGTCAAGCGT ACGAAACAAT TCTGGCGTCA AACTTGTACG CACCGACTTT GGACGTCGCC TGCCACAAGG TGTGGATTCC CCTAGTGGAG TGTCTAACCA GAAACGACCA GGGTTCTTCT AGCGCCAAGC GCACGACCTG TGCACTGCAG GCCACGGTAC GATTTCTGCG CAAGCTCCAA TGCTCCGGCA AGGGCAATCC GAAAACGATT TTTGGTTTAG TGGCAACCCG AGAAGTCCTG GTGGCAGCAT CGCGGACGTA TACGTTGTTG GATACAAACC CTGATGGGCG AACAACGCTG CGAGACTTTC TTTACCAAAG TCTCTTTGAC GTGTCGCAGC ATATGGATGG ATTTCGATCG TTGCTCTCCA AGGGCACAAT CCCTGTTCAA GAAGATAACA GTATGGACAC AGGATCTCCA CTCAGCGAAC GAGACAAACC TACTCTTTTC TTTCGATGCT ACCAAGATTC TCTGCTGAAG ACGATAACAG AAAGCATTGT TATGGAGGAC GTCGATGTCA TTCAGACCGT GCCCGTCTTA CTCCGTGGCT TTTTGCATGA ATCCATGGCT TGGGAGGTAA GGGAGAGAGA GAACAAGAGG CAGAAAGGCT CACTTAATTC CGAAATCTCG AATACGGTTC TCTTTCAAAT GTTCGTATTT CTGACTGTTC CCCTTCGTAA GTTGCTGGTT TCCTCAAAAG TACCAGAAAC CATCAGTCCC GCGACGCAAG CTCTACGCGA GTGTCTGGAG TCTTTGTTCG AGCAAGACGC CTACCTTCCA TCACAAGACG TGGACGGAAA ACAACTCTTG TATTTGGGCA TTATCACCAA TGAGCTGAGT TCGTTATCGG CTACCCAGAT GGTGCAGCCA GCTGACAATG CACTTGCTGC TAATTGTATA CACTCTTTTC GAACACTGAT GCAATTGAAT CACAATCTTC TTCATGAACA GATATCTTCC ATTACGGTCC TCTTGTTTCA GTTTGGTGGC GATTACCGAC TTGGAAACGA GATAACCCAG TTTCTGGTCG TAGTTGTGGA GACGTATACC AAGCTAAGAC GACAAGGCTA TTTAATTCGT GCAATTCTGC AAGTGGTCGG AGTCCTTACC CATACAAAGG GAGGTGAGAA TGCTGTATCA CTTCCGTCTC TACTACAACA TACATCATTG 
ATGACGGCTT TCGCCAACTC TGCTCAGTAC AGTCCGGTCT TTCAGGTGAA AGAAATATTT GAAACCATCC AAAAGTTTAT CTTGGGTTTG AAAGAGAGTG ATAGTTCCTT GGAAAACACC GTTCGTGCGT TGGATTCAGT CGTAGAGCTG ACAATCGTCA TGGTACAAAA TGTGAGAGTT GACAGTGGGA CAGCTTCCGA GGTGGCATCA TTGTGTCAGG AGGTCGTCGA CACTGCGCTT TTAAGACTCA TCAGTCCTTC ACTGGGTTCT CTCACTGGTG CAGGGCTACG GCTTTGTGGG TGGTTTATTG AATTACACGC AAGGTGTGCC TTTTGGTTGG GCCCGGAAAC CCAACTGGAG ATTCCACCTT CAGTCATGGA GATTCTATCA ACTGCATCGC AATACGCCAA AGGAGGAGAA GATTTGGCGG GCTACGAGCA AATACTGGAC GAGCTCCTCT TTTTGGCATT GCATCGACTA CGCCAACTAC ATTCGCTGAT TCATGAGCAA GAGCGCATAA ATTTGGGGGC TCTGAGGTCG ATAGAAGGAA ACAATGTTTT CACCATAGAA GCTAGCAAGT TAGCGTTTTT TGCTGGATTT GTTGCGAAAC AGAATGAGGA CTCTCCCTTG TCAGGTTCTC GATGGAGTAA AGTTGCTCGC GCTTTCGCTT CATGGTCGCC ATACGCTGAA GACGAAGACG TCCGACTCTT TTTGGAATGG ATGATTGGAG TTTTGGCTAC TGACGAAGCA ACGACAGAAT GTCATCTCGA TTCTTGTAAG ATTCCTCAAA GCAATGCAGC CGTACGAGAA AATCTACAAA CAGCAAGAGC ACTTCTGTAC GATTCCTCGT TTTTGGAGGA TGCTCGCGTA GCTTCCAAGT TTGCCCTTGC CGCCCTCTCC TGTACATCGA TTTCGGTAAC GTCGGCAATC GAAACGCTCG GAAGTATCCG CTTCCACGAA ACGAAGTCAT CGGGTACTTC TCCTTTACCC CTTGGAATCA ACAGCAAGGA CAAATCTGGT CAGCTTGATG CGCATCTCGT GACAGAAATG CCGCTTTTCG ACTCGACTAC TGCATTGTCG AAGTCAAGTC GGAAGACAAT GCTGGCATGT TTGCAAAAGA CTGGAAGGCC TTTGATGTTT GTTAACAGCT TAGCCTCACT GTATTGCCAC CTAGAGGATC CCATGGATTT CGTCGACTCG TTGTTGAGGT TGGATCAAGT CTGTAGATCC ATGGCGATGT TGAGTTCCGA TATATCTCAA AAAGCTCTTC ATCTAGTGAA AGTGTTAAGG TTTGCTGTGG CCAGCACCCT TACTCGCATC GACGCGGGGT CTCTTTTTCA CGTGTTGGGT AATTCTCAAG AAATCACGGA GGTCTTGAAG ACAATAGTGC TGTCAGTAAA GCATTTGTGT CTTGAGACCT CCCTTTCAAC GTCGGAAGTG ATGGCAGAAA TTTTGACGGC ATCTTCTGCA CTGACGGAGC AGCTCGTCCG TGTATCCGGA GTTTTCGAGG AGCAGATAAA ACAAAGATTT GAGCAACTCA TCAGAACAGC ATTCTGCATC GATAGTTCAA TCAAGTCAGA ACGGGTTCAT GAGATTGGTG TAGTTGCGTG CCTAGGACGT TCCATCCTGA AAGGCATGAA ATCAAATCAA CTCTTTCATG AGGAGACGTC GGATTCAACT GCTTTCAGAA ACATTCGCGG TTTGTTGTTT CCATTGGTTG CAGAACTCTG CTTAGGAAGC GAAGATAGTA CATGTAGCCG TCATGGATAT CTCTTGTTTG GAGATCTCAT TCGATTCACC ACTGATTCGG 
AAAACTGTCC CTTTGCCGAA ACAAGAAGGG ATATTGAATC TGTTTGCATA GCCTCGCTTT GCAACGCGTC TCTCTCTAAA GATGCCTTAC ATTGCAGGCG GTACGTAGTG GCATGTCTTG TAGAGACAAG ACCATCTCCT GCAGTTGCCC GGCAAATTCT CGACCAGATC CTTAATTGCC AGGTTTCGTT TCCGCTGTTG GATACTAGTT TTTGTCAGTT GGTCGGCAGT CTTCATGAGC AAGCTCTGGA GGAAATGTTT GAACGTCTTG TATCTGACCA GCAATTGTTG GTAAGATCTA GGCCACTGAA TCTTCGTCTT TCACGATACG TTGTGCAATG TGTGAAAGAA ACGGACCAAA TCGAAGTGGC ATCGAAGTAC GGCCAAACTC TATTTCGAGT TGCCGTGGAT TCAATTTACG GTTTTACTCC CGGTGCTTCA TGGCAACATG ATTTTGAGGA GTCAGTAAAC TTGGTTGTAG AATTGATTTT GAGACGAGAC GTATTTGCCT GTCGGGAGCT GGACTTGGCT CATTTGTTGT GTCGGCTGAC GGATGTGCTC CGTCCAAATG GAAAAGTGAA GTATGTGACG GATCATATAT TTGCCTCTTG CGGACGAATA ATCATGACAA TCTTTCTGCG CTACTCCAAA CAAGTGTACA GTTGCGTTCC GTCGATGATA CAGGTCCTGC GCTCGTTACA ACGACATGTT TTGTACAGAA CCGGCGAGAA TGGTCTGGAT ATTGCGGACC GGGCGCAGCG GCTGACACGG TTGTACGAGC AAGTGTACGC GCACCGAGAC GTTTTTAAAA AGCACGTGCT GGGCTTGCTA CTTGACTTTG TCTACTGCCT CCAACAGGAC ACCAATCCGA CTGTCAAGGA AAGTATGACA CCTGCCGTAT ACTGTCTTTT GGATACACTC TCCAAATACG AAACGAAGCA ACTGAAGGGT CTCATGGATT TGAAGGCGAA GGCAGTGTTC AAGGCCGTCT ATCAAGGCTA TCATGTGCAT CATGCATACA AAGGGCAATA A
|
Protein sequence | MPKRRRSKSA EGTASTKSAT STRASPKMSL DGTPPSQTRA PTPARSNPFT DLFVESENVV NDGILDSTLV FLEKGILTEP GFRTAATELM DYVEACQDDN TSTTPTTVTS CSTLTTPQVS LVLLPWAIRN ILGTEVSTDE LVWRALSVCL QLVAATSHQN DLQRDDQWES ILTKSTLFKL VPRIAVFCVR NECILETDDT SEIEKARAFG CQAYETILAS NLYAPTLDVA CHKVWIPLVE CLTRNDQGSS SAKRTTCALQ ATVRFLRKLQ CSGKGNPKTI FGLVATREVL VAASRTYTLL DTNPDGRTTL RDFLYQSLFD VSQHMDGFRS LLSKGTIPVQ EDNSMDTGSP LSERDKPTLF FRCYQDSLLK TITESIVMED VDVIQTVPVL LRGFLHESMA WEVRERENKR QKGSLNSEIS NTVLFQMFVF LTVPLRKLLV SSKVPETISP ATQALRECLE SLFEQDAYLP SQDVDGKQLL YLGIITNELS SLSATQMVQP ADNALAANCI HSFRTLMQLN HNLLHEQISS ITVLLFQFGG DYRLGNEITQ FLVVVVETYT KLRRQGYLIR AILQVVGVLT HTKGGENAVS LPSLLQHTSL MTAFANSAQY SPVFQVKEIF ETIQKFILGL KESDSSLENT VRALDSVVEL TIVMVQNVRV DSGTASEVAS LCQEVVDTAL LRLISPSLGS LTGAGLRLCG WFIELHARCA FWLGPETQLE IPPSVMEILS TASQYAKGGE DLAGYEQILD ELLFLALHRL RQLHSLIHEQ ERINLGALRS IEGNNVFTIE ASKLAFFAGF VAKQNEDSPL SGSRWSKVAR AFASWSPYAE DEDVRLFLEW MIGVLATDEA TTECHLDSCK IPQSNAAVRE NLQTARALLY DSSFLEDARV ASKFALAALS CTSISVTSAI ETLGSIRFHE TKSSGTSPLP LGINSKDKSG QLDAHLVTEM PLFDSTTALS KSSRKTMLAC LQKTGRPLMF VNSLASLYCH LEDPMDFVDS LLRLDQVCRS MAMLSSDISQ KALHLVKVLR FAVASTLTRI DAGSLFHVLG NSQEITEVLK TIVLSVKHLC LETSLSTSEV MAEILTASSA LTEQLVRVSG VFEEQIKQRF EQLIRTAFCI DSSIKSERVH EIGVVACLGR SILKGMKSNQ LFHEETSDST AFRNIRGLLF PLVAELCLGS EDSTCSRHGY LLFGDLIRFT TDSENCPFAE TRRDIESVCI ASLCNASLSK DALHCRRYVV ACLVETRPSP AVARQILDQI LNCQVSFPLL DTSFCQLVGS LHEQALEEMF ERLVSDQQLL VRSRPLNLRL SRYVVQCVKE TDQIEVASKY GQTLFRVAVD SIYGFTPGAS WQHDFEESVN LVVELILRRD VFACRELDLA HLLCRLTDVL RPNGKVKYVT DHIFASCGRI IMTIFLRYSK QVYSCVPSMI QVLRSLQRHV LYRTGENGLD IADRAQRLTR LYEQVYAHRD VFKKHVLGLL LDFVYCLQQD TNPTVKESMT PAVYCLLDTL SKYETKQLKG LMDLKAKAVF KAVYQGYHVH HAYKGQ
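The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full field.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence shown above.
raw = "ATGCCCAAAC GCCGTCGCAG TAAATCGGCC"
seq = clean_sequence(raw)
print(len(seq))                    # 30
print(round(gc_fraction(seq), 2))  # 0.6
```

Applying the same two functions to the complete Gene sequence field would give a value that can be compared with the GC content listed in the Gene Information table.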