Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47619 |
Symbol | |
ID | 7202823 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 297191 |
End bp | 302273 |
Gene Length | 5083 bp |
Protein Length | 1565 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182043 |
Protein GI | 219123462 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAGC GAGGTCGTAC GTCCGTCGTA GGAGCCGATC ACTGCGCCAA AAAATTCCTT CCGACCCCCC TAGACTTTGT CGGTGTTGCG GACCGGGACC GTAAAAGGCA ACTTCCCTCA CTGGAAGTTC TCGATAGACT CTGCGAGGGC ACAAGCGCGC GTTCGTTGGC TGGATGGACC TGGTGTTTGC CTCTTTCACC AGAGAACACT AGAGTCCCTA TAGAAGGGAA GAGCTTTACC AATAGATCGA ACCACGTACA TCCCGAAGTC ATTATGAGCG CGGCGGCAGC CGGCAGTGGC TGGAAATGTT CTCGTTGTAC GTTCTACCAC CATGCTCCTG GATCACGTTG CGCCATGTGC AACGAGCTAC GAGTCTCGCG ACAGCAGATG CGCGACTTCG TCATCGGCAA GCCTATTCCC AATGACGATG CAGTATCCCC GATGGGCCAA AAGCTACACG GACCATCCAC AAGCCTGTAG TCAATCCCTA CGTTCGCACC TGTCCCAGCA ACAAACCATA CTTTCTACAT TGCGGCCCAA GAGCGCCTCT TCACGGGCCA TTGCGAATCC GTACGCAAAA CCGTTGGCTG TCTCACAGCA GCAACAATCC GAATCGGTCG ACAAGACGCC GTCATCACCA ATTCAACCAC ACATCCCTTC CGAGTCCCTG CCACTGCCGC AATATACCGT CACGAATCAA ACAACCAGTT CCACTAGCAG TACCGCAGGT ATTGGGAATA AACTCCACCG ATCAGCAACC AGTGATGGAG ATTCTAATCC ATCCGATACG GGACGCGGAA AGAACCCCTT CGTGGCCCTC AAACCTCGCC CTGTTTTGGA GTACAAACCT GGTCCGGTCC CGATAGACGA GAGCACGGCT CATGAATGGG TCTATCCGAC AGCCGAAACT TTCCCCAAAC GTCAGTACCA ATTAGAAATT ACGAAAACGG CTCTCTTTCA TAACACTTTG GTATCCTTGC CAACCGGCCT AGGCAAAACT TTGATTGCTG CCGCGGTACT CTACAACTAC TATCGATGGT TTCCAACCGG CAAAGTCATT TTCCTCGCTC CCACTTTACC TTTGGTCAAT CAGCAAGTCA AGGCTTGCTA CGATATTATG GGTATTCCTC CATCAGACAC GGCGGTGCTC ACAGGAAAAG TGCATGCCGC TCGTCGGGAA ATCGTATGGC GTGACCGCCG CGTCTTTTTT TGCACGCCTC AGACCGTCCA AAAGGACTTG GACGCGAACC GCTGCGACTC CTCCAAAGTC GTCTGTGTAG TGCTAGACGA AGCTCACAAG TCCACGGGAG ATTATGCTTA CGTCAAAGTC GTGGAACGCC TCGAGGAGGC TGGTGCACAT TTTAGGGTTC TGGGATTGAG TGCGACGCCC GGTACCAATA TAAAAGCAAT TCAGAGTGTG GTAGATGTAC TGCGCATCAA CAAGATTGAG GCCCGTCGGG ATTCGGATCC ATCGGTAGCC CGATATATAC ACGAAAAGCA GTCGGAGATT GTTGTTGTGA AGCAGGCGTC TGCATCACGA ACCATTGAAC GGGCCCTCAA CGATGTTGTC GGACCATTGT TGGAGCGATT ACGGAGCGCA GGGGCATTGG GGCGTCTCAC GGGAAACGCT ACAATAACCG CGTATAATTT AATACGTGCC CGGGAAGAGT TTTGCAAACG TCGGAACGAC GACGGCTTGA TTGGTTTCTT TCTCGCTGCG CAACAATTTG TACAAATACG ATCGGATTTA CACAAACACG GAGTTGGTTT GGTGAGGTCA AAGCTTAGTC GTCTACGGAC CGAACGTCAA CGGGGTATGG TGGCTTCAAT TGTCAAGGGG AAAGAATTTC AAGCTTTGTG GGAGGAAGTT GCCAAATCGA CGTGCGACCC AAATTCCAAT CACAATAACG TGCAGGACAA ATTAGTGAAC AATCCAAAAC TAACGCAGCT CAGAGAGATT CTTGTCGAGC ACTTTGAAAG AGCTCGGGCT TGCTCGACGT CTTCGCGGGC TATCGTCTTT TCGCAATTTC GAGACTCTGT GTCGGAAATT GTGGATATAC TTTCCGCTTC CAGACCGTTA ATTCGGCCTC GACATTTTGT GGGTCAAGGG AAGGGTACCA AAGGCGAGGG AGGAATACAG CTAAAAGGGA TGAGACAAGT TGAACAGCAA CAAGCTATTC GCGAATTTCG CGAGGATACT TTCAATGTCC TCGTATGCAC TTGTGAGTAA AATCGGGTCA AAGGATGTGC ATTTGCGCGC TAACACGCCT TTCTAAACCT TGCCCCAATT TGCTTATCGC TAGGCATCGG CGAAGAGGGT TTGGATATTG GAGAGGTTGA CCTGATTGTT AATTTCGATA CTCTTCGCTC CCCAATCCGT ATGATTCAAC GTACCGGGCG TACTGGACGG AAGAGAGATG GTCGCGTGGT TTGCTTAGTG GCTGAAGGGC CCGAGGAACG AACGTTGCTG GCGTCTCGCC AGTCCGAGCG AAATCTAGCG CACGCTCTGA AGAATCCCAA GTCATTTCGA GTAGCGCAAA CAATGCCACT TTTTCCAAGC CAACCGAAGT TAAGAGAGCA AAACATGTTG GTATCAAAAG ATTTCCACAT TTCTCAGGTT GAAGGTCACG AAGGAACTAG GCGCAAGGTT TTGGGCCCCT TTGATGCGAG AAATAGTCAA TCTGCTCAAG AGAAAATTCG CTGGAGGCTT GTGACAAAAG AGGAAAACGA ACGCGAATCG GCTTTAGGGA GAATACTTGT TTCCAAGGGT TGCTCAGAGG TTTGGAGAAC GTCGCTACGG CGCCGTTTTC TGTTGGCAAG GAGTCTTTCC AATATCTCTG ACACTCGACG ACTCAATTAC ACAACTGGAA GAACCGTCAG AATTCTTGCA GATCTTCGGT ATTCCCACGG TGTAAATGTT TCACACAATT ATAGCAAGAC ACACTCTAGG GGTCATGGCT GGACTTTAGA GCATCTCTTC CCTCTCAAGT TTGACTCATT GATCGGCAAA GGATTGCATG GGGAAATTCT GCCGGTAATC AATCAGTCGC TTCCGAAGAA CCTCCTAGTT GAGCTTGATG TGCTAAATAA AGGGAGCATG GAGACTATTT TCCGGAATGA CCACGGATGT GTGGGGAGGA ACTCTATCCT CGGACAGAAA GGCTTTGTCA GAAACAAGAA AACACGAGAA ACAAATACGG AGCACTCCGA AGGGGAACAT TATCTGACCA AGGAGCAAAC AGAAGCTTTC GGAATCGATG CGCCTCCCCA TGAAGCAAGA AGAAATTGCA GAAGAAGAAT TCAACAAGAA GAAGCCAAAA AGAGACCGTT GTCTTCATCA GGAGTCGATA TACTCGGTCA AGAATCTATG ATGAGCAAAT TCGACTTGGC CTCCACTGCC AATGACAAAG AAAACGAGCC TCATTGTGCG TGGACGGGCA GTACAACTCT AGAGACATCG ATAGTTGACG AATTTATACT GCCCACTGCA GACCAAGATT GCGATACAGA TGAGTGCGCA CAAGATGAGT TTGTCCTCCC CCCGACGGAA GATTTGTCAT CGAGTAGCGA CGAAGAAGAA TCTAGGCCAG CCTATGAATC TGTTATTCCC AGAACTGCTG TTGCTCACAA TACAGGATCT TTTGCCGGAG CGAGCTACTT CCCTATGTCG GAGCATGAGA CAACTACTGT TGATTCAAAC CAGTTCTATT ATCGTCTGCC GACCCAGAGC GACTCTTCTT CCGATGAGGA TGACGAAACC GCTATGGCTT CGAGGTATGT GGTAAACAGT CAGTCAAAGA CGGCAAGCAA GCAGGTTGCA GTCTCTTCTT CCATTAAGGA TAAGCAAAGA TCCTCTACTG GACTTCGATC AACGGCACGT TTCGGTCTTG AAACGTCATC AGAATGCTCA TTTGACGATT TAGTCACCAC CGATAGTATT CGCGACGCTA GCGTCATAGA GAGTCCAGAT TCCCAAGCAA AAATGAAAGC TACTACACGA CGTGCCATTG AAGACACCCC AGAATCTACG ATATCGCCTC GTGTAAGCTT GGCAGAACCC TTACAAGGAA AAAATGAGCT TACCAATGAC GTAGATGGGT TACTGGATAC ACAAGATGAC GTTGTCTCGG GTGTTTCCGA TATAGTATGT GCTGTATGCT TCTCTGGGGA GTCGGTCGAC GATGACCCAA TAGTCTTGTG TGATGGACGA GGCAAAGGAG AAACATGTAA TTTAGCTGTG CACGCAACCT GCTATTCAAT ACCCATCAGT TGCCTCGGTG ACGCTGAGTG GTTCTGTGAT CTTTGCCGGT TTCCTACCAA CCCGTCACAG CCTGCTCCCT CATGCTCGCA TTGCCATCAA GAAGGAGGTT CTCTACGGCG GATGCCTTCG TTTGAATGGT CTCACCCGCT TTGCCCTATG AATCAAAATA GCGAAGCGCG TAGAGCAGGG TTCAAAAGGC TCCGAAAGAT CCCAAAACGA AAGTCGCTCC CCCTCGATTT GTCATCACAG ATTGATATGT ATCATGCAGA TGACAACACC AACGTACCCA AGCGAAAGCT TCGGCACTAT CGCTGCTTTC TGGACGAAGA AGCTGGGATT GATTCAGACG AGGACATTGA CGGAGATCGT ATGGAGGACG AAGAGCTGGA TGCTATCGAG GACGAGGAAG CCGCAATGAG CAGTTTCATC AACGACTCGT CACAATTGGG CCAAACGCAG GACGAATTGG ATCTAGCTGA CCCACCTGCA CCAGAGGATT GCGTGCATCG CCAGCTCGAC CTTGATCGTG AGCGCAAAAC CGTCTTCTCA ACTCCACTTC TTAATCGCCG AATGAAACGT CGAAAAGGAC GCGATAGTTG GACACCAACA CCAGCGTCGG CCCCAGACTC AGAGAAAGGA CTGGGGAACA TGCATTTCAT AAGAAGTGTG ATCGAACACC ATCGTCGAGG CGGTGACTCG GAAGCTATCG AGCAACTGTA CCGAATGGAA GAAGACCAGA GTGTTGATGA AAGTGGTGCT CTTGACTGCG AACAGACACT ACGTCCTCGG ATAGTTCACT ACTGTAACAG TGACTCAGAC TAG
|
Protein sequence | MSERGRTSVV GADHCAKKFL PTPLDFVGVA DRDRKRQLPS LEVLDRLCEG TSARSLAGWT WCLPLSPENT RVPIEGKSFT NRSNHVHPEV IMSAAAAGSG WKCSRCTFYH HAPGSRCAMC NELRVSRQQM RDFVIGKPIP NDDAQQTILS TLRPKSASSR AIANPYAKPL AVSQQQQSES VDKTPSSPIQ PHIPSESLPL PQYTVTNQTT SSTSSTAGIG NKLHRSATSD GDSNPSDTGR GKNPFVALKP RPVLEYKPGP VPIDESTAHE WVYPTAETFP KRQYQLEITK TALFHNTLVS LPTGLGKTLI AAAVLYNYYR WFPTGKVIFL APTLPLVNQQ VKACYDIMGI PPSDTAVLTG KVHAARREIV WRDRRVFFCT PQTVQKDLDA NRCDSSKVVC VVLDEAHKST GDYAYVKVVE RLEEAGAHFR VLGLSATPGT NIKAIQSVVD VLRINKIEAR RDSDPSVARY IHEKQSEIVV VKQASASRTI ERALNDVVGP LLERLRSAGA LGRLTGNATI TAYNLIRARE EFCKRRNDDG LIGFFLAAQQ FVQIRSDLHK HGVGLVRSKL SRLRTERQRG MVASIVKGKE FQALWEEVAK STCDPNSNHN NVQDKLVNNP KLTQLREILV EHFERARACS TSSRAIVFSQ FRDSVSEIVD ILSASRPLIR PRHFVGQGKG TKGEGGIQLK GMRQVEQQQA IREFREDTFN VLVCTCIGEE GLDIGEVDLI VNFDTLRSPI RMIQRTGRTG RKRDGRVVCL VAEGPEERTL LASRQSERNL AHALKNPKSF RVAQTMPLFP SQPKLREQNM LVSKDFHISQ VEGHEGTRRK VLGPFDARNS QSAQEKIRWR LVTKEENERE SALGRILVSK GCSEVWRTSL RRRFLLARSL SNISDTRRLN YTTGRTVRIL ADLRYSHGVN VSHNYSKTHS RGHGWTLEHL FPLKFDSLIG KGLHGEILPV INQSLPKNLL VELDVLNKGS METIFRNDHG CVGRNSILGQ KGFVRNKKTR ETNTEHSEGE HYLTKEQTEA FGIDAPPHEA RRNCRRRIQQ EEAKKRPLSS SGVDILGQES MMSKFDLAST ANDKENEPHY QDCDTDECAQ DEFVLPPTED LSSSSDEEES RPAYESVIPR TAVAHNTGSF AGASYFPMSE HETTTVDSNQ FYYRLPTQSD SSSDEDDETA MASSIRDASV IESPDSQAKM KATTRRAIED TPESTISPRV SLAEPLQGKN ELTNDVDGLL DTQDDVVSGV SDIVCAVCFS GESVDDDPIV LCDGRGKGET CNLAVHATCY SIPISCLGDA EWFCDLCRFP TNPSQPAPSC SHCHQEGGSL RRMPSFEWSH PLCPMNQNSE ARRAGFKRLR KIPKRKSLPL DLSSQIDMYH ADDNTNVPKR KLRHYRCFLD EEAGIDSDED IDGDRMEDEE LDAIEDEEAA MSSFINDSSQ LGQTQDELDL ADPPAPEDCV HRQLDLDRER KTVFSTPLLN RRMKRRKGRD SWTPTPASAP DSEKGLGNMH FIRSVIEHHR RGGDSEAIEQ LYRMEEDQSV DESGALDCEQ TLRPRIVHYC NSDSD
|
| |