Gene Information |
Locus tag | PHATRDRAFT_48282 |
Symbol | |
ID | 7203379 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 719534 |
End bp | 725803 |
Gene Length | 6270 bp |
Protein Length | 1567 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182749 |
Protein GI | 219124937 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAATGT TTGCATCGAC CACTCGCAGA CATCACGCTT CTCTGACCAG TGTTCTTTGC TTGCTGGTCT TGACAGTTCC TCTTTTTGCG CAGACAACGC CGCAACGTCA AATTGTGTCA GAACTCTCGC TCTGCGTGGA GGTGGCTGGA GCGAGCCAAG ACGACGGGGC CTCCATATTT CAAGGGGATT GTAATGACGG AAACAAGCAT CAAGTCTTCG ACTTCATTCC TGCTCCCGGT ACAGACAGCG GTTTTCATCG AATTCGAGCC TCGCACTCCA ACAAGTGCCT TGGCGTGGCT GATGGGGCTT TAGCACCTGG AGCTGAGGTA GTGCAACTGT CTTGTGTCGA CAACAATCCC AGTACGCTGT GGAAACAACG TGATGGGGCA TTGATCTTGC AACACTCTGG TTATTGTTTG AGCGTCGATG TCAGTTCGCG TTCCACATCA GGGGCCTTTA TGGTGCAATG GTTTTGCGAC AGTCCCAGCG TCAAGTGGCA AGTTAAGAGT GTGTCCGAGC AACATGACGG ACTACAGCAA CTTTCTAAAC GCGGCAATTC TGTGGGAAAG TCTCAATCCG GATTCTATCT CAGTACGGGA ACGCCCCACG ACAACCAGTT TCTTAGTACG AATTCATCCT GGACGGAGAC AGAAACCAAC GGCGCAACAT ATTCGGGATG CGGTAGCTAT GGGCAAGACA CATTTAACCG TCATCGCACT GGGCCTTCCT TTGTATTTAC GATACCAGGG TTCGTTGCGG GCGATACGTA CGACGTAACA ATGGGTTTCG CCGAGATTTG GGCCCCGCAA TGTGAAGACG GAAAGCGAAT TATGGATATA GCAGTCAATG GCAGAAGCTT TGCGCATGAC TTGGATGTAT ACAATGCCGC TGGTGGGTGC CATGCTGCTT TGATTCTGAC CAAGTCGTTC GAGGCCAATA GCAGAGGAGA CTTTGTCATC GATTTTTCAA GTACGATTCA CAACCCAATT GTTTCATTCA TTCAGATACA ACATGTTGGA GCAAAGGGTA TACAAGCCCG AATGCTAGCG GATGTACCTA CGGCTTCGCC AAAGCTCAGC ACCGAGCCCA GGATTGCGCG AGCAATAGAT GTCTCCACCA TTGTATCAGA GGGATTGAGT GGTCTCTGCG TTGATATAAT TGACGGCAGT GAAATGGATG GTGCTCAGGC TATCCAGGCG TCTTGCAACG GAGGTGCCAG TCAGGAATAT CAATTCTTGA TTGTTGGGGA GGGTCAGTTT CAAGTTGTGG CCTCTCATTC ACAAAAGTGT TTGGGTGTCG CCGACTTCGA TGTGACTGAG AGCGCCAATA TCATTCAACT ACCGTGTACA AACAACACCC TTTGGTACGT TGTTGGCGAT AGCGCATACC TGCAACTTCG CGTTCTCCAC AGTGCCAAAT GCTTGAGTAT TTTCGGAGCT TCGATTGCTT CGGGTGCGAA GCTTGTTCAG AAAGCCTGCA ATGAAGGCCT CGATCAGCGC TGGAGTGTCG CTGACGGCTT GTTTGTAGTG GAGACGCTGA AGCCTTCAAC GTCAGCAGCT CCAAGCCCGA GTCCCAGTGT GAGCTTTGAG CCAACGAGAG CGGGAGCGGG AAAGGGCCCC CCTGCTCCGA CAATTCAACC AATTGACGTT CCAAGTAGTA TGCCTGCCTT ACGCGGATCC AGGTCACCCT CGCGGCGACC TTCGCAAGCC GTAGGTGTTT CACCGGGGCC ACCGCTGTTC TATCTCAATA CTGGAACTGA TCAAGACCTT GAATATATCT CTGGTGGTGC CGTTGGTGTC 
TATTCCAAAC CTGGGGCTCA GATTTTCGAA GCTGGAAGTT ACGGAGAGGA GACATTCAGG AGTCACCGAT GGGGAGAAAC GTTCGTGATT ACTATCCCTG GACTCAGTGG CGGCGAGATG TATACCATAT CACTCGGCTT TGCTGAGATC TTCTTTTGTG ACGAAGGTCA GCGTATTATG ACCATTACTG TAAACGACGA GGTTTTGGAG GCCGACTTGG ATGTGGTTGC TACCACTGGT GCGTGTAATA CTGCTCTCGT GCTGAGGGAA GATTTTGCAG CAAATAGTGA TGGTGCATTT GTCGTTGCTT TTTCCAGCCT TGTCGACAAC GCAATGGTTT CGTTCGTCGA GATTCATTTG GCGGGGACAT TGTCACCATC CTTTTCTCCA GCTCCTGGCT TTGTACCGAG TTCAAGTGCA GAACCTACCG TAGCTCCGGA GCCAAATCCG TGGGAAGGTG AGTACGCCAT GTCTCTGGTA GCTGTAGCTG CTGCAAATCT GGACGACGGA AGAATTCTGG CTTGGTCCGC GTGGAGTAGG ACCGACTATG CCCGTAGTGT CGGAAAAACT TTCATTTCCA TTTTCGATCC TTCCACCAAT GAGAGTACTG AGGGAGAAAT CACAAACACG AATCACGATA TGTTTTGCCC CGGGACTGCT ACGTTAGGCG ACGGTCGAAT TATGATTACC GGAGGGTCTA GTGCAGCGTC CGTTACATTT TTCGACCCCT CCGCTAATAA CTGGTACAGA GGACCACCTA TGAATATTCC TCGGGGCTAC CATTCGATGA CTGTGCTAGG GGATGGATCG GTCTTTACTC TCGGAGGGTC ATGGAGTGGT GGACGAAGGG GCAACAGGGG AGGCGAGGTA TGGAGTCCAA GCGGTGGGTG GGTTTTGAAG ACTAACATTC TCATTCCCGG AAGCTCCACC TTGCTAACAA ACGACCGTGG TGGTATCTTT CGGTCGGACA ACCACATGTG GCTCTTCACT GCTCCCAACG GTAAAGTATT TCATGCCGGA CCGTCGCAAA GAATGCACTG GATCGATATT GCTGGCGAAG GAGAAATATC CGACTCTCTT CTCCGTGGCA ACGACAACGA CGCTATGAAC GGCAATGCGG TCATGTTTGA TATTGGCAAA ATTTTCACCG TCGGAGGCGC TCCCAATTAT GAGTATGGTG ATAACGAAGG AACAAAGCTG GCCCACGTTA TTGATATCAA CGCTGGAGAA GGGTCTGAGA CTGTTGAGAG GGTCGGTGAC ATGGCCTTCG CAAGGACACT GGCTAATAGC GTGGGTCTTC CCTCGGGAGA GGTGATTGTT GTTGGCGGTC AAACGCGAGT ATTTCTCTTC ACAGACAGAG AAGCTGTCTT TGCTGCCGAG ATTTGGAGTC CTATCACAGG CCAGTTTACG ACTTTGGCGG AAATGAAGAT ACCGCGAACC TACCACAGTG TAGCAATATT GATGAAAGAT GGTCGCGTAT GGGCAGCAGG TGGCGGCCTT TGCGGAAATT GTCCTACAAA TCATCAAGAC GCTGAGATTC TTACACCACC CTACTTGCTC AATGGAGATG GTTCCCTTAA GACCCGGCCT GTCATAGAGT CTTCACCGTC TCGGATAGTC CCCGGTGAGA CGATTACTGT ATCGGTGGAC AGGAGCGGTA GCCACAACTT TGTGCTTATG CGGATTTCCG CTGTTACTCA CTCTGTGAAC AACGACCAGC GGCGCATACC ACTCACGATT GTAGGTGGCG ACAATAATTC CTTTCAATTG ATTGCTCCGG ACAACTACAA TGTGACTGTA CCTGGAACAT 
ACTTCTTGTT CGCCATGAAT GCAGATGGTG TTCCAAGTGT TGGAAAGACG ATTGTAGTTG ACGCACCAGA CGGTCCGGTG CCAGAGCCGT CGCTCATCTT TCCTATCGAG TCTGCGGACT TTAGTGGTCT GTGCGTAAAT ATTGCCAGCA ACAGCTTTGA GAATGGGGCC CAAGCTACTC AATGGACGTG CAATGAGAAC GCTAACCAGC AATTTGAGTT CCAATTTGTG GAAGGAGGGC TCTACCGTAT TGTCGCCTTC CATTCCCAGA AATGCTTGAC TGTTACTCAA GGATCGATAA GTGAGGGAGA AAATATTGTA CAAGAGCCTT GTGACGACGT TCCTCATCAA CTATGGACCG TCACGGGATC TGGGAGTGGC CAGGAACTCA AGGCTTCACA TTCCAACAAA TGCTTGAGCA TTTTTGAATC CTCCATGGCG ACTGGCGCGA GGTTGGTTCA ATTGGAATGC ACTGAGGGAC CCGCTCAACT CTGGCAAATA GATGACCGAT TGACCTTATC GGAGGTAGAG GCCCCATCCC CCTTTAGGCC ACCGAGCTTG GCACCAAGTG TGAGCCCCGC GCCAGTTGCA GCGTTGGCAA GTCTACCACC GACAACAGCC GCAGGGCCGC CAGTCTTCTC CCTTCGAACA GGATCATCGC TAGACCTCCC TTACATATCA GCTGGGGGGC CGGCAAGCAA GACTTACCAG GATCCTACTC CGTCAGCAAT CTCCGGAGCC GGAGCGTATG GGGACGCAAC ATTCCAGCGC CATCGATGGG GGAACACCTT TACTTTTACC ATCCCTGGCT TCGTGGCCGG CAACACATAC GCTGTCACCC TTGGTTTCGC GGAGGTCTAC TTTTGTGCTG CGAGTAGCTC CCGTACCATG ACTATAACCG TAAACGGCGA AAGCTTCGCC ACGAATTTGG ACGTCTTTGA AGCTGCCGGC GGTTGCAATT CGGCTCTTTT ACTGACGCAA GAATTCAGCG CCAGTAGTGC AGGTGATTTT GTTTTAGCCT TTGCCAGTCC GATTCAGAAT GCCATGATAT CACTGATCGA AGTAAGATCA GACGCAGGAT CGCCTCTTCC TGATGAGTCT GTGCAGGCAC CGCAGGGTCC GACTGTCTCC CCAGGGACGA CCCAATCACT GCAACCAAGC TCGGTTCCGA TGCTGAGCGC CGAGCCAACG GATATATCGG TGCCACCTTC AGTCGTGCCG TCTGTACAGC CAGTCACGAT GCCCAGCATA ATTCCGTCCT CAATAGTAGG GCAGCCCAGC GGTTCACCGC TTGATAAGCA GAGTCGGACA CCGTCTGTCA TACCATCCTC GTCTCCGAGT TTCAGTCGCG GGTCAAGCTT GACTCCAAGC ACAGGTCCTA GCTTGACTCC TAGTTTATTC CCATTTTCCC TGCCATCGAC CACCCCAAGC CGATCACCTG CTTTCGAGCG GAGTCCTACC CCATCAACAA CACCAAGTAA CTCTCCGTCG TTGCAGTTGT CAGAAGAGCC GAGCGCCTCT CCCCTTGTCG CGCAAAGTCG TAGTCCGTCT ACCGTGCCGT CTTTGTCTCC AAGCTCCAGA CCTGAGTCGT CCATAAGGCC AAGCGAAGGA CCTAGCTCAA CACCTAGTTA TGGGCCTGTC TCCCCGCCAT CAACCACACC AAGCCGATCG CCTGCTCGAG AGCCGAGTCG CGTTCCGTCG ACCACACCGA GTTCATCTCC AAGCGCTCTG CCAAGTACGA CCCAAATTAC TACTGGTGTC CCAACTACAG CTACTCCGGG AGAAGCATGG ACGATACAAT CCAGTGCCGC 
AAATAATAAC TGGACTGCGG TGATATATGG CAACGGGACA TTTGTCGCAG TTGCGGCAAC CGGCATCGGC GACCGGGTCA TGACAAGTCC CTATGGTACC ACGTGGACGA TACGAGCGAG TGCAGCAGAT AACGACTGGA ACGGTCTGAC GTACGGTGAT GGGATATTCG TCGCTGTTGC CAGCACGGGT CTTGGCAACC GGGTCATGAC CAGTCCAGAC GGCATCGCGT GGGCTAGTCG ACCGAGTGCT GCGGACAATA ACTGGACTGC AGTGGCGTAC GGCAATGGAA TATTCGTCGC GGTTGCGGCA TCTGGTATTG GCAACCGTAT CATGACGAGT CGTGACGGCA CAACTTGGAC ACTCAGAGGG AACGCCGTAG ACAATGAGTG GCGAAGTGTA ACGTATGCCG AGGGGACGTT CGTCGCTGTA GCGAGTACCG GCATTGGGAA CCGAGTCATG ACGAGTCCCG ATGGCATCCA GTGGACGATT CAAACCAGCG CTGCAGACAA TTGGTGGTCT GCCGTCACGT ACGGCGACGG GACGTTTGTT GCTGTTGCGG CAACCGGCAC TGGCGACCGG GTCATGACAA GTCCCGACGG GATCACATGG ACGACTCAAA CAAGCGCCCC AGACATTGAT TGGCGCAGCG TGACTTATGG GGATGGGATA TTCGTTGCCG TTGCGAGTAC CAGTATTGGC AACCGGGTCA TGACAAGTCC TGACGGGATC ACGTGGACGA CTCAAGGCAG TGCCAACGAC AATGATTGGC ACTCCGTGAC TTATGGCAAT ACAACATTCG TTGCCGTGTC AAAAACGGGA ATCGGAAACC GGGTCATGTC CAGTGGTTAA
|
Protein sequence | MGMFASTTRR HHASLTSVLC LLVLTVPLFA QTTPQRQIVS ELSLCVEVAG ASQDDGASIF QGDCNDGNKH QVFDFIPAPG TDSGFHRIRA SHSNKCLGVA DGALAPGAEV VQLSCVDNNP STLWKQRDGA LILQHSGYCL SVDVSSRSTS GAFMVQWFCD SPSVKWQVKS VSEQHDGLQQ LSKRGNSVGK SQSGFYLSTG TPHDNQFLTK GIQARMLADV PTASPKLSTE PRIARAIDVS TIVSEGLSGL CVDIIDGSEM DGAQAIQASC NGGASQEYQF LIVGEGQFQV VASHSQKCLG VADFDVTESA NIIQLPCTNN TLWYVVGDSA YLQLRVLHSA KCLSIFGASI ASGAKLVQKA CNEGLDQRWS VADGLFVVET LKPSTSAAPS PSPSVSFEPT RAGAGKGPPA PTIQPIDVPS SMPALRGSRS PSRRPSQAVG VSPGPPLFYL NTGTDQDLEY ISGGAVAPGF VPSSSAEPTV APEPNPWEGE YAMSLVAVAA ANLDDGRILA WSAWSRTDYA RSVGKTFISI FDPSTNESTE GEITNTNHDM FCPGTATLGD GRIMITGGSS AASVTFFDPS ANNWYRGPPM NIPRGYHSMT VLGDGSVFTL GGSWSGGRRG NRGGEVWSPS GGWVLKTNIL IPGSSTLLTN DRGGIFRSDN HMWLFTAPNG KVFHAGPSQR MHWIDIAGEG EISDSLLRGN DNDAMNGNAV MFDIGKIFTV GGAPNYEYGD NEGTKLAHVI DINAGEGSET VERVGDMAFA RTLANSVGLP SGEVIVVGGQ TRVFLFTDRE AVFAAEIWSP ITGQFTTLAE MKIPRTYHSV AILMKDGRVW AAGGGLCGNC PTNHQDAEIL TPPYLLNGDG SLKTRPVIES SPSRIVPGET ITVSVDRSGS HNFVLMRISA VTHSVNNDQR RIPLTIVGGD NNSFQLIAPD NYNVTVPGTY FLFAMNADGV PSVGKTIVVD APDGPVPEPS LIFPIESADF SGLCVNIASN SFENGAQATQ WTCNENANQQ FEFQFVEGGL YRIVAFHSQK CLTVTQGSIS EGENIVQEPC DDVPHQLWTV TGSGSGQELK ASHSNKCLSI FESSMATGAR LVQLECTEGP AQLWQIDDRL TLSEVEAPSP FRPPSLAPSV SPAPVAALAS PSLTPSLFPF SLPSTTPSRS PAFERSPTPS TTPSNSPSLQ LSEEPSASPL VAQSRSPSTV PSLSPSSRPE SSIRPSEGPS STPSYGPVSP PSTTPSRSPA REPSRVPSTT PSSSPSALPS TTQITTGVPT TATPGEAWTI QSSAANNNWT AVIYGNGTFV AVAATGIGDR VMTSPYGTTW TIRASAADND WNGLTYGDGI FVAVASTGLG NRVMTSPDGI AWASRPSAAD NNWTAVAYGN GIFVAVAASG IGNRIMTSRD GTTWTLRGNA VDNEWRSVTY AEGTFVAVAS TGIGNRVMTS PDGIQWTIQT SAADNWWSAV TYGDGTFVAV AATGTGDRVM TSPDGITWTT QTSAPDIDWR SVTYGDGIFV AVASTSIGNR VMTSPDGITW TTQGSANDND WHSVTYGNTT FVAVSKTGIG NRVMSSG