Gene Information |
Locus tag | PHATRDRAFT_56602 |
Symbol | |
ID | 7197549 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 645258 |
End bp | 649947 |
Gene Length | 4690 bp |
Protein Length | 1479 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177975 |
Protein GI | 219112447 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
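The Gene Length field is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The sketch below works through that arithmetic. The function names are hypothetical, and the coordinate convention is an assumption inferred from the listed values, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode a protein, counting one stop codon."""
    return 3 * (protein_length_aa + 1)


length = gene_length(645258, 649947)
print(length)  # 4690, matching the Gene Length field

# The record lists a 1479 aa protein and marks the gene as spliced, so the
# difference is non-coding sequence (introns and any untranslated flanks).
print(length - coding_bases(1479))  # 250
```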
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.678567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCGGGCAAA CGATGCCACC ATGCAGATCG ACCGTATTAT AATCGCCTTC GGGATCGTTG GAGTGACTTT GTGTGGCAAT GTTGTTGACG GGTTGGTTCC CGTGCAAACG AGGAGCACCA GGACTCTCTT GGGGTCCGTG AGAAGCGCAC GGAATCCAGC AGGATCCAGA CCTTGTCACG CGGCACCAAC GTTCACTAGA CTCGCCGCCG TTGCCGTACA TGACGAGGAC CATGCCGTGT CACCACAGCC AAAGAAGCAC AATACTGCCC GTAAGCCGAA AATTGTGTTG GTGGCCGGAT TCGAATCGTT CAATCGCGAA TTGTATTCTC AAGCCGCCCG CGACCTCGAC GTCGACTTGA CGGTCTTCGC CGACTCGGAC ATTCGGGTCC CCCCAACCAC GACAACGACG ACGGTACGCG ATGCCCACGA CTTGGCCGTC AATCCAGACT TTGCCTCCGC GGTCCGTCAG GCCGACGCCT TTATAGGTTC GCTCATTTTC GATTACGACG ACGTGCTGGC GGTGCAATCG GTCCTCCCCG CGGTCCAAGG ACCCCGCCTC GTTTTTGAAA GTGCCACGGA ACTCATGACG TTCAATACCG TCGGAAGCTT TTCCATGGCA CCGTCAGCGG AAAAAGGTGG TGGCTCTGCC GGGCCACCAC CCGCGGTCAA AGCCATCTTG TCCCAATTTG GCTCGGGCAA GGAGGAAGAC AAGTTGAGTG CGTATTTGAA GCTCCTTAAA GTCGGTCCAA CCTTGCTGCG CTACGTGCCC GGTGAAAAGG CCAGTGATCT ACGCACATGG TTGGAAGCGT ACCGCTACTG GAATCAAGGG GGTAAAAGCA ATGTGCAGGC CATGTTGGCG TTGATTGCGG ATCGGTGTCG GGATCAACCG TCCGCCGTGT TGACCGCTTT ACCGCCTCTG CAAGTCACTC CGGATATTGG TCTGATCCAT CCAGTGCGTA GCCGACAACA ACAACAAGAC GAAACGAAAG CTGCTATCTT TACCCAGCCA CAATACTTTG AATCTCCAGC GGAATACCTG GCGTGGCGAT TGAGTGCGAA TACCCAAACG CTAGCGGCGC AACAGGGTTT CGTTTTGGCC CCGGATGACG CTCCTCGGGT AGCCGTACTC CTGTACCGCA AACACGTTAT TACGGAACAG CGTTATTTGG GAGATCTCAT TCGACAAATG GAACAAGATG GATTACTGCC CATACCCATT TTTATCAACG GCGTCGAAGC GCATACCATT GTGCGTGATT TGTTGACGAG CAATCACGAG CAAAAACTGG TGCGTGACAG GAAACTCCGG CGTGATTCGA CCTATCAGCC GTCCCAAGCC GTTTCGGTGG ACGCCATTGT GAACACGATT GGGTTTCCCC TGGTAGGGGG ACCCGCGGGA TCCATGCAAG CCGGACGGAA CGTGGCCGTG GCCGAGACCT TGCTAACAAA CATGAACGTA CCTTACGTGG TAGCTAGTCC ACTATTGCTC CAATCTATCG CACAATGGAA GACCAATGGA GTCTTAGGAT TGCAGTCAGT GGTTCTGTAC AGCTTGCCCG AACTGGACGG CGCTATTGAC ACTGTCGTTC TTGGCGGATT GGTGGGCGAT AAGATTGCGC TGGTTCCGGA ACGCGTGCGC AAATTGAACT CTAGGGTGAA GAATTGGGTG GAACTCCGGC GGACGCCACC ATCTGAGAGA CGTATTGCCA TTGCCTTGTA CGGCTTTCCA CCCAACGTGG GAGCAGTCGG TACCGCGGCC CTGCTGGACG TGCCACGGTC TCTAGACGCC CTCCTGAGAC 
GGCTTGAAAA GGAAGGATAC CGTGTGGGTG ATTGGACATC CGATCCGAAT GCTTGCGGAG AAAGTTTAGT GGCGGCGTTG GCCACCCTGT GTGAAAACCC GGTCATTACA GCCGGCGCGG ACCGTATGCA AGAAGCGATC GAGAGCAAAA TTGCCCGTGC CACGGCCGGA GACTCGACGG TAGCCGCTAC TCTAGCGCTG CCGGGAGGTG GACTCGGAGG AGCCCAGGTC GTGGCGAAAG ATATATCGAT CGATGAACTC GAAGAAATGC TGGGAAGCTA CATGATGAAA AAAGTGCGGC GAGCCTGGTC TGAGAAAGAT CGCGGTCCCG GTGTTACCAA GAATGGCAAG TTCGCTGTAG CAGGGCTCCA ACTTGGAAAC GTGTGGATTT TCGTCCAGCC TTTGTTAGGC GTCGAGGGTG ATCCTATGCG TTTGCTGTTT GAAAGAGATC TGACTCCACA CCCACAGTAT TGTGCGACGT ATGAATGGCT ACGGCGTCCC AAGGCTCTGG ATGGGCTAGG GACACAAGCA TTGATTCACT TTGGAATGCA CGGCACTGTG GAATGGCTTC CCGGGCAGCC GCTAGGCAAT GATCGCAAAT CATGGAGTGA TGAATTGCTT GGTGGCCTCC CAAATCTTTA TGTGTATGCT GCAAACAACC CGAGCGAGAG TATCCTCGCC AAGCGTCGGG GGTACGGAAC TCTAGTGAGC TACAATGTCC CTCCGTATGG TCGAAGCGGA CTGTATCTTG AGCTGGCTAA TTTGAAAGAC TTGGTCGAGG AATATCGATC GGACGAAGGA TCCAATGGAT CGAATCGAGA ACTTTTTGAA ACTATTTACG ACTTGGCCCA AAGAAGTGGT ATGGTAAACG ACGTTCCGTT GATGTGTGAT TCCAAAAATC CTTTTGATAT TGATTGTGAG CAGCAGTCAA ACGTCCTCGA GTCGTTGCCA GCCGATGTGT GTTCCGATTG GGTCGTGCGA TTGTCTGATT ATCTGAATGT TCTGCAAGAT CGCCTGTTTT CCAGTGGACT ACACGTTTTC GGGCAAAGCC CGTCCGACGA AGACCTGGTG TCTTATCTAG CGGCTTACTT CGGAGAGCAA CTATCGGAGA GCGATTGTCG TGATCTAGTG ACCAAGTGGC GCGAAACATC AAAGCAGAGC GAGAAAAGCC AAAATGTTTT CACCTCTTTC TTCCAATTTC TCGAATTTGT TGCATTTGGG GAAACCAGTT CAGATCCATC TTTCTCTGAA CAAAGCATCT CCGAGCGGGC AATGACGATT GCAGGACTTC TAGATCGTTC TTCGGACGAA CTCCAATCTG TGGTAAAGGG TTTGGACGGA GGATATATTC CTCCTGCGCC GGGAGGGGAT TTGTTGCGTG ATGGACCGAG CGTGCTGCCT ACCGGACGCA ACATTCATGC TCTTGATCCG TATCGAATGC CGTCCGCTGG GGCTTGGGCC CGCGGTCAAA AAGCAGCGCA AGAAGTCATT CGCCAGCACC AAGAGAATAA TGAAGGCCGA TTTCCGGAAA CGATTGCTGT TACGCTCTGG GGTTTGGATG CCATCAAAAC ACGTGGTGAA TCGGTGGCGA TCGTTTTAGC GTTGCTTGGG GCGAAGCCAG TAAAGGAAGG AACTGGTCGT ATCGTTCGGT TTGATTTGAT CCCTTTAGAA GAGCTTGGCC GGCCAAGGAT TGATGTTTTA GCCTCCCTTA GTGGCATATT CAGGGATTCT TTTGCCAACA TCGTAGACCT ACTTGACGAT ACTTTTCAAA GAGCAGCTCA GGCCGACGAA TCAGACGAAA AGAACTTTGT 
TAAGAAGCAT ACCAAAGAAC TGATAGCTAG TGGTGTAAAA GATGGAGCTG CTGCAAGATT GTTCAGCAAT CCACCGGGCG ACTACGGAAG TATGGTGAAC GAGGTTATAG GAACCGGAGA TTGGGAAGAT GAGGAGAGTT TGGGAGAGAC ATGGAAAGGA CGGAATGTTT ACTCTTACGG TCGAAAAGAA GGACAGAGCG GAACTGGGGG GACTGCTCGT CCTGCTGTTC TTAATAAGTT GCTGGCAACG ACCGAGCGGG TCGTTCAAGA GATCGATAGT GTCGAGTATG GTCTTTCCGA CATCCAAGGT ATGTGAAACA AAGACTGGAA ACCTTTTGCT CAGACTGTTC TCACCAATTC TGTTTTTCTC TTTGTCAATT AGAGTATTAC GCCAATACAG GAGGTACGAA TTGTCGCCAA ATATAAAGTC CACGTTGTTG ACGCAATTGA GCTTATGACT AACATATGCT TTTGCCTGTA GCGCTGAAAA AAGCGGCGGA AAATAGAAAA GAGATTGATC CGAAGACGGG GAAGAAACGG AAAGTATCTA TATCGGTAAT AGAGGCATTT GGTGGATTTG ACGAGAACGC TCTGGTTCCG GTCAAAGATG TCGAGGAAGT GCTTAGGTTA GAATACCGAA GCAAATTCCT TAATCCAAAA TGGCGAGATG CAATGCTAAA GCAAGGGTCT GGTGGAGCAT ATGAAATCTC TCAACGGATG ACAGCAATGA TTGGTTGGTC AGCGACGGCA GAGATTGATA ACTTTGTCTT TGACCAAGCT GCTGAACGTT ATGCACTTGA TGCTGATGTC GCCAAACAAC TTCAACGCGC CAACCCGGAA GCGTTCAAGA ATATAGTGAG GCGGCTGCTT GAAGCTTCGG GTAGAGGTAT GTGGTCAACG GACACTAGTA CGTTAGATAA GCTGAAGGAT CTTTACGCCG ACGCGGACGA CATAATCGAG CAAGGGTCCG ACTTTCGGGC AAAGATCAAG CAAGAAGCAT AATAATACGT CGGAACGCCA ATGCAGTCTA GCCCAAAGCC CCTCCAAATG TTGATAAGAA TACATAGTTT CTATTTGGTT
|
Protein sequence | MQIDRIIIAF GIVGVTLCGN VVDGLVPVQT RSTRTLLGSV RSARNPAGSR PCHAAPTFTR LAAVAVHDED HAVSPQPKKH NTARKPKIVL VAGFESFNRE LYSQAARDLD VDLTVFADSD IRVPPTTTTT TVRDAHDLAV NPDFASAVRQ ADAFIGSLIF DYDDVLAVQS VLPAVQGPRL VFESATELMT FNTVGSFSMA PSAEKGGGSA GPPPAVKAIL SQFGSGKEED KLSAYLKLLK VGPTLLRYVP GEKASDLRTW LEAYRYWNQG GKSNVQAMLA LIADRCRDQP SAVLTALPPL QVTPDIGLIH PVRSRQQQQD ETKAAIFTQP QYFESPAEYL AWRLSANTQT LAAQQGFVLA PDDAPRVAVL LYRKHVITEQ RYLGDLIRQM EQDGLLPIPI FINGVEAHTI VRDLLTSNHE QKLVRDRKLR RDSTYQPSQA VSVDAIVNTI GFPLVGGPAG SMQAGRNVAV AETLLTNMNV PYVVASPLLL QSIAQWKTNG VLGLQSVVLY SLPELDGAID TVVLGGLVGD KIALVPERVR KLNSRVKNWV ELRRTPPSER RIAIALYGFP PNVGAVGTAA LLDVPRSLDA LLRRLEKEGY RVGDWTSDPN ACGESLVAAL ATLCENPVIT AGADRMQEAI ESKIARATAG DSTVAATLAL PGGGLGGAQV VAKDISIDEL EEMLGSYMMK KVRRAWSEKD RGPGVTKNGK FAVAGLQLGN VWIFVQPLLG VEGDPMRLLF ERDLTPHPQY CATYEWLRRP KALDGLGTQA LIHFGMHGTV EWLPGQPLGN DRKSWSDELL GGLPNLYVYA ANNPSESILA KRRGYGTLVS YNVPPYGRSG LYLELANLKD LVEEYRSDEG SNGSNRELFE TIYDLAQRSG MVNDVPLMCD SKNPFDIDCE QQSNVLESLP ADVCSDWVVR LSDYLNVLQD RLFSSGLHVF GQSPSDEDLV SYLAAYFGEQ LSESDCRDLV TKWRETSKQS EKSQNVFTSF FQFLEFVAFG ETSSDPSFSE QSISERAMTI AGLLDRSSDE LQSVVKGLDG GYIPPAPGGD LLRDGPSVLP TGRNIHALDP YRMPSAGAWA RGQKAAQEVI RQHQENNEGR FPETIAVTLW GLDAIKTRGE SVAIVLALLG AKPVKEGTGR IVRFDLIPLE ELGRPRIDVL ASLSGIFRDS FANIVDLLDD TFQRAAQADE SDEKNFVKKH TKELIASGVK DGAAARLFSN PPGDYGSMVN EVIGTGDWED EESLGETWKG RNVYSYGRKE GQSGTGGTAR PAVLNKLLAT TERVVQEIDS VEYGLSDIQE YYANTGALKK AAENRKEIDP KTGKKRKVSI SVIEAFGGFD ENALVPVKDV EEVLRLEYRS KFLNPKWRDA MLKQGSGGAY EISQRMTAMI GWSATAEIDN FVFDQAAERY ALDADVAKQL QRANPEAFKN IVRRLLEASG RGMWSTDTST LDKLKDLYAD ADDIIEQGSD FRAKIKQEA
|
| |
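The sequences above are printed in space-separated blocks of ten residues, so they need cleaning before any analysis. The following is a minimal sketch, assuming plain A/C/G/T input, of how the blocks could be joined and a GC percentage computed. The helper names are hypothetical. Applied to the full gene sequence, the result can be compared with the GC content field; that comparison has not been run here.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (blocks allowed)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence listed in this record.
fragment = "TGCGGGCAAA CGATGCCACC"
print(len(clean_sequence(fragment)))  # 20
print(gc_percent(fragment))           # 65.0
```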