Gene Information |
Locus tag | PHATRDRAFT_14760 |
Symbol | |
ID | 7203411 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 48900 |
End bp | 55174 |
Gene Length | 6275 bp |
Protein Length | 415 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182602 |
Protein GI | 219124630 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCTTCCCGT TGGGTCTATG CGAAGGCGAC TGCGACCGAG ACAGTGACTG CCAAGGCAAT CTTCGATGCT TTCAACGTGG TGCTAGTCGG GAGTCCGTGC CTGGTTGTAC CGGAGCGGAC GCCGATGCAA CTCGTTTCGA CTATTGCTAC GACCCCCTCA CTCCTACACC CGCCTCTGCA CCTCCGAGTG GGATTACGAC CATGCCGACC GCAACCATAA CCGCTGCACC GACCGGCGTG ACAACATCAA TGACCACTTC CATGCCCAGT TCCAGCAACA CCACAGCCGC GCCGGTAGCG ACAGCACCTC CAATTACGAT CGACGCATTT CGACTGAAGC TGTACTGGGA AGATGGATAC TATTGGCAAG AAGAAACCTT CGAGCGCCCC TGGTGTGTGC GATGCACCTC CCTTGGTTGT CAGGTTGATG GTGCTATGAT CATTGTCAAT TGTGACAATG ATGGCAACCA TGGAGAACCC GATTTGTTCC GGTTTCTGCA GCTACCCAGT ACCGACGGCA CGGCGACCTT CCAAATTCAA GAAATCACCA GTGAGCTGTG TTTTCAACGA ACACAGGATC GGCACATTTC ACTGCAACCT TGCCAAGCCG AGCTCCTTCG GCAGCAATTT TTTGCGAGTG GCGGCAACGT CGCGATCGGT GATCGGTTCC AAATGCAGCC TCTTGGCTAT CCGGGGTTTT GTGTTACCCA GGATCACCAT CCCAAGTACG GAGAGTTCTT AGAATTGCAG TCGTGTCCAT ATGCGGAATA TTCCGAGACA TCCTACTGGG TCCTACTACC TTAAAACCAG TAGCAGTTGG ATCACCACTG ATTCCAGGAT GGAGAGTAGA ACATCCTGGT TTGTTGTAAA AGTCGGAACA GATTACTGGA CTGTGGAAGC ACACGAAGAG GGTATAGCAC TGAGACCCAG TGGCATGAAA CCCTACTATT GTATTAGAGG AGTGCTGCAG TGGAAAACTA ACTACGATAA GAATTCCACA ACATTTCCTG CTTGCTATGC TCCCGGAAGG ATAGGGAGGT CTAGACTTTT CCATGGGTGC CCGCTGATTG CAACAGGGTG CTTCCTGACC ATATTTGGTG GACATCGTCG CACTCTCAGA CATTTTCAAA ACATGCATGG TCTTAAGAAA TAAAAAGCGC CTAACTGTAA GACCTTCTCA AGGGCAATAT CGGGGATTGG CATGCAATCT AAGATTGCAT GCCAATATGA CGTGACCAAG ATTGTGTGTG CCTTGTCACG CTTTGCTGTT TCCATATATT AAACATATAT ACTTAGTTGG CATCTAGCTA CAGCCAGGGT CATTCACGCC TGAGTCGCGG AGTACGCCCC TACCTAAGTA GCCGTCTCCT ACCGAAGTTT TGGGGCTGAA AGGGACACTA TTTGCATAAA CAGCGAATTA GAGGCTATCT CGCGCACATT TTGGAGTAGC TCGGTCCTGC AATAAATGAA GGCCCAGAAA CAAACCATAG GTGAACGGGG GAGGTCAACC GAGAGGCTCG AAATGTAAGC CCTAGGGACG CAGATAGCAG CGGTACCCAA AGGGAATTAA TCCATTCCCT CTGTGTAGCA CTGAACAATT CCTCAACTCT ACACATCGCA GCATAGGAAC CGAAGTGTTC ACTTGGATAC AAAACATGAA TCGTTGCCTA CCCGTCGTGG TACTGTTATC TCTCATCGTC TTGGTGCAGG GCATAGTTGT TGTGGACGAC AACAATGACC ATCGGGACAT GAATAGAGTC GAGTCGTCCG CTGAAATGAA CGACGTTCGT CGTTTACAAC 
CAAACATCCG TTTGGCCGGC AACAATGGAT TCCCCCTCAC GGCATTTCCT CTAGGTGTCT GCGAGGGCGA CTGTGATCAA GACTCGGACT GCGCGGGCGA TCTACGGTGT TTTCAGCGCT CGAACGGTCA TCTGGTGGTG CCGGGATGTG CAGGGGGGGC TGCTGACGTG AGTCATTTCG ATTACTGCTA CGATCCGTCC TTTGTCTTAC CACGTATCGC GAACGAGACT TTTCTCGTCA AATTATATTG GAAACACGGG TATCACTGGC AAGAGGAAAG GTTCGAACGG AGATGGTGTT GGCGGTGCTC TGCCGATTGT GAGGCTGGTG ACCATGTCAA GGTGGTCAAC TGCGACAACG ACGGGGGTGT AGGTTTTCCT AATCGGTTCC GGTTTCTGCC TCAGATAGGC GGAGATGCAC AAATACAAGA AGTCAGTAGC GGCTTTTGTC TGCAGCGCAC TCACGTGCGG CAACTTACTT TGCAACGCTG CCAACCCAAG TTGGTCCGTC AACGCTTTAT GCCCATGAAG GGTGCATTCG ACTTTGACGA TCGCTTCGAA ATCAATGCGG TCGAACGCCC CGGGTATTGC GTGACTCAAG GACATCATCC GAAATCTGGA GAGCTGCTCA AGCTACAACA GTGCAAGTTC CCTGAGGCAG CCGACACATC CTACTGGAGT CTGGAGGCAC CTGGTAAGGA CGGAAATCTC TAAATTCACA AGCAACTTGA TTTCGGTGGC GTTCCAACTA TAAAATTAGA CTTTAGAATC CTGCCACCTT CTCCGTGCAA ATCAACACAT CGTCGTTGGG CTTTCAACCA CTGATGCTTA GCAAATGCCA TACAGCATCT TTCCGATGTG TTTGTTGCGC GCAAGCTTTG TGAATTTTTG TGACCGAGAC ATTTCGCTTT TCGAACGCTG GCAATATAGT GCTGCTGATT TAGCTTTTCC CGTACGTCGG CTCCAGTTGC CATCCCTCCG CCACCCGGCC GTTCTAACAA TCGCTCCAGT AAAAAACAGC ATTCCTCCTT CTTCGCGCTT TCTGCTCGCT CTATTCCATG GAGACTTTGG ACAATACTTG AAGAGGTTGA CCGGGGTGTT TTCGTCGAAA TGGAGGAATA GCCCTTCCGG GACCTAGAGT GATCCAGAGC TTTTCTAGGG TTTCTGATAA TTGCTAGAAA CCGGATCCAA GACCGGGATC CAAGACCTCC GAAATCTGTC AAGGTAAAGC CGTTTAGCAA CATCTGCACA ATTGCATCGG ATATTCCAGT GCCAGGTATT CCTCCATATT TTGATGCAAA CTGTCTACAC AGAAAAGGAA ATGACGCTTC CCTGAGGTAC ACGCAGGGCC AATACCTCTC TTTTATGTTT CATTAGTACA CCTCGATGGT CACCTAATGT TAGTCAGTAC GTTCTTTTAT ATCCTGATCG AGCCCTCTCA TTGTGGCAAT GAAATTGTCC GAACGGAATA GGTACATAAA TGGTTCGACA CATTAGGCAG CCGATATACA ATTATGAAAC CAGAAAGAAA TTAGAAATTT AATTGAGGAC AGAAAAACTG GCTCCATAGT CAGCAAATAA TTTTAAGAAT AACATATTTG TTTGTGATTT TGTTTTGTGA TAGAAGGAGT GAAATGATGT GAAGTGCGTC CGTGAGGTTC GCATGTCTTT CGTTCAATTC ATGTGTAAAG TCCTACGGGA ATGCAAAAAA AGTGAGAGGT TCGAGGTCCG TCTTGGTCTT CACTGGTTGA GCAATGCCCT CCGTAGTATT GCCTGCATCT GAAGGAATGA GGGGCTAGAA ACCTTTCCAC 
TGGAGAGCTC CCCATATTTT TTCTCACAAA CATTTTCGGC ATCATGAGGC TACGTTCATC CATTACCCTC CTCTTCGCTA GCACGTATGG CCCAACGTTG GTAAGTGTTC AGTTGGCGCT TTCCACTGTC TCCTCAGCGC GTGCCGATCC TTCGCTTTCC TCTCTCGGAT CAGAGGTCTC TTCTGGATTG CCCGACGAGA ACCGAAGTGA CCGCCGTTTG CAGTCTGTCG CAGTTGTTGG CGATAATGGC TCTCCTTCCT CTCGCTTCCC GCTAGGACTC TGCCAAGGCG ATTGCGACAG AGATAGTGAC TGCGCTGGAA CCCTCCGCTG TTTTCAGCGG AACGGAGGAC AAGATGTCCC TGGTTGTGCT GGTGGCGCGA AGGACCGTAG CAAATCGGAT TATTGTTACA ATCCCAATGC TAGTGTGAGG CCCCCCCCTG CACCTGTCGC CACTCGGCCG ATTCCCGCGC CTGTGCCCTC TCCTGCTGTA CGTCCGGGAG CTCCCTCCGG CATAACATCG GAAACATTCA TTTTAAAGCT TTACTGGGAG GAGGGATACT ATTGGCAGGA GGAGTCGTTT GAACGTCGCT GGTGTATCCA ATGCCGTAAC GGATGCAAGG TCGGAAAGAG CACGAAGATC GTCAATTGTG ACAACGACGG TGGCCGAGGT TTGCCGAACC GCTTTCGTTT TCTACCTCAA ACCAGCAAAC AGGTGCAGAT TCAAGAAGTC TCTAGCGGAC TGTGCATGCA GCTCGCACCC AAACGTCAGA TCACGCTTGC TGTCTGTCAA GCGAGCGCCT CCACTCAGCG TTTCTTTGCG AGGGTCGGAG AATTAACGTT TGGAAGTAAA TTCGAACTGA GTCCGGTCAA CCGTCCCACG TTTTGTGTGA CCCAACGTCA CCACCCGAAA GACTCAGAAA TAATCGAATT GGAGCCGTGC GACACTGTGG CTGAGCGCTC CGATACGTCG TTTTGGAATC TTTTGCCGTA GAGCGACCCA AGGTGTTCTT GCGAAGGTAG CAAGCAGTCA GGGTTGATAC CTCATTTCCC CTAGTAAATT TCAAGGCCGT AATGTTTGCT CGTTGCACAA GTTCTTCCTC TTATCTTTGG GGAAGTAATG ATTTCCACTT AGGCGACCTT CGATCTCGTC TTCAGAGCAA CTTTCATTGA ACGGACTAAT TATGCGTAAA AATGTGAGAA ATCCTATATA CACAGGAAAA ATCTAAAAAA CCGGTGAGAA ATTTGGAAAT ATTTTCTTAT TCATTTTTAT AAGCGGCTCT CTCTAGCTAG CTAGAGCGAA AAAGAAAATA AAATTTTCAG GGTAGGATAT CTTGAAGCCC GCGCTGACTC TGAAATTGAA GTGGTAAGCA GATGCGATTG AATCGGTAAA GGAACCACGT GAAAGCGGGG CTGGGTGTTG AGAACTGTGC GTGTGTTTCG CTTGAAATGA ACGATGAACG AGCACGTACG TGGAGGATTA TTATGGACGG GTCTGGGTCT ACACGCCCAG TAACATTACC TACTAGTGAA TTTTCTTTTA CAGTTAATTT CTGTTCTCGT GATCACGTCT GCCACAGGGC CACAAGCTTA ACTGAAAGCT GGGCTTACAT TAATATCCTA CGTCTGTCTG AGATTGCCTC CATTTCAAAT GGCCGTTAGA AAAACCGCCT GGCGCTGTTT TTAAAAACAT TTTGTGAAAT ATGAAATGTC TTCGCGTTTT TTTGCAACGA TTCAAACTTA TGCAGGCACG GCCCTGAAGG ATTCAAGGAA CCGATGAGGC TTTTTGGTGG AGGTTCGCAT GCGTTCGCTT 
CAATACATGT GTATAGCTTT TTCTAAGAGA ACGACAAAAA TTGAGAGAGG TTGAAGGGCG GCCGCAATCT TCCACTGTGG ATGCCGAGGT CATCTCCTCT GCACTGTAAT ACTTTTACGC TTACAAACAT TTCATTCTTT GCTTATAAAT TCTTTCGGCA TCATGAGGCT GCGTACATCC ATTGCACTTC TCTTCGCTGG CACTTCTGGC CCAACCCTGG TAAGCATTCA GTTGGTGCTT TGTACCCTCT CGTCAGCGCT TGCCGATCAT TCGCTCTCTT CTATTGAATC CGAGACTTCA GCATCTGGAT CGCCCTACAG CAATCCTAAA GATCGCCGCC TGCAGTCTGT TGCTGTCGTT GGCAACAATG GATCTCCTTC TTCTCGCTTC CCTCTAGGCC TCTGCCAAGG CGATTGCGAC AGAGACAGTG ACTGTGCTGG AACCCTCCGC TGTTTTCAAC GGAACGGTGG ACAAGATGTG CCCGGATGCT CTGGTGGCTC AAACGAGGGA AGCAGATCTG ATTTTTGCTA CGATCCTGCA CCTGAAGCCA GCAAGCCTGT ACCAGTACCG ACTCGGCCGA ATCCCGCGCC GGTACCACCT CCTCCTGCAG GACCTGTTGT TTCCCCTGTC CCCGCTCCAG TTGCTGCCCC TTCTCCTTCT GGCCCCCGTC CAGTTGTGAC GGTTGTTGGT GATAACGGTT CACCTTCTTC CCGCTTCCCT CTAGGTCTCT GCCAAGGCGA TTGCGACAGA GACAGTGACT GTGCTGGAAC CCTCCGCTGT TTTCAACGGA ACGGTGGACA AGATGTGCCC GGATGCTCTG GTGGCTCAAA CGAGGGAAGC AGATCTGATT TTTGC
|
Protein sequence | AFPLGLCEGD CDRDSDCQGN LRCFQRGASR ESVPGCTGAD ADATRFDYCY DPLTPTPASA PPSVCEGDCD QDSDCAGDLR CFQRSNGHLV VPGCAGGAAD VSHFDYCYDP TYGPTLVSVQ LALSTVSSAR ADPSLSSLGS EVSSGLPDEN RSDRRLQSVA VVGDNGSPSS RFPLGLCQGD CDRDSDCAGT LRCFQRNGGQ DVPGCAGGAK DRSKSDYCYN PNASVRPPPA PVATRPIPAP VPSPASVAVV GNNGSPSSRF PLGLCQGDCD RDSDCAGTLR CFQRNGGQDV PGCSGGSNEG SRSDFCYDPA PEASKPVPVP TRPNPAPVPP PPAGPVVSPV PAPVAAPSPS GPRPVVTVVG DNGSPSSRFP LGLCQGDCDR DSDCAGTLRC FQRNGGQDVP GCSGGSNEGS RSDFC
|
| |
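The coordinate and composition fields in this record can be cross-checked with a few lines of code. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which agrees with the listed Gene Length (55174 - 48900 + 1 = 6275). The helper names `gene_length` and `gc_percent` are hypothetical and not part of any database API. The GC helper ignores the spaces that group the sequence into blocks of ten.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive start/end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(48900, 55174))  # 6275

# First two ten-base blocks of the gene sequence, used only as a short illustration.
print(gc_percent("GCCTTCCCGT TGGGTCTATG"))  # 60.0
```

Passing the complete Gene sequence field to `gc_percent` would give a value to compare against the GC content row. The two-block result shown here describes only those 20 bases.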