Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_107 |
Symbol | |
ID | 7196818 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1779747 |
End bp | 1785992 |
Gene Length | 6246 bp |
Protein Length | 601 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177369 |
Protein GI | 219111235 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.799153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGACATG TGCAGAAGAT CGATTTGTCT TACAATCATC TGGTAGGATC TCTACCTGCG GTGACCTTCG AGCTTCCCTA TTTAGAAAGT CTGGTTTTGG ACGCGAATAC TGATTTGACT ATTGAATTCG ATGGGATTGC GAACGCTCAG TTCCTCCAAA CTCTTGTTTT AAGCAAAACG AACGTCAAGG CATTGGACGG CATTTCGGCA GCGATAAGCT TGGAGGTTTT GCACCTAACT AATCTTGGTT TGCGTGGGCC TCTTCCGGAG GAGCTGTTGC GATTGACAAA TTTGCGATCA CTCCTTGCGA ATTTCAACCA CTTTACGGGT ACTCTGCCTC AAGCAATTGG AGACTTGACG GGATTGCAGG AGTTGCTCCT CTACGAAAAC GATTTGACTG GCGCAATTCC ATCTACTCTT GGTAATCTTA TTCATTTGCA GACGTTGAAT TTAGCTCAGA ACGCGCTCGG CGGTGAAATT CCAGTCGAGT TGGACAGGTG TACGGAACTG GAAAGTTTTG CGCTTCATCG CGTCTTTGGA AGTGAAAAGG GACCAGGCTT GTCAGGGAGC CTGCCAGCTT TCTCGAGCCT GTCTAAAATC AAAGAATTGT ATTTGCAGAA CCATCAGCTG AGTGGGACCA TTCCCGAAAA CTTTCTGGCG TCTGCGTCGC CATTTGAAAC AATCAAAGTA GACTTGTTCA GTAACGCCCT CACTGGCGCT GTTCCTCCCT CTCTTTTGAT AAAGCAACGC ATGAATCTCT ATTTGGCTGA CAACAGCATA ACGCTGTCCT CTTCCTTCTG CAATTTTATA CCCCCGGACT GGATGAGCGG AACAGTCGCT TTGCAAGGAT GCGACGCGTT TTTGTGCCCA AACGGTCAAA GGGCATCGTT CGGTCGAGCG ACGCAGTCGA GACCATGCAC CCCTTGTCCC GGAGCACAGT ACTGGGGGAC GACGACTTGT CCGACTACGG TTGTGGGCTC CGACGAAGAA CGTGCCATCA TGACGGAGTT TTACGAAAAA ATGGGTGGGC GTTTTTGGAA GAATTCAGAC AATTGGTTAA ACCCAGCTCT CTCGCCATGC TCGTGGTTTG GCATTGAGTG TACCACGGAC GGACATATTT CGGCAATTCG ACTCCGGAAC AATGATATGA CGGATTCTGG TAGCTATCCT TCTCTTTTTC GACTCCCAGA GCTGCGGATT CTCGACCTGA GCGCAAACTC AATCAACTTT CGGTTCGAAG GAATTGAAAA CGCGACCAAA TTGGAGGAAC TTGTCTTGAC CCAATCGGAT TTGACTTCTT TGGACAACAT TGGAAGTATC GTTTCATCAA ACATTCGAAG ATTGTCATTG GCGTCGAATC AGCTTGAAGG CGCTGTACCG GAAAGTCTTT TTCAGATGAA GGTGCTGGAA GACCTCGATC TTTCGCACAA TAAGTTTAGC GGTATTCTAT CTTCGAGTAT TGGACTCCTA ACAAACTTGA AACGTCTCAA GATCTCCGGC AACTCGTTGA CCGGAGCGTT GCCGACAGAG CTTGGAAGTC TGTCGGATCT CATCGAAATT GCTGCAGCTG AAAATTCATT TTCGGGAGAG CTTCCCACCT CGCTTGGCAA TCTCTCGCTC CTGCAAATCT TGTCAATCCG TCAGACCGCG AGTGTCGGCG ACCTTACAGG TACATTGCCC TCGTTTTCCG GGCTTGAGCA GTTGACAAGT TTACAGTTGG GAGGAAACTT TTTGACGGGA AGTTTGCCTG TCGAATTTCT ACAAGCAACT AAGCGAGGCA ACGACCGCAT AGAAGTACTT CTCAGCGGCA ACGCATTCGA AGGATCGGTT CCTTTGAGCT GGGCCAGTCG CTTTGGTAAC TTGGTGTTGG ATCTAGCTGG AAATCGTATT ACGAGTCTTG ACGATGGGAT CTGCGACCAG CAAAACTGGA ACGATGGGCT CGTTGGAACT TATGGCTGTA ACGCAATTCT TTGCCCTATT GGGTCGTTCA ATGGTTTCGG GCGGCAGACT TTTTCGGAGT CTATATGTTC TCCTTGTGCT CAGGCTCAAG TAATGGGGGT GACGAAATGT GGAGGCAACG ACTCGTTGGG AGATGAAGCA GAAACTTTGA TCATTCTTCA AGACCTTTAC TTTTCAACTC ATGGAGACAG CTGGTTGAAT AATAGTGGAT GGACTTCTAC GACCGATTTC TGTTCTTGGT TTGGAGTGAC ATGTAATGCA GTCGGTGAGG TTGTTAAGAT CAATTTGGAG AACAATGGCC TCACAGGAAC TCCTTCTCGT TCAATTTATA ATATCACTTC GCTGGAAGCA CTCAATTTTC AGCAAAATGC TGTATTGTTT TCTTTCAACG GTATCGCCAA AGCAACAAAT CTCCGGAGTC TTGACTTGTC AAGCACAAAC CTTGATTCTG TTTCTGGAGT TGGCCAGTCT TCTAGTTTGA GTGAGCTTCG TCTGACGGAC AATGACCTGA CCGGGCCCTT TCCGCCAGAG ATTCTGCAAT TATCAAACTT ACGGCAACTC TTTTTAAATT TTAACGCGAT TGAAGGGCCT TTGCCGATTG AGATTTCGTT GATGAATAAC CTTGAGGACC TCTTTCTACT CAATAATAGG TTTTCTGGTC AGCTCCCCGC CAGCATTGGT TCTCTTTCTA GTCTGAAGAG ATTGGCTTTA TCAGATAACA ATTTTGAGGG TTCCATACCG CCCGAACTGA ACAACCTCTT ATCCCTCGAG CTCTTTGCCA TTCAAAGAGA AAATGGAAAA GGAAATAATG ACATTGATGC GATTATCGGA ACTAGCGCAG ACGCCGGGAG AGGCTTGACA GGGCCTTTGA TATCTTTTGA TCGGCTGTCA AACTTAAAGC AATTATATCT CAGCAGAAAT AGCCTGACCG GCTCCATTCC TCAAAATTTT CTCGATAGCA ATCAAGAAAA ATCAAGTATT GAAGTAGATC TCGCGATGAA CAGGTGAGAC TTACAACTCG TTGATCAATA TAATACTTTA TGAACTGATC TCACACAGCA TTATGTCTCA CCAGACTGAC GGGCGCAATT CCTGCTTCGC TTGCCCGATT TGAAAACATG TCGTTATACC TCTCAAGCAA CATGATAAAT GATATTCCTG ATGGTGTCTG TCGGCAGAAT AACTGGATGA AAGGCACGGT TGACTTGTAT GCATGTGATG CGATTCTATG TCCACCCAAC ACGTTTAGCC TGTTTGGTAG GCAGGAAGAT GCCTCGTCCG TATGTGAGAC GTGCCCAGAA GGATCGTCGG CACCTTATTA TGGAAGTGTT ATGTGCGATG ATGCAGATGA CCAGCTTCTT TTAGGGCAGA GAGATGTACT TGAGCAATTG TACACCGAGG CCGACGGAGA CAACTGGAAA TTAAATGACA ACTGGATGAA TTCCGATATA GAGTTTTGCT CCTGGTACGG TATTGAGTGT GATAGCGATG GCTTTGTAAA GAGTATTGAC CTGATGCAGA ACGGACTTCA GGGAAAAATT CCGACGTCAG TGTACAGTCT GCCGCGGTTG CAGGAGATTA ATTTCGCATC GAACGGAATT GAAATAAGCT TTACAGGGAT CAGTGAGACG CGAAGCCTAT CGTATATCAA TCTCGACTAT ACTGGATTGA GCTCGTTGAG TGGAATTGAA AAGGCTTCCA GTCTCAAATT GTTGCATTTA GTCGGCAACG AATTGGGTGG TGTATGGCCG ACTGAAATTA CCGCTCTCAA AAGTCTTCAA ATACTGTACC TATCTGAGAA TGATATTGGA GGTGCTTTGC CTGCAGCGCT AGCAGAATTG TCAGATCTAG AAGTCTTTGC TTGTGTTGAA TGCGGCTTAA AAGGAACGCT CCCTACTTCA GTTGTGTCGC TCAAAAAATT GGAGTACCTG AACCTTTCCA GAAATTCCTT TTCGGGGCCT CTTCCGCTGG AGCTAGAATC CTTGCTGTCA CTGAAGTACC TTAGTCTCTC AGAGCAAATC TCCTCCATCA GCAACGGATT GACTGGACCG ATACTGTCAT TTCAAAAAAG CCTGAACTTG ACAGATCTCC ATATTCATAA AAATCAGCTC TCAGGAGCAA TTCCGAGCAA TTTTCTGCAA AATGTTAGTT TGTCTCAAGA AGTCCGTATC GACCTGCATT CCAACAGACT CAATGGTACA CTCCCCCTCG AGCTTGCTAG ATTCGACAAA ATGACTCTTG ATCTTGCAGA CAATAAGATA GAGGGAATTC CACAAGGTCT CTGCGAAAAG TCTTGGAATG AGAGGAATGC CTTGCCGTCC GGAGACTCCT GTGATTTCAT TCTGTGTGGG GCTGGCTCCT TCAACGGGAT AGGTCGCGCA TCCGCTTCTC TTCGATGCGA GCCCTGTCTG ACAAGGTCGG ACGCCCAATT TTTTGGACAA ACATCGTGTG GGCCAGATAT TGAACGTGAT GTTCTCAATT CTCTGTTCCG CGACCTGTAT GGTTTACAAT GGAAGCGAGC TGATGGATGG GGTAGCACTG CTAGTGTATG CAGCTGGTAT GGCGTGGATT GCTATTTAGG AGGCGATCAT AATGAGCTTG TTCGAAGTAT TGTCTTGGAC AATAATAATC TGGTTGGGAC AGTGGGTTCC GGAATATGGC TCCTCACACA ATTGATTGAA CTGGATCTCA GTGAAAACCA AATTGACGTG GACTTCGATT CGGTGGGAGG TGCTTCAAGT CTAGAACTAC TGCGCTTGTC GCAGACAAAC ATCACTTCAA TAGTCGGACT CGGCTCGTCT ATATCTCTTA GAGAGCTCGA GTTGGCCTCT CTCGGATTGA CGGGATCTCT ACCGGAAGAT TTTTTTAAGT TGGAATCTTT GGAAAGACTA GTCCTTGATC ACAACAACTT GTCAGGCGAA GTCTCCAGGT CAATAGGTGA TCTTTCCAGT CTGGAGGAGC TCTACCTCGG CAATAACGGA TTTTCTGGAC CGATACCAGA TATGTTTGGT TCGATCTCCC GTTTGCGTGT TCTTTCCATC GGCGGAAACA AGTGGACGGG CGAGGTAAGA GACCCTTTGT TCGCAGGGAG GGAGGTTACA TTCCATAAGG AACTTCGTTG TCTCACCTTT CTGTTAGATT CCTTCGAGCC TAAGTTTTTT GTCTTCTCTT GAAGTGTTGT CCATCGAAAG ACAACCGGAT GAGCTTGCTG GTGAACCTGG GTTGGTCGGG AGTCTCCCAG CATTTCATCG AGCGCCACGA CTTCGCGAGC TTTATCTAGC AGCGAATAGT TTGGGGGGCA CTATCCCCTC AATTTTCCTA TCTGGACACT CGGACAAGAG CTCCGAAGTC ATTGTTGATT TACAGCAAAA CCTCGTACAC GGTGCAATCC CAGAAGTTCT TGCTGATTTC AGCCAGATGC AACTTCTTTT AGGAGGGAAC AGGATCAGTG CTGTCCCGAA TTCAGTGTGC GAAATGAAGG ATTGGATGGA TGGTCTTTTG CAATCCGGCT GTGATGCCTT ACTTTGTGGA CCGGGTACTT TTAATACCAT CGGACGCCGG ACGTCGACTG AAGACTGCCA ACCTTGCACC TACCGTGCCT CAGCGCTTTG GTACGGGAGC ACACGCTGTG GCGCCATCTC ACCTGAGCAA CTTACCCAGG AAGAAATCTT GAGAGAGTTT TTCGAAAGGA CAGGAGGGAA CGACTGGAAA AAGGCAGACA ATTGGTTACA AAATGGGATT ACTGTCTGCG AATGGTTTGG TGTTGATTGC GAGCCGAACT CCGAGGGAAA GGATGATGTT GTCAAAATTG AGCTCAGCGG CAACAATTTA TCCGGAACTA TCCCGTCTTA TATGTTCTAT CTCCCTTCTC TCCGAGTTCT CAACCTTAGC GGAAACAGCG TTACCATGGG ATTCGGGGAC ATCAAAGATT CGAAGTCACT GAAGGAGCTT TACATCGATT CCACGAACGT TATATCCCTC GAAGGCCTAA ACGAAGCCAC AAATCTTCGT GTTCTCAAAA TGGATAATAC GGCATTCAAT GGCCGGCAGA TTCCGACCGA ATTGTACTCG CTGACCGCGC TGGAATTCCT CGACATCGCT CAATGTGGAT TTACGGGCAC TTTGTCGCCT TCCATAGAAA ACTTATCCAA TTTGAAAGCA TTTGTTGCTT CTCACAATGA TTTGAGCGGA CTCATCCCAG ACTCAATTGC GAGCTTGACT GTTCTTCGCA ACCTTCTACT GTCAGAGAAC AACTTTTTTG GGCCCCTAGC TCCGCTTGAA AGCTTA
|
Protein sequence | LGHVQKIDLS YNHLVGSLPA ALDGISAAIS LEVLHLTNLG LRGPLPEELL RLTNLRSLLA NFNHFTGTLP QAIGDLTGLQ ELLLYENDLT GAIPSTLGNL IHLQTLNLAQ NALGGEIPVE LDSIVSSNIR RLSLASNQLE GAVPESLFQM KVLEDLDLSH NKFSGILSSS IGLLTNLKRL KISGNSLTGA LPTELGSLSD LIEIAAAENS FSGELPTSLG NLSLLQILSI RQTASVGDLT GTLPSFSGLE HELRLTDNDL TGPFPPEILQ LSNLRQLFLN FNAIEGPLPI EISLMNNLED LFLLNNRFSG QLPASIGSLS SLKRLALSDN NFEGRCLASS LKLLHLVGNE LGGVWPTEIT ALKSLQILYL SENDIGGALP AALAELSDLE VFACVECGLK GTLPTSVVSL KKLEYLNLSR NSFSGPLPLE LESLLSLKEL ELASLGLTGS LPEDFFKLES LERLVLDHNN LSGEVSRSIG DLSSLEELYL GNNGFSGPIP DMFGSISRLR VLSIGGNKWT GEIPTELYSL TALEFLDIAQ CGFTGTLSPS IENLSNLKAF VASHNDLSGL IPDSIASLTV LRNLLLSENN FFGPLAPLES L
|
| |