Gene Information |
Locus tag | PHATR_25840 |
Symbol | |
ID | 7204049 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1055177 |
End bp | 1059308 |
Gene Length | 4132 bp |
Protein Length | 1128 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186457 |
Protein GI | 219113747 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
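The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates (this is consistent with the listed Gene Length, but the record does not state it) and that the coding region ends in a single stop codon. The function names `gene_length`, `coding_length`, and `gc_percent` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Values taken from the record above.
span = gene_length(1055177, 1059308)   # matches the listed Gene Length of 4132 bp
coding = coding_length(1128)           # 1128 codons plus an assumed stop codon
noncoding = span - coding              # bases in the span that are not coding
```

Under these assumptions the gene span is 4132 bp and the coding portion is 3387 bp, so about 745 bp of the span is non-coding, which fits a gene marked as spliced. `gc_percent` can be applied to the Gene sequence field as printed, since it strips the spaces between blocks.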
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.908694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAATCAATC ATAATTCGTA TCCTGGAAAA GACACGACAT TCAGCTGATA AGACTGACTG CATCTCATGT GGCTGACTCT GAGGCCCATC AAAATCAGAA GTGGTCTGGA TTCGGATTCT AAAGTTACTT CCTCATCCTT TTACATTAGC GCTTCGCGTT CACACCTTCC CCTCCCCTCT TCCAGATCTT CCCGAGAACT GTCGATCGAG ATGGAGGACA TAAATTTCTT CAAGCCATGG GACAGACGAT CTATCCAGCC CATTGAACTC TTCCTTGAAT TTGCTGACAG TAAGCGACAA TGAGAATCAC CAACTTTTCC ATCGTTGGGG CCGTGACTGC GACAGCCTTG GTTCTTTCTG CTTGTAGTAT CAGCCAGGCC TTTTCCCCTC CGTTGCGGTC TGTTCCACGA ATCCGTATCA ATGGTAGACC TTCGCCTTCT TTCCGCTTTG CGATCATTCC CGACAAGGAA ATCCAACAGA AGACGAAAAA GGATACGACT CAAGCGTTTT TCGATGAAGA GGCCCCATCA CCTCACGATT CGGGTGCAGG AATTCCTTAC GAGCGTCTTA CTATAGGAGT ACTGAAAGAA TCTTTCCCAG GCGAGTGTCG AGTGTCACAA ACACCTGACT CGGTCCGAAC ACTGGTCAAA GAAGGATTTA CCGTCGCCGT CGAATCTGGA GGTATGTATT GATCTAAATA TTCCAGAGTT TGCCTTGTCG CTTCATTTCT GGTATCTTAC AAATATTCTG GATTGTAGCT GGCGACAAGG CCTCTTTCAA CGATGCAGCA TACCTAGAAG CCGGCGCGAT TGTCTTAAAA GCCACCCAGA TATACCAGGA TGCTGACATT CTCACCAAGG TCCGTCCGCC GAGTGATGAG GATGTACCAA AATTTGCCGG GAAAACTTTA ATTAGTACAA TCCAGCCCGC AATTAACAGT GATCTATACC ATGCTTTGGT GGCGCAAGGG ACGAACGTCT TTGCCTTGGA TTGTGTCCCT CGCATGCTCA GTCGAGGGCA GTCGTACGAT ATTCTTTCCT CTCAGGCCAA CATTGCAGGC TATCGCGCCA TCGTGGAAGC GGCGGAGCAA TTTCCACGTT TCTTTGCGGG TCAAATGACA GCCGCTGGTA AGGTACCTCC CGCCAAAATC CTGGTGCTTG GGGCTGGCGT GGCCGGATTG GCGGCGGTGC AGACCGCCAA AAATATGGGG GCAATTGTCC GGGCCTTTGA CGTACGTTCC ATCTGCAAAG AACAAGTCGA ATCTATGGGC GCTACGTTTT TGGAAGTCGA TGTACAAGAA GACGGAAGTG GTACTGGTGG ATACGCCAAG GAAATGAGTG ACGCCTATAA AGCGGCACAG GCCAAGCTCA TGCTAGAGCA GGCCAAGGAC GTGGATATAA TTGTTACGAC GGCCTTGATT CCTGGCCGCA AAGCCCCTGT GTTAGTAGAT GAAGGGATGT TGTCGGAGAT GAAGGCAGGA TCCGTCTGTG TGGATTTGGC CGCTGCGAAC GGCGGAAACG TAGCTCTAAC TAAACCGGAC GAGATCGTCA CAACAGATAA CGGGGTTAAA ATTGTCGGAT ACACCGACCT TCCGTCCCGT CTACCTGCGA CAGCCAGCAA TCTTTTCGCA AACAATGTGG CCAAGTTCAT TTTAAGTATC GGTCCCCAGA CAACCAAGGA AAAGAATCTC TTTCTAATCG ATTTAGATGA CGATGCAGTG CAAAATATGT TGATCGCTTA CAAAGGAGAG GCCCGTTGGC CCGACAAGAT TAAGCCTTTT AGCCCTCCTC CACTACCAGC 
CAAAGCCCAG GTAGACGTGC TATTGACGGA AGAAGATCCT ATCGTCTCGG CCAATAAAAA GCAAAAAGTG TCGTTCGTAA AAAATACTGG CATTTCGAGT CTTGCTGCGA TTATTTTGGT TGCCTTTGGT CTTACTACCG ATTCTCCTGA GAGTGTATCT CTATTGGCCA CCTTTGCGTT GGCTGGCCTC GCAGGGTATC AAGTAGTCTG GGGCGTTGCA CCGGCTCTGC ACTCTCCACT GATGGCCGTA ACCAACGCCA TTTCCGGGAT GACAGCAATT GGCGGCATGC TTCTTTTGGG AAGTCATGCG ACGGACACCA AGGGTTTAAT TCCAGATAGC CCAGCACATT GGCTGGGCGC TATTGCAACA GCGCTGTCGT TCGTAAATGT TGCCGGTGGA TTTTTGGTTT CCGGAAAGAG TAAGTAAGCT TACAGGTAGT TCGCTCTGAT GTTTCAGTCC AACTAGAGAC GGCTAACTTC CAACTGTCCT CTTTTTAATT GGTAACAGTG CTGGAATTGT TCCGCCGACA GAATGAGCCG AAGGATTTCT TTGAACTGTA CGCAATCCCC ACGAGTGTTA TTCTTGCAGG ATTGGGTATA GCTGGCTTCT CGGGAGTGGG AAATCTAGAC ACCATGACCG GCACTGTCGG CATTGCTTCG GCCATCTGCT GCATTGCTGC GATTGCAGGG CTTGCGAATC AAGAAACTGC TAGGACTGGA AATGTTCTAG GTATGGCAGG TGTAACGTTC GGCTTGGCGG CGGCAACCGG TGAAATGGCC GTCGCCGAAG CTGCCCCTGC TGCTTTTCAG CAAGTCGGGC TATTGGGAGC TCTTGGAGGT GGGGTGGGAT TGAAGCTTGC ATCAGGTGTC GGACCGACTG AGCTCCCCCA GACCGTTGCC GCCTTCCATT CTCTTGTCGG TCTAGCAGCC ATGGCTGGAG CCGCGGGTGA ATTCTTTGCC GGTGACGATC TGACCACAGG TACTCTCTCG GCGATTTATC TTGCCATCCT GATCGGAGGC GTTACCTTTA CTGGTTCGAT AGTTGCCTTC AGTAAACTTG CCGGTATCAT GGGCTCGTCC CCATTGCGCT TGCCTGGGCG CGACCAACTA AACCTTGCAA TGATTACGAC TTGTGTTCTA GGTTTCGCTG CTTTTCTTGA CCCCTCTTTG GCTACAAATC TAGTCAGTAT TGATCATGGA ACTGTGCAGC TCGTTAGCCT CGGAATAGTA GCTAGCATCG CATCAATTCT CGGCTATCAT CTTACAGGTC AGGCAAACTT ATTCTATACG TTTTGCTTCG TTCTCAACAT GCTCTTACTT GCTCCTCCAT TGATGTTTTA CCAGCAAGCA TTGGTGGTGC GGATATGCCG GTTGTCATTA CAATCTTGAA TTCATACTCC GGTTGGGCAC TGTGTGCAGA GGGGTTCCTG CTTGGCAATC CTCTTCTGGC ACAAGTCGGT GCCCTAATAG GATTTTCTGG AGCCATACGT GAGTTCTTGG TTTCCTCAAA GCTAGGTGGC GTCGCCGATC TACTCTGAAT CTTACGAACC TCTTGTTTTT GATTTGATTG GCAGTGACTT GGATCATGTG CGAGGTATGT CAACTAGACT TCTCTCTAAA AGTTGATATT GTAGTATTTC GACTTACAAA GTGGTGTCTT TTGACAGGCC ATGGGAAGAA ATGTTGTATC CGTTGTACTA GGTGGAGCAG GAACTGCTAC GAGCTCGGAA TCTACAGAAA CAACTGCATT CGAAGGTGAA ATTACTACGA CCACAATCGA CAACGTTGCT GACGCTCTCA AAGAGGCAAG 
GACCATTATG ATCACGCCTG GATACGGTCT TGCTGTAGCT CGTGCACAGT TTTCGATTGC TGAAATCGCC AAAAAATTGA AGGAAGAAGG AAAGAATGTG CGTTTTGGTA TTCACCCGGT CGCCGGCCGT ATGCCTGGCC AATTGAATGT TTTGCTTGCT GAAGCTGGTG TTCCGTACGA TATGGTTTTT GAAATGGAAG ACATCAATGA AGAATTTGCG GAAACGGATG TAACTCTTGT GATCGGCGCT TCGGATACCG TTAGTAGTGC CGCCGAGGAC GATCCCAAAT GCAGCATCTA TGGCATGCCC GTCCTTCGAG TTTGGAAGAG TGGCCATGTT TTTGTCCTGA AGCGGTCTAT TGGCAACACT GGGTATGCCG GTATGCAGAA TCCGATCCTA TTCAAGGACA ATGTCGATGT TCTTCTCGGC GACGCCAAAG ACAGCTGCGA TGCTCTTCGC TCAGCATTGA TCGACTGAAA ATCAAATTGT TGATCGAGTA ACCTTACATT AATGAATCGT CC
|
Protein sequence | MRITNFSIVG AVTATALVLS ACSISQAFSP PLRSVPRIRI NGRPSPSFRF AIIPDKEIQQ KTKKDTTQAF FDEEAPSPHD SGAGIPYERL TIGVLKESFP GECRVSQTPD SVRTLVKEGF TVAVESGAGD KASFNDAAYL EAGAIVLKAT QIYQDADILT KVRPPSDEDV PKFAGKTLIS TIQPAINSDL YHALVAQGTN VFALDCVPRM LSRGQSYDIL SSQANIAGYR AIVEAAEQFP RFFAGQMTAA GKVPPAKILV LGAGVAGLAA VQTAKNMGAI VRAFDVRSIC KEQVESMGAT FLEVDVQEDG SGTGGYAKEM SDAYKAAQAK LMLEQAKDVD IIVTTALIPG RKAPVLVDEG MLSEMKAGSV CVDLAAANGG NVALTKPDEI VTTDNGVKIV GYTDLPSRLP ATASNLFANN VAKFILSIGP QTTKEKNLFL IDLDDDAVQN MLIAYKGEAR WPDKIKPFSP PPLPAKAQVD VLLTEEDPIV SANKKQKVSF VKNTGISSLA AIILVAFGLT TDSPESVSLL ATFALAGLAG YQVVWGVAPA LHSPLMAVTN AISGMTAIGG MLLLGSHATD TKGLIPDSPA HWLGAIATAL SFVNVAGGFL VSGKMLELFR RQNEPKDFFE LYAIPTSVIL AGLGIAGFSG VGNLDTMTGT VGIASAICCI AAIAGLANQE TARTGNVLGM AGVTFGLAAA TGEMAVAEAA PAAFQQVGLL GALGGGVGLK LASGVGPTEL PQTVAAFHSL VGLAAMAGAA GEFFAGDDLT TGTLSAIYLA ILIGGVTFTG SIVAFSKLAG IMGSSPLRLP GRDQLNLAMI TTCVLGFAAF LDPSLATNLV SIDHGTVQLV SLGIVASIAS ILGYHLTASI GGADMPVVIT ILNSYSGWAL CAEGFLLGNP LLAQVGALIG FSGAILTWIM CEAMGRNVVS VVLGGAGTAT SSESTETTAF EGEITTTTID NVADALKEAR TIMITPGYGL AVARAQFSIA EIAKKLKEEG KNVRFGIHPV AGRMPGQLNV LLAEAGVPYD MVFEMEDINE EFAETDVTLV IGASDTVSSA AEDDPKCSIY GMPVLRVWKS GHVFVLKRSI GNTGYAGMQN PILFKDNVDV LLGDAKDSCD ALRSALID
|
| |