Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49381 |
Symbol | |
ID | 7195772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 99226 |
End bp | 102641 |
Gene Length | 3416 bp |
Protein Length | 969 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184182 |
Protein GI | 219127938 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.158773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATTGCGTTG CAGGTGCCCT TCTATTGCAA CATGGATTGC TCTTTGAAGC AATTGCTTGA CAACAACATT GCCAGCAAGA AGGAGTGGAT GACTGAGTGC ATCCCTTTTC CATCGAAAGA AGGGCAGGAA GTTAATCTAT GCGATTACAT TGGTCTAGAT TGTAAACGCA TTGGAGAGTG TTTTATGTTT CCGGAAGGAT ACGACTCTAC CTCAAATTGC CGAAATAGTC TAGCCAAGGC CATTAAGATT GCTGCTGCAA ATGGAAAGTT TCCCTTAGTC GAACGAGGAT GGGATGCAAA AAAGAGGCGA CTACGTTTTG AATGCTTTAG AAGTAGATCC CATGATGCGG ATAGGTACAA AACAAAACAA AACAGTCTAT CCAATGTGGC TATAAAACAT CGCAACGTTT CCTTCATACG TCCACAAAAG GGAAAAGAGT GTCCTTTTCG GTTCAGTATT TTTTGGATGA CAGAACATAA ACAATGGTGT CTTTTTGGAG GTGTAAAAGG CAGCTGCAGG TTTCACTGTC ATCATTTACC AATGGATCCA TGTGAGGTAA AAAAGAGTAT TTCCTATATT GACGGTGGTG AAGTAAAAAT AGCACTTGAT GCAGCAAAAA GTAATGCCCC ACCAAGCGTG ATTGGGCGGC TGCTGAACAT ACGAACTGGA GATATCTTGT CAGGTTCATC TCTAAAAAAT ATACGGATGC AAGCTGAAAA AGGTGAACGG AACAAATTTG GTGCAAACGA TAATTTCACG ACGCAAGCGG ATCAACTTCT AGCTTATCTT GAGAGCACCC CGGATGTCAG CTTCTGTGCC ATATATGATG AACCAGATTC CCCTTTATTC ACTGTTTACA AGCAGAGGGC AAAAACTGGA CGTCGGCACC TACACACCAG TACACGGAGT ATCTCTGGAG GAATTGCCCA ACAAGAAGTG CTCAATGAAA AGGTGCTAGA TGCTATTGAT CCAAGGGGAG AGCTTGATGA CTATATAGAT AGGACGCGGA GGGCGTTCAA GTTGAAGGGC AACGAAAAAA TGCTTTTAGG AGTGGCCTGG ACCAACAACG AGAGTAGAAG AATCTTTGCT CGCTATTCCG AGATCATGGT AGCAGATGTG ACAGAAGGTA CCAACAATGC AAAACGGCCG CTGTTTTTGT TTTCGGGAAA GACATCAAAT CAAAACACGT TTACAGCACT TTGGGCCTTT CTACCGCAAC AATCTCGTTG GGCCTTCCGA TGGGTGTGGA CAAGATGCAT TCCACAGCTC TTACCTGAAC AGGGGCTGAA CAGAATGCGT CTATTGATTA CTGATGGAGA CCCGCGAGAG TATGGTACTT TTTTAGATGC AATACCTACT TGGTATAGCT TGTGTCGGCA CAAACTATGC CATTGGCATC TACTCTATCG CGGCAGTCTT ATGAAAGCAC AGACTGGAAA CTGTGGAACA AAAGCAAAAA TTCTATTCCA TGTGGTCCTG AAGTGGATAG AAAGCTGGAT GACAAAAATT GAGACGCAAG AGGAGTACAA TTTGTCAACT GGGCTTTTGA TTGACTGGCT GAAATCTCCG GAGGCACTTG ATACAAATTT GGGCGGAATG GGTTGTGCCC TTGTTTCGCA GATAAATGCA TTTTTGACGT CCTCTCTGTT TCCGCACGAG CAACGCTGGG CTCGATACCA TTTTCTGAAC GTGAGAGCAT TCAACACGTC CGCAAGTTCC TACGGAGAAG CAGAGAACAG TGCTCTAAAA CGACGGGGTG ATGGGGTCAA GCCAAACTTT TCGGTGCCAA AAGCAACACG GGCAATAAAC GAAGGGACTC AATTGCGAAC AGTGAAGAGG CAACAAAAAG CAGTTCATAA CCTCAATGCT ACAAAGAAGA CAAAGGCAGC AAACTACACC AATATATCCG ACCTAGTAGA TTGCATACAG GAAACCATAT CCCATGAATT CAATGCAGCC AAAAAATATG ACCTCTTTTG CCCGGGTCCA AAAGAATTTT GGGTAAAGCG AGCATGGTAC CAAATTCCAA GCGAGACCTA CCAGGATTTC AACGACAGCA ACTTTTGCCA ATTTATGATT CCACAGTTTG AGCGCACCCG CATCGTAAAA ATTACGGAAA TTGAAGGTGA ACTCTATCTG GAATGTAGTT GCGGCAAGTT CCAACGACAA GCTTCTCCAT GTGCTCACAT CTACAAAGTA CTTAACCGAC CACCACAATC AACAGACGTT TCTGTGAGAT GGACAAAAAT TTGGGATGTT TACCTGCATC GACCTGGATA TCATGATCTG TCGGACCAGT TAGAGGAATT GTATAAGAAG GAGCGGCCAG GGCCACATTT CGAAAACACA AATCAGTGGG AAGTTGGAAA GGGTGAGAGA GAGTACAACT ATTTTAAGAG ATCACTTCCA AGCGAGCCCA CCATTATCCA GAAGTACAGC AGATGGGCTG ATTCTTTTTC ACGACAACCT GGATGTTATG TGCATAAAAG CACTGAACAG GAAACAGTTC CTGCAGCAAG CGGTATGGTG CAAGAGTTGA CCAGCCTTTC CCAGGGGTAT GCTATTGAAA CTCAATTGGA TAGTGAAATG GATGTTGGAG ATGTAACTGT CATGCAGGTT GAAGAAATTG ATTCAAATCT CTCAAAATCG GGTAAAAGTC CATACACAAA CAATCTTCAT TTTTACGAGG AAATCTCAAA ACTTGCCAAA TTCAATTCAA AAGCTGCTGA CATAATGACA AAAGGAATGC AGGAAACTTT GGAATTGCTA CAGAAACATG TTGCAGAAGG GTCAGGTATG GTAGATTACA GTATTGGCCC AGCTATTGGA AAAGAACCAG TAGGCCAAAG GCTCAGGCCA AGCTACAGTC CTTCAAAGAG CAAGAATCTA AGAGACAGAC AAAAAGAAAC AAAGGCAAAA TTTTGGTGGC TGAACAAGTA ATGATAGTGA GACCTATTTG TCATCATGAG AAGCATTCTA TAGGAAAATG TTAAAGACTG TAGCATCCCT CATTATAATT GTGAGCCATT CTTGGCACTC TTCCTCTGTC AGAACCATCA TGCCTTATAC TAGAGTCCTC AGACCTTCCA ATGCAAAATC ACTAAGGTGT ACTTGTAGCC CCAACGTATT GGCCATTTCC AAGCCTTTGG TGTCAACGCC GGCTTTGCTT TCATTGTCAC CTTTGTTGAC GGCTGAAGGA CGGATACGGC AATAGTCTTT TACCATACTC AGCTCTGTTT TGAGGACCAA GCCCATACCG CCAGCAGCAG CAACATGCCC GAATCGACGC TTATTGAACG GCAAAGCGGC GTCTTCCACA GTAACAGCTG TAGTGGCACA AAGCGCGGAA TCCGGATCCC AACCAAGGCA GCAGCGTTTC TCCCACAGCA TTATCCCCGG CCACAACCAC TACTTG
|
Protein sequence | MDCSLKQLLD NNIASKKEWM TECIPFPSKE GQEVNLCDYI GLDCKRIGEC FMFPEGYDST SNCRNSLAKA IKIAAANGKF PLVERGWDAK KRRLRFECFR SRSHDADRYK TKQNSLSNVA IKHRNVSFIR PQKGKECPFR FSIFWMTEHK QWCLFGGVKG SCRFHCHHLP MDPCEVKKSI SYIDGGEVKI ALDAAKSNAP PSVIGRLLNI RTGDILSGSS LKNIRMQAEK GERNKFGAND NFTTQADQLL AYLESTPDVS FCAIYDEPDS PLFTVYKQRA KTGRRHLHTS TRSISGGIAQ QEVLNEKVLD AIDPRGELDD YIDRTRRAFK LKGNEKMLLG VAWTNNESRR IFARYSEIMV ADVTEGTNNA KRPLFLFSGK TSNQNTFTAL WAFLPQQSRW AFRWVWTRCI PQLLPEQGLN RMRLLITDGD PREYGTFLDA IPTWYSLCRH KLCHWHLLYR GSLMKAQTGN CGTKAKILFH VVLKWIESWM TKIETQEEYN LSTGLLIDWL KSPEALDTNL GGMGCALVSQ INAFLTSSLF PHEQRWARYH FLNVRAFNTS ASSYGEAENS ALKRRGDGVK PNFSVPKATR AINEGTQLRT VKRQQKAVHN LNATKKTKAA NYTNISDLVD CIQETISHEF NAAKKYDLFC PGPKEFWVKR AWYQIPSETY QDFNDSNFCQ FMIPQFERTR IVKITEIEGE LYLECSCGKF QRQASPCAHI YKVLNRPPQS TDVSVRWTKI WDVYLHRPGY HDLSDQLEEL YKKERPGPHF ENTNQWEVGK GEREYNYFKR SLPSEPTIIQ KYSRWADSFS RQPGCYVHKS TEQETVPAAS GMVQELTSLS QGYAIETQLD SEMDVGDVTV MQVEEIDSNL SKSGKSPYTN NLHFYEEISK LAKFNSKAAD IMTKGMQETL ELLQKHVAEG SGMVDYSIGP AIGKEPVGQR LRPSYSPSKS KNLRDRQKET KAKFWWLNK
|
| |