Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48354 |
Symbol | |
ID | 7203567 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 211611 |
End bp | 215056 |
Gene Length | 3446 bp |
Protein Length | 868 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182919 |
Protein GI | 219125295 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.951648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTATTCGTC GACTGGATTA TTAGCGGACA CACTTTTTCA GCGTTTCTCC TCTGCTCTAC GTCTAGGCCT GACTCGGTAG AAAGCTATAG AGTCTAGGAT TGACTGTTTT ACATCACCAA CAGGCAAGCA CTTACTGTCG ATTCAAATAT TGAAATCGTC CGAGGTGACC ATGGATTATT TCGCGGACGA TGCATCCGTG GCCGACCACT CGGTTGGTTC GTTGGCAGCT ATCGAAACAA ACGCTGGCGG CGCACCACGT CGCAAGGAAG AAGAAAAGGT GGAGGATGAA CAATCCCCGG ATCGCCTAGT GGAAAAGAAA TCACGATCTG TGGCCTGCAC CAAATGGTTG CTTCTTTTTC TGCTGGTGGT GTCAGGTATC TTTCTTACGG TTAGCACCTT CGTAACGACG AAAGAAACGC GCGTTGAGGA CAGCGGACGT AGCGATGCGA CTACATACGC AAGCGTCGTC GCGGGAATTT TTTCGGTCTT GATTATGGTA TTTTTACGAT ACGACTATCT CGTCGCGCGC CGGCAATCGT TAGTAATAGA TATGGCAAAG CGATCAAAAA CTATTGTCGA TTCCTTGTTT CCAAGTATGG TTCGTGAGCG ACTCCTTCAG GACAACTCTG GTCAGTTCAG TATTATTTCT GCGGAGGAGC CGGTCAGTTT TTCAGAGCTC ACGAAGATCA ACACTCTTGC CAACAAGAAT ATAAGCGAGT CTGATCCGAT TCGCATGCCT CAGCTTCAAA GCAAGGCGAC TTTAAAATCG GTTCGTGAAA GTGAAGGAGC AGCTCTAGAT CATCGTAGCT CGCAGCCTAT TGCTGATTTA TACAAAAACG CAACTGTTTT ATTTGCGGAT ATTGCTGGCT TTACAGCCTG GAGTGCAGAG CGCGATCCCC CGCAAGTTTT CAAACTTCTG GAAACGGTTT ATGCCGAATT CGATTCCATT GCTGAAAAGC TCAAGGTCTT TAAGGTACGC CTTTGCAGAG TAGCACGTGC AATTTTACGT TTGAAGCAAA CAGCTGTCTA AATTTAAATC CATGTATAGG TCGAAACTAT TGGGGACTCC TACATGGCAG TGACAGGGGT TCCCGAGCCA GATCCAGATC ATGCTGTTAC TATGGCCAAG TTTGCCTACC AATGCCTGGT AAAGATGGAT GGTGTCACAT CTGACTTGGA GACTCTGCTC GGGTCTGGCA CCAAAAACTT ACATGTCCGC GTTGGGTTGC ATTCTGGGCC AGTGACTGCC GGAGTGTTGC GAGGACAAAA GTCTCGGTTC CAGCTTTTTG GCGATACTGT TAACACCGCT AGTCGCATGG AAAGTACGGG CGAAAAGCGT CGTATTCAGG TTAGCTTAGA AACTGCAGAA CTTTTGATTG CTGCCGGAAA GATGCATTGG GTAAAAGAAC GTCAAGATCA AATTGTTGCC AAGGGAAAAG GTCAAATGCG TACTTTTTGG TTGGATCCTC ACAAGAAGTC GAAGCCGTCC GGCGAAGACA ATTGTCGAAT AGCCGCAAAA AATGGGTCAG TAGCGCAAGC TTTTAAACAC ACGCTTGTCC GACAAACGCT CCTGACAGAT GCCACTTACG TTAGTCCATG GGATCGAAGG CTGTCCGAAG ACAAATATGG TCGCTTGATT GACTGGAATA CGGACATTTT CCTTGAGCAT CTCAGTAGAG TAATGGCCTA TCACCATTCG GGAAGAGGAG AAACTCGATC AGTTAAAACA GTCAATCATC GTGACGAAGA CGCTGAAATG CCTCCACCCT ACAATTCAGT GGCGGACGTT ATTTTCTTGC CGGCTTTTGA CCCAAATGCT CGGCATCAGC CTCCCATGGA CACTGAGCTA GCTAAAGTCC GTATTCTTTT GCGCGAATTC ATCGCTGACA TTGCCGCCTT GTACAACGAC GTACCGTTTC ACAACTTTGC GCATGCATCT CATGTAACGC TAGCGGTTAG CAAAATATTG AGCCGCATTG CAAAATTGAA GGACTGTGAC GAAATGGAGC GGCATGAAAT ATCCTACGGT ATCAGTTCGG ACCCACTGAC GCAGTTTTCT GTTGTATTTT CTGCGCTCGT ACACGATGTT GGGCATTCCG GCGTCCCGAA TTCGCAGCTT GTTGTCGAAG AAGCGGAGGT AGCGCTACAG TTTCGAAACA GGTCTGTGGC GGAACAGAAC TCGGTCGTGC TAGCATGGAA GCTGTTAATG CAACCCAAAT ACCAGCGCCT TCGCATGTGC ATTTACAAAA CCAACGTCGA GCGCAAGCGT TTCCGCCAAG TTATGATCAA CTGCGTCATG GCCACGGATA TATCGGACCG CGAGATCAAC CAATTTCGCG TACGCAAATG GGACAAAGCC TTTAAGAATA CTGATAAAAA TACGGACAGA CGCGGTCTAG GTGGGAAAAT GACACGGGAC GAAATCAATC GCAAGGCGAC AGTAGTCTTG GAATGTATTA TTCAAGCAGC CGACATTGCC CATACAATGC AACACTGGCA CGTTTACCGT AAATGGAATG ACAAGCTGTT CGAGGAAAGG TATAAGGCCT ACTTAAGCGG AAGGGCTAGT TTTGATCCTG CTGATAATTG GTTCGCTGGT GAACTTGGCT TTTTTGATCA CTACGTGATT CCCTTGTCGA AACGTATAAG TGAAAGCGGT ATTTTTGGGT TAGGAGCCGG TAGTGTGTAC GAAACAATGG CACGAGAGAA TCGTCAAGAG TGGGAACGCG ATGGTCAAGC ATCGCTGAGA GCCATGATTA AATCCGCTCA TCAAATATCA CTGTTTTCGG TAGAGGAAAC TGACGAAAAT TCGCTTTTGG GGGACAATGA TTCGGTCGAA GCCGTTAGGT GAGGTCGACA AGGAAGAATC TCACCTGTAT GTTGGTTTGT CCAGGTTAAA ATGTTGTCTG GAAAAGTTGC GGAGTTGCTT TTGGTGGAAT CTGCATGTTC GGAGTGTTTG AACAACAAGG GGGACAACAC ACCTATCGAC CTCACCAATA GATCATAACC GTGCTTGCTA TTGTGTTGCT CTCGGGAGGT ATGGAAAGAA GAAATGAAGA ACAAAGATTG CATTTTATTC CTTTTTCCTC ATGAAGGATT GACAGTCGCG GTACGGTCCC TTCGATAGGG ACGGATGTAC GCATGAAGCC AACGCAAAGG TCGTACCAAT TCAGTTGAGT TCTAACGAGT ACAAGCTCTG AGCTGCTTGA GCTGTTATCG TCCTGATTTG TGAAGCCAAG GAATAGCAGC GTCCTCATTG AACGACGAGG GCGTTATCAA TACAGAACTA CTGTACAGTA TGGCGATTCT TACCCGCCAG ATAACTATTG GCACCTCGGG AACAAACAGC TAGAGGGCTA TCCAATACCA TGAGTTTAGC CGTATGCGCA CTTTCGCAAA AGCCGGAAAA GTAGACTGCG GATTCTGTAA AAACCCTGTT CCATAT
|
Protein sequence | MDYFADDASV ADHSVGSLAA IETNAGGAPR RKEEEKVEDE QSPDRLVEKK SRSVACTKWL LLFLLVVSGI FLTVSTFVTT KETRVEDSGR SDATTYASVV AGIFSVLIMV FLRYDYLVAR RQSLVIDMAK RSKTIVDSLF PSMVRERLLQ DNSGQFSIIS AEEPVSFSEL TKINTLANKN ISESDPIRMP QLQSKATLKS VRESEGAALD HRSSQPIADL YKNATVLFAD IAGFTAWSAE RDPPQVFKLL ETVYAEFDSI AEKLKVFKVE TIGDSYMAVT GVPEPDPDHA VTMAKFAYQC LVKMDGVTSD LETLLGSGTK NLHVRVGLHS GPVTAGVLRG QKSRFQLFGD TVNTASRMES TGEKRRIQVS LETAELLIAA GKMHWVKERQ DQIVAKGKGQ MRTFWLDPHK KSKPSGEDNC RIAAKNGSVA QAFKHTLVRQ TLLTDATYVS PWDRRLSEDK YGRLIDWNTD IFLEHLSRVM AYHHSGRGET RSVKTVNHRD EDAEMPPPYN SVADVIFLPA FDPNARHQPP MDTELAKVRI LLREFIADIA ALYNDVPFHN FAHASHVTLA VSKILSRIAK LKDCDEMERH EISYGISSDP LTQFSVVFSA LVHDVGHSGV PNSQLVVEEA EVALQFRNRS VAEQNSVVLA WKLLMQPKYQ RLRMCIYKTN VERKRFRQVM INCVMATDIS DREINQFRVR KWDKAFKNTD KNTDRRGLGG KMTRDEINRK ATVVLECIIQ AADIAHTMQH WHVYRKWNDK LFEERYKAYL SGRASFDPAD NWFAGELGFF DHYVIPLSKR ISESGIFGLG AGSVYETMAR ENRQEWERDG QASLRAMIKS AHQISLFSVE ETDENSLLGD NDSVEAVR
|
| |