Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44478 |
Symbol | |
ID | 7197709 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 689115 |
End bp | 692602 |
Gene Length | 3488 bp |
Protein Length | 549 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178281 |
Protein GI | 219114971 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.578489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCGGTCGG CTTCGAATCC AACTCATCGT ACAAACCGCG ATCCAATAAC CAAAGCACAA CTACACAAAA ATATTGCACC ATGTTTGGAG GGGGAGGTCG TAAGAAAAAG GATGAAAGTC TTTCACAGGC GATGAAGCCG GAAAACCCGC CGTCAGGTGG TGGAAGCGTA ACTGGATTCG ACCCTGAGGG TCTCGAACGT GCGGCGAAGG CCGCTAGGGA TCTAGATAAC AGTCGGAACG CATCCGCAGC AATCGAGCTG ATCAAGACAC AGGAGGCTAC GAAGCAGCAT GAAGCAGCGG CGAAGAGAGC CGAAATGGAT GCATATGCGC AGCAACTGCG TGCTCAGAGT ATCGAGAAAG AAGCCGACGA GGCACGCAAA ACCCTCGATG CACAAACTCA ACACGAACAG CGTCGAGCCG AGTACCAAGA TCAACTGGAA CGCAAGCGCC AAGTTGATAT GCTTAATGCG CAAAAGTACA TGCAAGAGTA CGTGGCGGCT CAATATCGTT GTAGGAAACA CAGCTTTTCC GTATTTGTTT GCTTATTGTA TTTATTTCGA TCGCTTGTGA ATCTTTTAGG GAGCAGCTCA AAAAACAAGA AGAGATGGTT GCACGCCAAG AGGAGATGCG TCGTAAAACA GCGCAGTATG AGGCTGAGCT CCGGACGAAG ACCGAAATTG CTAAAGCAAA AGCCGAAGCT GAGGGACGGA TCGCGCAGGA GCGTCAGAAC CATGATCTCA TTTTGGATAA AGTTCGTCTC GAAGCCTCCG AAAGCCGCGA TACAGTTTTG AAAGCAATCC AAGATGGCGG CAAGCTTCTT GGAGAAGGAC TTTCTAGCTA TCTTAATGAT ACCGAGAAAC TCCGAAACAC TGCGTTGACG ATAACTGGCA TCGCCGTCGG GGTGTATGCT GCACGGACAA GTATTGGTAT CACTGGTCGT TTCGTTGAAG CACGTTTGGG AAAGCCGAGT TTGGTTCGTG AGACTTCACG AATGACTGTC TCGCAATTTT TTACCAGCCC TGTAGCATCT AGTCGGCGGA TATTGGGGAT AGGCGTACAC GAGCAAGATG CCTTGAAAGG TATCGTTTTA GAAGATTCCC TCGATACTCA GCTTCGCAAA GTGGCGGTAT CGACGGCTCA CACCAAAAAG AATCGTGCCC CTTTCCGTCA CCTGCTACTT CATGGTGAGT AGAAATTGAT GCCGGACGTT TTTTTACCAA CCTTTTTTTC AAAAGTTGCT TACAATTGTC ACGTTCTTTC GCAGGCCCCC CTGGGACGGG GAAGACCATG TTCGCACGAC AACTCGCGCA GCATTCTGGA CTGGACTATG CTGTTTTAAC AGGTGGTGAT ATTGCTCCAC TAGGACGGGA AGCCGTCACT GAACTTCACA AATTGTTCGA TTGGGCCAAA ACAAGCCGAC GCGGTCTACT ACTTTTCGTC GATGAAGCCG ATGCTTTCTT ACAGTCCCGT GAAAACTCTC GTATTTCGGA GGATCAGCGC AACGCATTGA ACGCGTTTTT GTTTCGAACC GGTACAGAAA GTGATCAGTT TATGATGGTG TATGCGAGCA ACCAGCCTGC TCAGTTCGAC GAAGCTGTCA TGGATCGTAT CGATGAAATG GTAGAATTTG ACTTGCCAGG ACCACACGAG CGGCGAAAGA TGATCGCCGT TTATATCGAT AAGTACTTAT TGAACCCACC AAATCGCTGG ACGAGAAAGG TAGAAACTAT CGACATTGGA GACGCAGAGA TTGAAGAAGT TGTCCGCGAA ACGGAAGGTT TCTCTGGTCG CGCAATATCC AAGCTCGCTA TTGCCTGGCA GGCCGCCGCT TACGGAACGG ACGGTGCCAT CCTTGATCGC GAAACATTTT TCAAGACGGT GGAACTGCAT AAAAAAAGCA TGATGACAAA GGAAATCTGG CTCAAAACCG CAACGAAACG CGCCCAAATG CTTACTTCGG ATCGTTAACT CTAATTTAAG CCACGATCGT CTTCTTTTCG TCTATTCACA TCTTAAATCT TGTTTGTTGG TTAACTTTTA TTCCTTATTG GAGCTGATTC AACTCGCAAT CCTTCGTTGA AGTTCACACC CTGCGTTGCT AGCGGGCAGA CTCTGGCTTT CTCCCAGTTT GCCCACATGT TTCCGATGCG ACGACGAGCT CCTTCCGTGT TCACATAAAC GCAGGCGAGA GCATCATGGC CACCAGCGCC TGGTACGAGC GCACCAATAA CTCCAGGCAA GGCAAGGGTT GCTTCGATCA GCTGTGTTTG TTCTTCCGGC TCAACAGGAA CCTTGGCAGC TACACCCAAT GCTTTCAACT CTTTCCGAGC CGCCTGTAGA GCGTTGCGTA ATGAAATGAG ACTCTCCTCA ATCTTATCGT GCGCGCACCA CTGCTCCATC GAACAATTTG CTAGCCTTTC TATTTCAGCA TGATCGATAG GAATTGCGCC TATGTGCTCT AGCCGGTCCA CCACCTGTCG ATTTATTTCC GCTAACTTAT GCCAGTGAGG AGCATTTCCG TTTCCAAAAC CTTTCCTCCA CTTTAGCACT CTTCGAGCCA TTGAAGGGCT TTCAGATCCC CCTGAGACAT CCGCCAGCAT TATTTGCAAG ATCGCGGGTA GTCGAATTGG AGCGGCAACA CCTCCGGTCC AAGTTTTCTG TACGATAGCT CTCAGAATGG CTTGTACATG CTTTATTTCA GATTTCGTTT TGTCTAATTC TCTTAGCAAA TCAGCAAGAA GATATTCAGG AAACCGCCGG TAGACATGGG AGCCGTGACA AGCCGCTGAA ACATCAAATC CACTGCCAAC TTTTCCTTGC GCGTGACAAT GCGAAATTTG TGCTAAATTG TAAATGACGG AAGGCTGATT GCATGCGTAG CATAGCGATC CCACAAGACT GGTCACTAAG CAAGCACTGC TACCGAGTCC AGTTTTGAGC ACATTACCAT CTGGCCCGGA AGTAGTAGCG GGTAAAAACG GGGGAAGTAG TTCTACCGAT TTCAGAGAGC GATCCAGACC CCGCTCTTGC AGATGCGGAA TCAAGCTGTA AAAGTCGTTA TCTGCTTGAA TATCCAGAGT AATACGACAA AGGCAATCTT CCTTTGCAGT CAACAGGTAA AGTAGTGATA CTCGTAGACT CTTTTCAATG AAAAGATTCA CGCTATTGTT CGAGGCATCT GCAGACAAAG TCAATGTCAC AGAATTGTAA AGATACTTCC ATGTTTGCCC AAACTGTGGA CTGTTCACAT CAATCTTAAC ATGCGCTGAC GCAGTTTGGT CAAATTCAAA GGTGGCAGTC GTATAGAACC GCTTATCGAC GGCTAGGACG AGACCAGTAT TGGGGGACTC CAGCACCAAA TAGCCACCGG CCAGAAGAAT CTTACCCGGC GCAGAAACTG TCACCTTCTT TATAGAGGTC AGCATGGACA CCACACTATC ATTAAGCTTG ACCAATCTCG ACCAAATATC GATTTGTTCA CAGTCACT
|
Protein sequence | MFGGGGRKKK DESLSQAMKP ENPPSGGGSV TGFDPEGLER AAKAARDLDN SRNASAAIEL IKTQEATKQH EAAAKRAEMD AYAQQLRAQS IEKEADEARK TLDAQTQHEQ RRAEYQDQLE RKRQVDMLNA QKYMQEEQLK KQEEMVARQE EMRRKTAQYE AELRTKTEIA KAKAEAEGRI AQERQNHDLI LDKVRLEASE SRDTVLKAIQ DGGKLLGEGL SSYLNDTEKL RNTALTITGI AVGVYAARTS IGITGRFVEA RLGKPSLVRE TSRMTVSQFF TSPVASSRRI LGIGVHEQDA LKGIVLEDSL DTQLRKVAVS TAHTKKNRAP FRHLLLHGPP GTGKTMFARQ LAQHSGLDYA VLTGGDIAPL GREAVTELHK LFDWAKTSRR GLLLFVDEAD AFLQSRENSR ISEDQRNALN AFLFRTGTES DQFMMVYASN QPAQFDEAVM DRIDEMVEFD LPGPHERRKM IAVYIDKYLL NPPNRWTRKV ETIDIGDAEI EEVVRETEGF SGRAISKLAI AWQAAAYGTD GAILDRETFF KTISCGIKL
|
| |