Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46320 |
Symbol | |
ID | 7201504 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 916898 |
End bp | 920214 |
Gene Length | 3317 bp |
Protein Length | 1022 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180731 |
Protein GI | 219119962 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAGGATCC CTTGGTCGTC GAGAAGACTG GCTGGTGTTA TCATCGTCGG TGTGTAGTAT CGTTATCGTC ATCATGGCGA CTTCCAACTT TTTCCACAAG CCCGAGTTGG CCTTGCGACG CGCGCAGGAA CTCGAAGGCA TCCATCAGGA CGACGCCGCC TTGGCCATGT TGCACGAAGT GCTATCCTCA CGCCGTCATC GTACCTGGAG TCCAGCCTTT GAACAAATCA TGATCGTGTA CGTTGATCTC TGCCTCAAGC TGTACAAAAC ACGCGAAGCC AAAGATGGTC TGCATCAGTA CCGTAATTTG AGTCAAACAC AGGCTCCGGG ATCGTTGGAA AAGGTGATTC GCTACTTACT CACCGCGGCG GAGAAAAAGT GTACCGAAGC CAAAGCTTCC GCGGACGCCC AGCAGGATCA GATTTTGGAA GAGAATGGGA ATGGCGACGA CGAAGACGGT TTCCGGGCTT CTCCGCAGGC AATCTTGCTA TCCACCATGT CCACGGACCC GGCCAAATCG CAACGCGATT CCGCCCTCCT CGTGCCGTCC CTCAAGTTTC TCTGGGAAAC CTACCGAGCC GTTCTGGACA TTTTGCGATC CAACTCCAAA CTAGAACACG TCTACCACTT TGCGGCACAG AGCGCCTTGC AGTTCTGTCA AGATTACAAA CGTCGAATGG AGTTTAGGAA TCTTTGTGAC ATGCTCAGAC TCCATCTCGG CAATCTGCGC CAGTACGGCA ATCTCGACGC CAACGATGAC GGAAAAACCA ACAACAAGGT ACGTATGCCA CGAATTGACC TTTCGCTCGA CAATCCCCCG AAACGGTGCA CTCACACTGC TCTTAAACTT TGGTGGGTCT CGCGTTTGCG CCATTAGGTC CGCGGTTGGG AAGGTTGGAC TACGGAATCG ATCGAGTTGC ACCTACAAAC GCGCTTCATA CAGCTCGAAA CCGCCAGCGT CCTGCACCGC TACACCGAAG GATTCCGGAC ATGCGAGGAT ATTTTCAATA TTCTCCAAAT CAGCCAGGCT CGTCGCAAGC ACAATCCCGA CGTGCCCGCA CCCAAGGCCA AGCTCATGGC ATCCTACTAC GAAAAGCTCA CGACCCTGTT TTGGGTTTCG GAAAATTACC TTTTCCACGC CTTTGCCTGG TACAAGTACT ATACCTTGTG CAAAGAGTTC AATCGTGGCA TGTCGGAAGA CACGAAGCGT ATGCAGGCGT CGGCCGTCCT ATTGGCCGCC TTGTGCATCC CTTCCGTCCC CGCGAACGAA AGCAAGGCCT CGCATTCCAA CCAACATACT ATTGCCACCA CGGTCGAAGA CGGGATAGTG AAGCAAAAAA TGGCCCGCAT GGCTACCTTG TTGGGCTTTC ATACGCGCAA CCCGACCCGC GACGCCTTGC TGGCGGAAAT CCGTAGCAAG GGGATTCTGG AACAGGCTCC GGCCTACTTG AGAGAACTCT ACGAGCTTTT GGAAGAAACC AACGATCCGC TTATTATGGT TCAAAAGGCC AAGCCGCTTT TGGAACAGCT GCAGCAAGAG CTCGGGGCCA CCACGTCTAA CGATGTCAAG AACGACGACG TCGATGACAC GACCTTGGGT CGCTACGTTA AGCCCATCAC CAACGTGCTG TTACTGAAAC TCATTCGCAA CTTGTCGGCA GCTTACCACA CGGTATCAAT GGATCATTTG AAATCTCTCA CGTCTGGTCT CGATCTGAGC TTCGAACAGG TGGAAAAGAC CATCGTCACG AGTTCAAAAA CGCTGGCTGT GCGTCTGGAT CACCGTGCCG GCTGTCTACG TTTCGGTAAC GTGCAACTGG AATCGGACGC AATGCGCTCG CAGCTGGTAA ACTTATCGAA GCAGCTACAG GCCGTGTCCA ACGTTCTGAC CCCACCGGAT CGGCAAAGCG TGTTGCAGTC CCGACTGTCC ACGTACCAAT CGGTCCGCGA AAATCTCCAC GCCGAACACG CGGCCGTGCT GGAACGTAAA AACTTGATCG AAACGCGCAA AGAAGAAACC GAACGAGTAG CGCAAGAAAA GTCTAAGCAA GAAGCCCGTG TCAAGGCGGA AGAGGAAGCA GCGCGTAAGG CCGAAGAAGA ACAGCGCATC GTGCGGGAAC AGCGCTTGCG CGAGATTGAA AAGCAGCGCA AAATTCAACA AGAGTTGGAC AATCAAGAGA AGAAACGGTT TCTGGCCGCC ATGGGAAAAA AGACGGAGGA TATTTCGGAA GAGCAAATCG CCAAGATCGA TACGGAAGCC TTGCAGCGGG AGCACGAAGC AAAGATCAAC AAGGAACGTG AGGAAGCCGA ACGCAAAACT CGTGAGACGG CCAAGAAACT GGATTATTTG GTCCGGGCGA TTCGTATCGA GGAACTGCCT CTGATCAAGA AGAAGTACGA AGAAAAGACG AAATTGGACA AGGAGCGGTA CGAACAAGAG AACATCGAGA AGGCGCAAAA GGCTAAGTTG CAATGGGAAG CCGATGTGAA AGACAAGGCT GTGTTGGAAT CCCACAACGT CTTTGCCTAC TGCTCTGAGT TTGAAAACTC CGTAATGGTG GGACGCCAAG CCGAGCATGA CGTAATTTGC CAAAGAGCCG AAGAAGAGGC AGAAATGGAA GCCGAAAAGG CCAAAATTTC CCGAGCTCGC AAGCGGAAAG CAGCGGAGGA GAAGCTTATG GCTGCCGAAG CTGCCCGTGA GGCAGAGGAA GAAGCTCGCC GGAAGGAAGA GGAGGAGAAG CGCAAGAAAG ATGAGGCTCG CCGGGAGCGA GAAGCCAAAG AGGAAGAGCG CCGGCGGGCA GAAGACGAAC GAATGGAGGA AGAGCGTCGA AAAAAGGCAG GTCCTGCCAA GTACATTCCT CCATCACAAC GCTTGGCTAG CGGTGGCGAA CGTGGTGGCG GCGGTGGAGA AGACCGCCCC AGTCGATTTG GTGGTGCCGG ATCCTACCCT GGAGGTGGAC GTTATGAAGG CCGTTCGGAC GATCGAGGCG GTGGCTGGCG CGGTGGTGGC GACCGAAGTG GTGATTATCG TAGAGGTGGA GATGATCGCA GAGGTGGAGA TGATCGCAGA GGTGGAGATG ATCGCAGAGG TGGAGACGAT CGCAGAGGTG GAGACGATCG TAGAGAAGGA GACGATCGTA GAGGAGGCGC ATACGGAGGA GATCGTCGTG GCGGCGCAGG GAGTGGTAGC TACAACGATC GTCGTGGGCC CCCTTCGGAC GGCAACAGTC GTTGGCGCTA AGGAATTTCA TTGGTGGCCG TGAAGAAACA AAAGTAGCTT TTACACAGAG GTTAGACGTC TTTCTATTTT ACGAGTA
|
Protein sequence | MATSNFFHKP ELALRRAQEL EGIHQDDAAL AMLHEVLSSR RHRTWSPAFE QIMIVYVDLC LKLYKTREAK DGLHQYRNLS QTQAPGSLEK VIRYLLTAAE KKCTEAKASA DAQQDQILEE NGNGDDEDGF RASPQAILLS TMSTDPAKSQ RDSALLVPSL KFLWETYRAV LDILRSNSKL EHVYHFAAQS ALQFCQDYKR RMEFRNLCDM LRLHLGNLRQ YGNLDANDDG KTNNKVRGWE GWTTESIELH LQTRFIQLET ASVLHRYTEG FRTCEDIFNI LQISQARRKH NPDVPAPKAK LMASYYEKLT TLFWVSENYL FHAFAWYKYY TLCKEFNRGM SEDTKRMQAS AVLLAALCIP SVPANESKAS HSNQHTIATT VEDGIVKQKM ARMATLLGFH TRNPTRDALL AEIRSKGILE QAPAYLRELY ELLEETNDPL IMVQKAKPLL EQLQQELGAT TSNDVKNDDV DDTTLGRYVK PITNVLLLKL IRNLSAAYHT VSMDHLKSLT SGLDLSFEQV EKTIVTSSKT LAVRLDHRAG CLRFGNVQLE SDAMRSQLVN LSKQLQAVSN VLTPPDRQSV LQSRLSTYQS VRENLHAEHA AVLERKNLIE TRKEETERVA QEKSKQEARV KAEEEAARKA EEEQRIVREQ RLREIEKQRK IQQELDNQEK KRFLAAMGKK TEDISEEQIA KIDTEALQRE HEAKINKERE EAERKTRETA KKLDYLVRAI RIEELPLIKK KYEEKTKLDK ERYEQENIEK AQKAKLQWEA DVKDKAVLES HNVFAYCSEF ENSVMVGRQA EHDVICQRAE EEAEMEAEKA KISRARKRKA AEEKLMAAEA AREAEEEARR KEEEEKRKKD EARREREAKE EERRRAEDER MEEERRKKAG PAKYIPPSQR LASGGERGGG GGEDRPSRFG GAGSYPGGGR YEGRSDDRGG GWRGGGDRSG DYRRGGDDRR GGDDRRGGDD RRGGDDRRGG DDRREGDDRR GGAYGGDRRG GAGSGSYNDR RGPPSDGNSR WR
|
| |