Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45871 |
Symbol | |
ID | 7200969 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 521926 |
End bp | 525550 |
Gene Length | 3625 bp |
Protein Length | 944 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180065 |
Protein GI | 219118591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAGC AAGTTCAGGC AGCCCTTGAC CAAATGGTCG AGCCTCTGTT GGACCTGCAG AATCGAGGTG TCTTTTCCCG GGACGAAATT CGCTCCATCG TGGATCGTCG TCGACAATCG GAATACGCTC TACGCCGCCG AGGAAAACTC CGCAAGGCCG ATTTTATCGG GTACATACAA GCCGAAACTC AACTGGAAGC CTTGCGGGCT TTGCGTGTGC AAAGAATTAC TCGAGAAGAG CGTCGAGCGA ATCGAGGCAA AACGTCGCAG GATGATGCTA AGGACAGCAA TACTTCCGGC ACGAGCAAAA TTGGTGACCG CCACATCGTT CAGCTTATTC ATCTGCTATG GACACGTACG CTACGAAAAT TCCGCGACGT GAGCTTCTTT CTGGAGTACG CCGAATTTTG TCGATCCCAA AAATCGTTTG CCAAACTTGC GACTGTATAC GCCCAGGCCT TGGCTTTGCA TCCGAAACAA ACGGGACTTT GGATAGCTGC TGCCTCGCAC GAATTTTTTC AATCAAGTAC GCCGCATACG GCTCGAATTC TGTTGCAGCG TGGAATTCGG GTTAACCCCA CATCGCCCGA TTTGTGGTTA CAGAGCCTGG TTATGGAGCT ACATTTGGTA CAAAAATTAC GGGGTCGTCG CGATATTCTG CGGGGTGCTG GTCGTGGTGG TGAAGAAGAA GAAGATAAAG AATTCTCAGA GCACAAGATT GCTCGTTTGC TATACGATAA TGCGATTCAA GCCATTCCCG ATCGTGTCGA GTTTCGTTTA CAGTGTTTGG ATCAATGTCG CTTGTTTCCG AATACAGAAG ATTTGCAAGC TTACATTCAT ATTTCAATGG AACGAGATTG TTCGTCTCGC CCCGAAGCGT GGATCGCTCG GGCCATGCAC GAATGGGAAC GCCATCGAAA GCTAGGAGAG AATGAGAAGA GTAGTATTGG TTTTTTACAG TCAAACGGAC TTGGCGATGA TTTGAACCAA CGTCAAGAGG ACAACCCGCA GCGAAAAAAG GCTCGAACGC TTCTCCATCA TGAGCAGGAA GCTAGAGACG ATGTACTGAA GGTGATCAAA CAGGCTGTGG ATACCCTTTC GTCTCATGAA ATGTATCTGC AAGGAATTCG CTTCCTTCTC TCATACATGC AGGAATTAGC TGAAGAATTA GAAGAGAGCG ACGATAGCAA AGAAAAGCTA GAACGAAGTC AAAGGTTTCT TTTTCAGCTC TTCGATCAGG CAAAGACCCT TCATTTCGAA ACCTCGGACT TAATTCTCGA GCAGGTAGCG TTTTTGGCTC TTGTTGAGTC GGATCAAGAA GCTAGCCAAT GCATTCAAAA CTTTGTAGAG AACCATCCTA CCAAGGCTTC GATTGATGTC TGGCGCCGCT ATGCTGCAAT GTCACCCGAT CAAGCCGTTG ATATTCTTGA AGAAGCTGCA AAGTATATAT CTGTCGAGCA AGAAGCGCAC ATGGTTGTTC TGTTGGAGCT CTTTGGTGCC AAACTAGCCA TTCCAGACGA GAAGAGCTTG CCGGTCCTCT TTCAAACGAT ACTTTTGCTA GCTCCTGGTT TTGGGGAGAT GAAGGATGTA GAGGACCCAG TGTATGGAAT AAAGAATATT TCATGTGCCT GCTTGCAATA TTTGCGATAC GCCTCTAAGC ATCATGGGCT AAACGAAATG CGGAGAATTT ATCAGGCTGT CTTGTTCGAG TCCTCTTTGG GAACCTTGGA AGGTGGTGAT CCATCCTTGA TAAACATTTT CTTGGAAGAG GCAATGAAGG CTGAAGAAGA ATTCGGCAGT AGTAAGGAGA CTCGATGTCA GCTTAGGCGC CTGTGTGACA TCGCGTTGCA GTTGTTTGAA AACACCGACG AGGAGGAGAA ATATAGACGG CGAAAGGAAA ACATTCTATA CGGGTAAATA CAAGAGGGGG GTTGAGGAGG CGATTACTTT TGTAGCTATG ATCAAGCTGA CGCTAGTGTT GCTGTTTGTG TCATAGAAGG CTTCGAACAC CCTCTGATCT CTCTATACAT GGATTGCCCT GTTCGGTCGG GACGAATATG GTCCAAATAT TCGATGTTGA TGCACCAGAC TTTTCAGCGG AAGCTTTCGA AGTCGCCTAT AGTAAGTTCC GAGTCGTAAA AATTCGCCAC GCTTACAATA TGAAGGAACA ACGATCTTTT TCCTGGAAGA ACATTGGTCC AATTTTTGAT AATTTAGACG CAAGTGACAA AGAGTCTTGG TGTCTCGAAA CAAATGAGGG TGAGCCCTGT TCTCCGGAAT CTTTCCTGCA GTCCAAGCTT GACACGTCTA GGGCATACTG CAGCTTTTTG ATACAACAAG ACGAGGATGC TTATAAATCA GCAATGGATT TGCTTCCAAT TTCTGAACTT AGTTGGACAA ATTGGTCGTA TGAGCCCGCC CTTTGGATGT TTTTCGGCAG AAACCGGAAG GGCAATGATC CTTTGGAAGG ACGTCCAGAA CATACGGATG CCATTTTACA CGACGGAACT TGGCACTATC AGTTGTCCGG TGGAAAGGAA TGGTACCTAC GACCAACAAA GGAGCTCTTG AAACATATGG ATGGCCATTT AACGAAAGCA GAGCGAAGGC TTTGGAGCGA ATCGAGTCGC GTTTGTGTCG CGTGTGAAGA GGGAAGCATC CTCGTGATCA AGTAAGTCGT GGCTTTTGCT AAATCGCTGG TTTCGTGTTG TGTATGAAAA CGCATCTGGC TCATGAGTCT ATTATTTAGC ACGAAGCTGT GGTTTCATAG AACAGTAATT CCATCTCAAA AGCAACCATC CGTATCTTAC GCCAGAGATT TTCGCTTCGA TTTGAATGCT GCATGCATCC GAGATAGTAA GGGAATGACA AATGTAGACG GCCTCTACGC GACGAGTGAT ATCGAAGAAG GAACAATCAT TTTTACAGAG AATGATATGC CTGAATGCGA GCTTCACCGG TCGTCCACGG ATCCAAACTG TGAAGTCGTA GCGTTGGATG ACGGGTCCAG TGCAGTTGTG TCGATGAAAG CTATAACTGC TGGAGAATTC TTCAGTGTGC TCGACTCTGA GAGTGAAGAG GAAGGTGAGG TTGATCCTCC TGGATCAACT TAAAAGTAAT CGGAATGCTT TCTAAAAGCA TCTCGAAAAG TAGCATATCT GTGTCATCAT TGACAGTGGA TATGGACGTC GCTAAGAATC AACGAATCCT TCTCGGAGAA TAAATTGCCA GGCTTCCTAC CTTTCACTTG AACATGCTCC CATCGACCCC TAGATCCTAG GCCATCCATT GTCACTACAT GCTCAATCGT GCCATTACCG TCAAAAACAG ACTGAACAAC GTTGATCTTA CCATTGCCCT TGATTTGGAA GATAGGATCT TTGCGGGGCT CTCCAGAAGC TATTTTGGGC CGACGAAATG TAACTCCTTC ACACCATCCA CCGTTGCCCC TCCATACTAT AGTTCCAGAT ATTTCAACAA CTACGTTTGC TGGGTTGTAT TCATCTCCCA CGATGCGGAT CGGGACATTG ACTGTCACTG TTCCTTTAAT CCAGTAGTGC CCGTCACCGA GCTCTGTAAC ATATATATAT ATATATTAGA ATCTGTACTT TTCCGGCCAA AAAAA
|
Protein sequence | MAEQVQAALD QMVEPLLDLQ NRGVFSRDEI RSIVDRRRQS EYALRRRGKL RKADFIGYIQ AETQLEALRA LRVQRITREE RRANRGKTSQ DDAKDSNTSG TSKIGDRHIV QLIHLLWTRT LRKFRDVSFF LEYAEFCRSQ KSFAKLATVY AQALALHPKQ TGLWIAAASH EFFQSSTPHT ARILLQRGIR VNPTSPDLWL QSLVMELHLV QKLRGRRDIL RGAGRGGEEE EDKEFSEHKI ARLLYDNAIQ AIPDRVEFRL QCLDQCRLFP NTEDLQAYIH ISMERDCSSR PEAWIARAMH EWERHRKLGE NEKSSIGFLQ SNGLGDDLNQ RQEDNPQRKK ARTLLHHEQE ARDDVLKVIK QAVDTLSSHE MYLQGIRFLL SYMQELAEEL EESDDSKEKL ERSQRFLFQL FDQAKTLHFE TSDLILEQVA FLALVESDQE ASQCIQNFVE NHPTKASIDV WRRYAAMSPD QAVDILEEAA KYISVEQEAH MVVLLELFGA KLAIPDEKSL PVLFQTILLL APGFGEMKDV EDPVYGIKNI SCACLQYLRY ASKHHGLNEM RRIYQAVLFE SSLGTLEGGD PSLINIFLEE AMKAEEEFGS SKETRCQLRR LCDIALQLFE NTDEEEKYRR RKENILYGRL RTPSDLSIHG LPCSVGTNMV QIFDVDAPDF SAEAFEVAYN ASDKESWCLE TNEGEPCSPE SFLQSKLDTS RAYCSFLIQQ DEDAYKSAMD LLPISELSWT NWSYEPALWM FFGRNRKGND PLEGRPEHTD AILHDGTWHY QLSGGKEWYL RPTKELLKHM DGHLTKAERR LWSESSRVCV ACEEGSILVI KTVIPSQKQP SVSYARDFRF DLNAACIRDS KGMTNVDGLY ATSDIEEGTI IFTENDMPEC ELHRSSTDPN CEVVALDDGS SAVVSMKAIT AGEFFSVLDS ESEEEGEVDP PGST
|
| |