Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42770 |
Symbol | |
ID | 7196145 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1055927 |
End bp | 1061194 |
Gene Length | 5268 bp |
Protein Length | 1086 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177212 |
Protein GI | 219110921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTTGTTAT TGTAGTTGTT ATTGTTGGTA GTGTTTGAAT TACAGGAACA AGCCTAAAAC CATGTGGGGT TCGTCGTTTA CGGACTGGGC CAAAAAGGCG CAGGAAGAAT TGCAGGAACA GGCGGCTCAC TTGACGGTCG CCACGCCCTC TAGTTTATTC AATCTCGACG CCATGCAACA ACAGGAAGAC GAAGCAGCAA CAGCCAAAGC AGAAGTGTCC GTAACAACCA ACGACGTGAA TGACACTGGA TCATTACCAC CACCCGTCAC GACGTCGTGG ACTTCTCCGC TGCCGCCATC CTTGTCGGTG CCCCGCGTCC ACGCAACGTC CCGACCAAAA CCGTCGCTCC TTGTTCCGTC GGCCGTTGCC GAACAGCGGG AAACACTCCG CACGTCGACC GACCGAAAGC CGCCGCAAAA AATGTCGTTA CCGGAAGCGA CGACTGGAGC ATTGCACGCA CCCGTACGGT CCCATGCGGA TGGATGGGAG GAGAATCTGG ACTCGCACGA CTTGGAAGAC GGAACCCCCG CAACCGTCGA CGACGCCGCT GACTATAATC CACAATCTGT TCTTCCCGAC CGAGCTGCCG TAATGGACGC GCCAGTGGCG CCCATCGCGG GGCATCAACC AATCCACGAT ACACGTGCGC ACGACGAACT CCCCCGGTCC GACGAGGAGT CGTCACGATG CACGGAAGGC TGACGACGAA ACCAGTCCCC GCATCAATGA ATCCGACGAG GACATGAACG ATGACGACCA CGACAACTTT GACGACAAAG ACGAAGGACT ACCTTCTGTT CACAAGATAC CGCTCGAATT GCCGGACGCG AAAGGCCAGT CTCTGTCGGA TCCGGCGACG CCAACCGCGA GTGTTGAAAA TTTGTCGTCG GTCGAGAATG TGGTCGACCT ACCACACGAC GCGTCCAATC CGGTCGCAGC GTTTGTCCCC GCCATCACGG ACCCGGATCC TACCACAACG TCCAATGGAA CGCTCGACGC CGTCGACGCG GATTTGCCTG CCACCAGAAT CACCAGCACT GTACCGCTTA AGAAATTGCT CGTTCTGTGT TCCATGAACT CTCTCAACAA GACGGCACAC AAACGTCAAG AACGAGCTTT TACCATTCTC CACGCCCGTC AGATTCTCTA CGACGTTGTG GACGGAGCCG ACCCGCAGCA CAAATCCTGG CGGGAAGAAT TGTTCACACT GGCCCACGCG GCCAAGGGAG AGTATCCGCA ATTCTTTCTC ATGGACGTGG ACGACGGTAG CACTACCTAC TGGGGACCGT GGGATCGGTT GGAATACGCC AACGACAACG GCAACCTGGC GGAAGAGTTG ACTGGACGGT TGCAGACCGG CTGGTCACCG GAAGACCACG TGGCGGATTC CGCTTTGCCA CCATCGCGAT TCGCCACCCA TCAACACGCT AGCGCCGTCG CTATGGACGA CGAACGGGAA CAATTTTCGC AGCAAATGCA ACGCGTCGAA GTCAATCACG CCGCCGAACG ACAAGCCCTG GAAACGGAAC ACGCCCGCGC ATTGGAACAA GCACTGGCCA GTACGAATCA TGACGCATGT ATCACGGAAC GGGTGGCGTT GCAGGAAAAG TACGAAACCG CCTTGGATCA AAAGAACGAC CAATTACACG ACCTGGTACG GGTCAACGAA GGGTACAAGC TCAAACTGGA AGTATTGCAA CGGGAGGTGA CCGGAACACA GCAACTCTTG CAAGCGCGGG ATGGTGACCT GGGTCAAGCA GCGCAAGCCC ACCACGATCA ATTGGTAAGT CTGCAGTCCC AATTAGTGGA AAGTTCGCAA CGAGCGACGG AAGCAAATGA GCAAGTGGAG AGTCTCAAGG CTGCTTTGGA AACGTCTCGA GCTGATTTGG CGGGTAGCAA GCAAGAGCTG GCTGATCTCA AGGCTCGCGT CAAAGTGGTG GCTACGGAGC TGAAAGACCG CAGAGTGGAG TGTCGCGAAT TGCATACAAA AGCGGACGAA TTGAACGCAG TCAACCTTGA CCTCAAATCT CGGGTGGACG AGTTGAAGTC ACAACTCACG CACCAAAACC GCAACGGATC CGAAAAACAA GAAGAAATGG AACAGCTCAA GGTCAAACTT GTCGACGCAG CCATCGTCTT GGAGCAGGCC GAGAATCGGG TGCAAGAAGC CAAGTCGGAA GGCGAAAAGG CTCTGGCTGA TTATAAACGC AAAGCGCAAA ATTCGTTGTC GATGGCCAAT GCGCGGACAG CGGCAGCCGT TCAGGCCAAG GAAGAAGCCG AGCTCGAGGC ACGGGCGGCT CGAAGTACCG CCGATTCTTC GATGGATCGG GCAGTCAAAG CCGAGATTGC CAGTAGGGAG GCGTTGGCGG AAGCGAAGGC CTACGTTGCG GCAATGGAAA AAGAAAAATC CGAGGCTATA CAGAAATTTG AAGCGGCTGG AGCCGAAACG AAGTCTGCTC AGGAACAAGC AGCCAAACTT CAAGAGGATT TGTCGCAAGC AGTTGAATCT AAATCTGGAA TTGCTGAGCA GTTGCGACAG TCGACTTCCC GACTTGAGTC AGAACAAGAA AGATCGGCAT CGCTGAGAGA AGAACTGTCA AAGATGCAAC ACAAGATCGC AGAAGTGCAG GACGACAGTG CCATTCTCCG AGCACAACTT AAACGAAGCG AATCTGAGCT TACGACTGTC AAAGAGTCGA TGGGCGAGGG CTCCATGGAG TCAAAACCAG TAAGCATGGA GGAAAATGGA GTGTCAGAAA AAGCTACCGA CAACGAAACG ATCCGAATCT TGCAAGAGGA ACTTCAGGAT GCCAACGCTG CTATTGAAGA AATGAAGGAC GCTTTGAAGA GCGTTGTTGA GATGAATGGC TCTGTACCAA CGAAATTTCA ATTGGAATCG ACAGACCAGA CCAATTATGG CTTGGACGGG TCTCGAAACG GAACGCATTC GAATGGGGGT AATGACGCCA CACCGTTGTT TTTCGCAATG GAAAAGCAGG CGGAACTCAA GACTGCACGA AACGAAATCA ATCGACTTGC GAATATTCTC GCTGACGTTC AATCAGAAAA GATGGAGGCG GTTGACGCAA TGGAGGACAT GCGGAGAAAG ATGGAAGAAT CCGAGTCGAA GCTTAATCGC TTCGAAAAGT TGATGCCGAT ACATGATCCT GGAAACACAA ACGGCACTTG CAGCGAGACC AACAGTGGCG CTACGAACAT TGAGTATTTA AAAAACATTA TGCTCAGTTT CCTTAACGCG AAGACAGCCG CTGAAAAGAA GAATCTTGTT CCAGTGATTG GCGCTGTTTT GTGTCTCACG CCTCACGAAC AAGCTGCTGC CGCGCAGAAC ATCGATCAAG CAACGAGTTT GGGAGGTGTT GGTCAGAGTC TCTTCGAATC CTTGAGCGGA CGCTTGTCTT GAAACTGTAG TAACCGATTT AAGCTTGCTT CTAATTGTTT ACAGATAAAA TTACTGTTCT GATGATATTC CTGTACTGAT CCCTTCCCCT TTTCAGGGTA CGCCGTGAAG GCGTTCCCAC ACTTTGTCGA AGAAAATCTT GCGCCAACCC GGCTCGAGAC GGGGCGCTCT ACTTTTCGCT TCTTTCAGCG AAACTAGCTG CTCTTTGATT TGAGGTGGAC ATTTACGGCA CTTCAAGATA TGAGCGTGAA TATTCTGTGA AGTGCTGTTA GTCGAAAGGC TCCTGGAAGA AATCGGAAAA TATTTCCCTA AACCTGCGTG CCCATGGCAA TGCCGACACT GAAAGCCGGG ACAACCAACA GGACCTTTCG ATCGTGCGAC AAAGCGATCA GCTTCTGTGA ATTGGCAGTT TTCAACCTGA CGCATCAGAA AGTACACATA CGACGGCACC ATGCTCATGT CATCGGTAGA TACAATATCC TCTCCATCTT GAAGGTACGC TCTTTTGACG GAAACGTCTG AATCACTGGC TGGTTTTTCG ACCTTATCGG CGCTGTTTGA GGGATCAGCT CTTCCAGAAA TATCTGTCCC AAATCGGATT CCGTCGTCCG TATCGACCAT CCCCAAAGAC TTTGCAGAAT CAGCCCAATA CTGTCGAGTG GTTGGTGTCC ATGAATCGAT ACTGCTGAGC TGAGTCAATT CTCTTTTTTC ACTTTCAGAA ATTTGGTGAC ATAAAGGTAG GTGGACGCGC TGCCAGCGCT TCACTGATTC GTAAAGACCA GACATGGAAG TAGGAAAGGA GACAGCTGCA ATGGCTACCC CATCACCTGT ACTAGTCTTG CAGAATCGGC ATCGAACGCC TACTTGATGC AAACCAATAC GACCCCGTTT TGAAGACCGA GCGACGTCGT CTTGCGTCGC GGAAAATACT TCGACGCAAT TGGATCGAAT ATAGCAGTTC ATTTCCGAGA GCCATTCCGG ATCAGATTCA GCATTTGCTA AGGAAATTGA CCCTCGGAAC CATTCCCTGT CATGACTTGA TGTAACTTCA GCTTCTTCCG TTTCCACTCT AGTACTACTG TGGATACGCT CTGTACGCGG ATACGAATGG TTCTTTTGCA GTGCTAGGGC GTCCATTGGA TGCCATACAT GTATGGGACG ATTGCTGACT AGGACACGCC GCATTTGGCA GACTTCTTCA TGCATACAGG CCTGCTCAAA GGTATGAAAA GAGGCGACGT TGCAAAAATC ACAGACCCAT TCATTCGACA TTGATTGGGC GTATGTAAGC TGACGCATCT GGCTACCTTG TGAGCTCAAT GGAGAAGGTA TGCATTGCCG TCCCCAGCTA GCTCCCAACC ACTCTCTAGG CATTCCCGTT GGAACTCCAC CTCCCCGGGC AAAACCCACG GGCAAGCGGG AAGCTTCGGC AATTCGTCTT GGAGGGAAAG ACTCAAGTCG AGCGACCGTC GCATCAACAC CCCCGACGTT CGGTGAAAAG CGGACTCTAC TCTTAATCCT TCCGGGGGAG CCCTTTTCCG GAGAGACGAC TAAGGGTCCT TTCTGGGGAA TGTGCGATTC CCAGGAAGGC GATCGACCAT CCTCGGGATC CCGAGCTTCA ACAGCTCCTT TTCGGCTGCT TAAAGTAGGA GTCATCGTAG CAAGAGAGGC CAGCGCAAAC GCGGACGCTA ATTCGGTTGC GCCGACAGAG TTCTTCGACG TACAAGCTCG AGATTCTTCG CTCAGCAACA GTTTTTGTCG CTTCGCATGC GGCGACGGGG AGAAGTTGGT AAGTTCCCTC CGACAGGAGC GATCTTCCGC CATCGTTTCA GCTTTATAGT GCATGGAA
|
Protein sequence | MWGSSFTDWA KKAQEELQEQ AAHLTVATPS SLFNLDAMQQ QEDEAATAKA EVSVTTNDVN DTGSLPPPVT TSWTSPLPPS LSVPRVHATS RPKPSLLVPS AVAEQRETLR TSTDRKPPQK MSLPEATTGA LHAPVRSHAD GWEENLDSHD LEDGTPATVD DAADYNPQSV LPDRAAVMDA PVAPIAGHQP IHDTRAHDEL PRSDEESPRI NESDEDMNDD DHDNFDDKDE GLPSVHKIPL ELPDAKGQSL SDPATPTASV ENLSSVENVV DLPHDASNPV AAFVPAITDP DPTTTSNGTL DAVDADLPAT RITSTVPLKK LLVLCSMNSL NKTAHKRQER AFTILHARQI LYDVVDGADP QHKSWREELF TLAHAAKGEY PQFFLMDVDD GSTTYWGPWD RLEYANDNGN LAEELTGRLQ TGWSPEDHVA DSALPPSRFA THQHASAVAM DDEREQFSQQ MQRVEVNHAA ERQALETEHA RALEQALAST NHDACITERV ALQEKYETAL DQKNDQLHDL VRVNEGYKLK LEVLQREVTG TQQLLQARDG DLGQAAQAHH DQLSLKAALE TSRADLAGSK QELADLKARV KVVATELKDR RVECRELHTK ADELNAVNLD LKSRVDELKS QLTHQNRNGS EKQEEMEQLK VKLVDAAIVL EQAENRVQEA KSEGEKALAD YKRKAQNSLS MANARTAAAV QAKEEAELEA RAARSTADSS MDRAVKAEIA SREALAEAKA YVAAMEKEKS EAIQKFEAAG AETKSAQEQA AKLQEDLSQA VESKSGIAEQ LRQSTSRLES EQERSASLRE ELSKMQHKIA EVQDDSAILR AQLKRSESEL TTVKESMGEG SMESKPVSME ENGVSEKATD NETIRILQEE LQDANAAIEE MKDALKSVVE MNGSVPTKFQ LESTDQTNYG LDGSRNGTHS NGGNDATPLF FAMEKQAELK TARNEINRLA NILADVQSEK MEAVDAMEDM RRKMEESESK LNRFEKLMPI HDPGNTNGTC SETNSGATNI EYLKNIMLSF LNAKTAAEKK NLVPVIGAVL CLTPHEQAAA AQNIDQATSL GGVGQSLFES LSGRLS
|
| |