Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_13581 |
Symbol | |
ID | 7202037 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 543966 |
End bp | 547938 |
Gene Length | 3973 bp |
Protein Length | 1165 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181225 |
Protein GI | 219121754 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAAGACC AATACAGTCG CAAGACACCA TTGGAGCACG TCCTGTTGCG ACCGAGCATG TACGTCGGCT CGAACGAAAA GGCACAGCCA ATAGCTTGCT GGGTACCCGA TCCGATTCCA ACCCCACCGA CCGAAGAACT TTTAATGAAA ACGACAAGCG TTGTACAATA TGCAACAGTT CCTGATTATC CCATTAAAAT GGTCCGGCTT GAGTACGGAT TGGTACCGGC TTTAAACAAG GTGCGACACG GTTTGTTTGG TAGGGAAAAT CAAGATGTTT ACAGTTGCAC AAAAAAATCC TCTAATCCGG TCTCATCATT ACCTTTTACC TAGGTTTTTG ACGAAATTTT GGTGAATGCT GCCGACAATC TTCAACGTCA CCCAGACTCC TGCACCCGTA TCGATGTGAT TATTGACCCT GGTTCTGACA ATCGAGATCC CTTTATACGT GTTCGAAATG ATGGCAAAGG TATCCCTGTG CAGGTGCACC AAAAGGAAGG CATGTACGTA CCAGAAATGC TTTTTGGGCA CTTGCTTACG GGTTCCAACT TTGATGACAA TGAAAAGCGT TTGACGGGTG GTACGTCGTC GTTTTGAATT TGCTTAGTAA TTTTCACTTG CCAATGTCTG TGCTGCTTTC GTTTTCTCAC CCGCTGCTTG ATTTCAACCC AAGGACGACA TGGGTTCGGT GCAAAGTTGA CCAACATCTT CTCCAAATCG TTCACGGTCG AAACTTTGGA CTCGCGAAAA CGGCTGAAAT ATAGACAGAC TTGGTCGTCC AATATGACCA ATGCCTCCTC GCCGGAAATT GTTGAGGTGC CGGCCGATGC TACTGACTAT ACATGCGTTT CCTTCGTTCC AGATGTAGCT CGCTTGACCA ATGATGCATC AGCACTTTCA ATATCTAAGG AAGACTATGC CTTGATGTGT CGTCGAGTTT TGGATGTGGC GGGCTGCTCG GCAGGAAGGT TGAGGGTGAC ACTGAACGGA GTTGACGTTA CGATGGGATC CTTTCCCGAC TATGCTCGGC TGTATCGCAA GCAAGATTCA CTGCCTGTTT GTTTTGATGC CATCAATTCA CGATGGACTG TGGGTGTGGG CCTCTCGGAG ACTGGCTCCT TCGAAATGGT GTCGTTTGTC AACGGTATGG CGACAAGTCG AGGCGGGACG CACGTGAATT CTATACTACA ACAAGTCATA AAGAAAATTC AAGAGAAGAT CGAGAAGACT GATCCTGAAC TGGTTCAAAT GGCTTCCCAA GGCGTGATCC GAAGGCACCT ATTTGTGTCA GTAGCCGCGC TCATTGAAAA CCCGACTTTC GACTCCCAAA TGAAAGAGTG CCTAACGTCC AGTCCAGCTG ATTTCGGGAG CTCCTTCAGC CTGAGCGAAA GTTTTATCAA GTCTGTTTTA CAGAGCGAAG AGGAAGGAGG TCCAGGTATT GTAGAAGAGA TTCGTAGGGC AGCTCAGAGT AGACAGCAAG CAACGCTGTT GAAAGTGATC GGCGGGAAGA CGAGCAAGCG CCAGCTCTTG TCGATTCCAA AACTCGAGGA TGCCCACCAT GCTGGGACAA AAAGCGGATC GGAATGTACC TTGATTCTTA CCGAAGGTGA CTCTGCGAAA GCATTAGCGG TTGCCGGGCT TGAAGTCATT GGTCGAGCCC GGTATGGAGT TTTTCCTCTG CGTGGCAAAC TGCTCAACGT ACGTGAAGCA GCAGTATCCC AAATGGCAAA AAATGCTGAA GTAACGGCGC TATGTGCAAT AATCGGACTG GATTTCGACA AGACGTACGA GACAATTGAA GAACGAAGAA AATTGAGATA CGGTAAGAAG CACGGTGTTT TTATTCGTGT TCGGATGCTA AGGGTGATGC TCACACTTCA TATTCCTCCA GGGAAGGTTA TGCTGATGAC AGATCAGGAT ACAGGTAAGA TATATCTAAA GCAATCCGTG TGTAGCACTT GAAGAGGAGA AACACTAACA CTTTGAGTTT TCAGACGGCT CTCATATCAA AGGATTGGTC ATGAACTTTT TCCGATATTT TTGGCCGAAT TTGTTGAAAC CGCCCGTGGA CTTGCAAGTG AACGATGAGG ACGAAACCCC ACCGTTTCTC TCGTCATTTG TTACTCCTCT TCTCAAAGCG AGTAAGAAGT CTTCCAAAAC GGGTACTCTC TCGTTTTACT CGATGGCGGA ATACAAAAAA TGGCGAGGAT CGATTGAAGC AGAGGATTTC AAAAAGTGGA CCGTCAAATA CTACAAAGGT TTGGGAACAA GCACACCAGC TGAGGCCAGA GAGTACTTTT CTGCCTTCAA TGATCACTTC CGTCCGTTTT TGTGGAATTC AGACATCGAT GGAGAACTCC TTGACATGGT TTTTGATGGC GAGCGTGCGG CAGATCGGCG AGCTTGGATT CTAGATGTCT ATGACGAGAC TTCCAATTTG TTGAACGATC CGTCTGCGGG AAATAATGTA AGCTATGAAG ACTTCATCAA CAAAGAAATG ATTCATTTTT CAAATGCGGA CAACATTCGC AGCATACCTA GTGCCATTGA TGGGCTAAAA CCATCGCAGC GGAAGGTGCT GTACGCATGC TTTAAGAGAA AACTGAAGTC AGAGATAAAA GTCGCGCAGC TCACAGGATA TTGCGCGGAG CATACAGCAT ACCATCACGG AGAGGCGTCC TTGCAGTCTA CAAGTAAGTA ATTATGAGCG TTCGAATTGG GTTCAATCAT AATGAGTTCT GAACATATTC TTGTCGTCTC AGTAATTGGG ATGGCTCAGG ACTTTGTGGG CTCAAACAAC ATTAATCTTC TTGTGCCCTC CGGTCAATTC GGGACACGGA TTATGGGAGG TGCTGATGCA GCGTCGCCCC GTTACATTTA TACGTATTTA GCACCAATCG CACGCTCTCT ATTCCCGGAA GCCGATGATA CTTTGCTATC CTATCTAGAA GACGATGGGC AGCAGATTGA GCCGGAATTT TACTGCCCAA TTATTCCGCT TTTGGTGGTC AATGGTTGTC AAGGGATTGG TACTGGATGG AGCACATTTA TTCCCCCGCA CAGCGCCAAC GACGTGCTGG AATACATTCT TGCTAAGCTC GATGGGGCTG AGAGATTGCC GAAGATCCAT CCGTTTGCGA GAGGCTTTCA AGGAAGAATA GAGCCTGACC CAAATGGGAA TGGATATGTA TCTTTCGGCC GCTCGTCTTG TATATCCGAC AGGACCATTC TTATCGATGA GCTACCTCTG CGATGCTGGA CAAACAAATA CAAAGGAATA TTATTGAAGA TGCGCGACCG AGGAGAAGTC ACGAGTTTCG TTGAGAATCA CAACACGTCG AAGGTCTCGT TTCTAGTTAC TCTCAAGTCT GCCCAATTGG CACGAATGAC ACAGGCTGGC CTAGAAAAGA GTTTCAAGTT GAAAACGAAT TTGCAGACGA CAAACATGCA TGCTTTTAAT AAAGACGGTC AAATCTGCAA ATTCGATACT GCGGAAAGCA TTGTAGAAGC ATTCTTCCCC GTCCGCATGC AGTTGTATCA GGATCGAATC GCCCTTTTGC AGTCTTTGTT GAACTACGAG GCTTCTATAC TTCGCAACAA AGCTGGCTTC ATTAAAGCTG TAACTAGCGG AGACATCGAT CTTACGAGCG GTCGCAGATC AAAGCAGGAA ACTTCGAAAA AGTTAAAGGA ACATGGTTTC CTGGACTCAG TCGAGTTGAA TGCAATTAAA AATGAAAACG TCCTCTGGAA GAGACGTCAA TTTGCCTCTG AAAAGGGGGA GGGCGGTTCC AACTTAGAAC CAATGAACTT TGATTATCTC TTGAACATGC CGCTGTCTAG CTTAACTAGC GAAAAGATAC GAGAGCTTGG TGAACATGCC GCGACGAAAG ACAAAGAGTT GGAGGAAATG AAATCTACTA CTCCGGTGGA CCTGTGGCGG CGCGATTTAC AAAAGCTTGC TCTCCTATTG TAA
|
Protein sequence | LEDQYSRKTP LEHVLLRPSM YVGSNEKAQP IACWVPDPIP TPPTEELLMK TTSVFDEILV NAADNLQRHP DSCTRIDVII DPGSDNRDPF IRVRNDGKGI PVQVHQKEGM YVPEMLFGHL LTGSNFDDNE KRLTGGRHGF GAKLTNIFSK SFTVETLDSR KRLKYRQTWS SNMTNASSPE IVEVPADATD YTCVSFVPDV ARLTNDASAL SISKEDYALM CRRVLDVAGC SAGRLRVTLN GVDVTMGSFP DYARLYRKQD SLPVCFDAIN SRWTVGVGLS ETGSFEMVSF VNGMATSRGG THVNSILQQV IKKIQEKIEK TDPELVQMAS QGVIRRHLFV SVAALIENPT FDSQMKECLT SSPADFGSSF SLSESFIKSV LQSEEEGGPG IVEEIRRAAQ SRQQATLLKV IGGKTSKRQL LSIPKLEDAH HAGTKSGSEC TLILTEGDSA KALAVAGLEV IGRARYGVFP LRGKLLNVRE AAVSQMAKNA EVTALCAIIG LDFDKTYETI EERRKLRYGK VMLMTDQDTD GSHIKGLVMN FFRYFWPNLL KPPVDLQVND EDETPPFLSS FVTPLLKASK KSSKTGTLSF YSMAEYKKWR GSIEAEDFKK WTVKYYKGLG TSTPAEAREY FSAFNDHFRP FLWNSDIDGE LLDMVFDGER AADRRAWILD VYDETSNLLN DPSAGNNVSY EDFINKEMIH FSNADNIRSI PSAIDGLKPS QRKVLYACFK RKLKSEIKVA QLTGYCAEHT AYHHGEASLQ STIIGMAQDF VGSNNINLLV PSGQFGTRIM GGADAASPRY IYTYLAPIAR SLFPEADDTL LSYLEDDGQQ IEPEFYCPII PLLVVNGCQG IGTGWSTFIP PHSANDVLEY ILAKLDGAER LPKIHPFARG FQGRIEPDPN GNGYVSFGRS SCISDRTILI DELPLRCWTN KYKGILLKMR DRGEVTSFVE NHNTSKVSFL VTLKSAQLAR MTQAGLEKSF KLKTNLQTTN MHAFNKDGQI CKFDTAESIV EAFFPVRMQL YQDRIALLQS LLNYEASILR NKAGFIKAVT SGDIDLTSGR RSKQETSKKL KEHGFLDSVE LNAIKNENVL WKRRQFASEK GEGGSNLEPM NFDYLLNMPL SSLTSEKIRE LGEHAATKDK ELEEMKSTTP VDLWRRDLQK LALLL
|
| |