Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34147 |
Symbol | |
ID | 7197909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 1211776 |
End bp | 1213551 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178662 |
Protein GI | 219115733 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTTC CTCCGCTGTC ATCTTTGTGG ACCACCAACA CTCTCTTTGG AGAAACGCAA CAGCTCAGTG CTCGTCGGCA GTATCAACCA CAGCATCAAG ATCGAGACGA CAATCACGAT GGTCTTGATT TCTCACGAAA TGAACGTCTT CGGGACGATC CGGATGTCCA CATTGGCAAG CGACACTCCT TGAATAAAAT TAATCAATTT CTCTTCGGCT TCATGTACGA AGACGAGAAC GAGGAAGACG ACGATGATGT TTTGTACCGA TCTACGGTAG AACCCTCGAC ACCACTATGG AACCGACTCA ACCTGTCATT GTTCGGTATA CTCTGGTTGC TTTCTACGGC TGAAATTATT CCCGTTACGT TGGTGGTTTC CATGTGCGAT AATTTTGTGG AGGTATCGGA TGGAACGTGG ACCAGCGACG CTACGACGAT TCCGTCAACG ACTGCCACTT CCGCTGCAGC ACGCTTGGCA GCGGCTGCCG TTTTGGGCAC GTCCATGGGA AAACTGATCA ATGGACCCTT GACGGACGTG TGGGGAGCCC GCCGATCTTT ATGTGTTGCT TCCGGGGCCA TGGCGGTGGG TCTCGTCCTG CTGGCTACGG CAATGACGCT CGACACTGCC GCGATGGCTT GTTTTCTGGT GGAATACGCT TACGCCATAA CGTGGCCGTG CTGTGTGGTG ATTTTGGCGA CGCACTATCG CGGAAACGCA TCCGGTATGT ACGAGGGAGG CATCTACGTC ACATCACTCG CATCCCGTTT GGGAGCCCTT ACTGGTATTC CCCTCGCATC CGTGTTGTTG CGGCATACAA CAGAAGCAAC AACAACAACA ACAACAATGG TTGGATCCTC TTGGCGTTTG GTGGCTCTCT TGGCTGCATG GATTGCACTG GTTGCCTCGT CGGTGGTATA CTTTTATGTT GCGGATACAC CATCCCGAAA ACACCAACCA CAGAATCCGC TAGACCCAGT GTGGCTCAAC AAGTGGTTTC CCTCTTACGC CAACGGAAGT CGTCCTTTGA CGCTTTCGAC CACCTTACGA CTGATACCCT TTGTTGTACG CTACAACTTG GTTCCTTCCA TGAAGCATGT ACTGCGGAGT GGTACATTCT GGATGGTGGC CTTGGCGCAC ACGGGATCGA GTATGGTGCG CACGTCGGAA CGATTGATCG GCACATACTT GTCCGATACG AGTAACGATA GCCTCTCACA CAACCGTGCC GGCGGCCTTG CCGTCTTTTT GTCGCTGGGG ACCGTCATAG GTTTGCTCGT GGCTGGGAAT CTCTTTGCTA CTTTGCACGA ACGCGCGCGC AAACGACTCG TCACTCGACT GTATATTCTC ACAATTAGTG CCTGCTACGT ACTAGCCTTG TTGGCCATTC CGGCCGTGCG GAATACGTGG CAAGCACCGG AGCTGGTCAC GACCTTTCAA GTCATGGCCG TGGCTGTGGC CGGTTTCGGA ATTGCCGTAC AATTTTATCA TATTCCCGGT TTGGTCGGTG CGACCTTTGG CTGTGACAAA GGCTTGTTCG CGGCCTACGT CGACGGGGTT GCCTATTTGT TTGCTTCGTA CGTATGGCGC CTAGTGATTG GCAAGGCTGT CCAGGATCCC GTCGATGATG ACGGACTCGG GTGGGCCTAC GGATGGGCGG CGGTAGCCTT GTTGCTCATC CTTTCGGCGG TGCTCATGGT AGAGTTCATG GAACACTACT TTTGTCGATC ACTTCATCAA CAGGGCGGGA CTTACGAAAC AATTATCTTT GCGTAG
|
Protein sequence | MTLPPLSSLW TTNTLFGETQ QLSARRQYQP QHQDRDDNHD GLDFSRNERL RDDPDVHIGK RHSLNKINQF LFGFMYEDEN EEDDDDVLYR STVEPSTPLW NRLNLSLFGI LWLLSTAEII PVTLVVSMCD NFVEVSDGTW TSDATTIPST TATSAAARLA AAAVLGTSMG KLINGPLTDV WGARRSLCVA SGAMAVGLVL LATAMTLDTA AMACFLVEYA YAITWPCCVV ILATHYRGNA SGMYEGGIYV TSLASRLGAL TGIPLASVLL RHTTEATTTT TTMVGSSWRL VALLAAWIAL VASSVVYFYV ADTPSRKHQP QNPLDPVWLN KWFPSYANGS RPLTLSTTLR LIPFVVRYNL VPSMKHVLRS GTFWMVALAH TGSSMVRTSE RLIGTYLSDT SNDSLSHNRA GGLAVFLSLG TVIGLLVAGN LFATLHERAR KRLVTRLYIL TISACYVLAL LAIPAVRNTW QAPELVTTFQ VMAVAVAGFG IAVQFYHIPG LVGATFGCDK GLFAAYVDGV AYLFASYVWR LVIGKAVQDP VDDDGLGWAY GWAAVALLLI LSAVLMVEFM EHYFCRSLHQ QGGTYETIIF A
|
| |