Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32453 |
Symbol | |
ID | 7196983 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2507457 |
End bp | 2510862 |
Gene Length | 3406 bp |
Protein Length | 1120 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176992 |
Protein GI | 219110481 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCCG CCACCCGGCA AATGACGAGT GCAGCCGTCT ATGCCCACCT TTTGGACAAC GTACTTCTTC CCCAAGGGCA TCCTATCCGT CTCAGTTTTG AGCAGCAAGG ATATGAATCG GCTGATGATC TCCTGTGTAT TTTTGAGAAT GAACTTGAGT CTCTTGGATA CACTCCTTCT GTCCTTCCCG ACGGCCCGGA AAACCCGCCT ACCATTCCCC TTCTCATGGC GCACCGACAG ATCATACGTC ATTTCTTGCG CTGGCAGGCA TCTTTGGAAC AACAAAAGGG GACACCTTTG AAGAACTCCG AGCTTGTTGC ACTTAACAAT GAAGATTTTG TCATTTACCG TCGCTCAGCC CTTGGTCAAG TCTCGACAGC AACTGCACCG GTTAATGCTT CCCCAACTGT TCAGAGCCCC ATAGGAAAGA CACATTTGGC TGTTGAGGAC TTCAAGCGTG GGATCAAACG TGACAAAACT CACTATCCCG TGCTTAAAGA TGATCGGTAC TGGGACAACT TTTATCGGTC GTTTGTTGTT ACTGCCGTAA CACATAACGT TGACAAAGTT CTAGATCCGA CGTACATCCC TACCAATCCT TTGGAGAAAT CCCTTTTTGA AGAACAGAAC AAGTTTGTAT ATTCTGCTCT AGAGCATACT CTCCAGACGG ACATGGGCAA GAACATTGTA CGCGAGCATA GTTTCGACTT CAATGCCCAG GAAGTTTTCC GTAAGGTTGT GAAACACTAC ACAGAGTCCG CTAGCGCGAA GATTAGTTCG TCTACTACCC TGGGATACCT TACAACTGCA AAGTACGGAT CGTCATGGAC TGGCACAGCA GAAGGTTTTA TTCTTCACTG GAAAAATCAC TTGCGCATCT ACAACGACAC TGTTCCTGCT GGTGAACAGC TCCCTCAGCA ACTATGCCTT AGTCTTTTGG AGAATGCTGT TCATGATGTA CCTGAGCTTC GACAGGTAAA AATCACTGCA ACTCTTGACT TAGCAAAGGG AGGTAATCCT ATTAGCTATG ATGGTTATCT CAGTCTACTA CTCGCATCGG CATCACTCTA CGACAACGGC AATAATCTAT CTAATTCTCG TAGTGGCAAG AACAAGCGCA ACATCTATGC TAATGAACTA GAGTACAATC CGATGGATTT TGAGAGTAAA CCGGATGTAG ACTATGATAT AGATGTGTCA CCGACCGCAA TCTACAAAGC CAATGCTCAT GCCCGTAACA GCAGTTCCCG GAGTCGTACT CCGGCAGCTA ATCGCGAGCG ACCTTACATC CCTCGTGAAA TGTGGAACCT ACTCTTCGAC GATGCCAAAG CCATCCTCCA AGGCTTAAAA GCCCCCGGGA AGCAGGCCCC ATTGAATAAT AGTTCGCCAC ACCAATCGTT GCAGACCAAT ACGCACGATA CCATTGGCGC GGAACAAATC ACAACGGACA CCTTCCATGA TTGCGCACCC GAAACTGAAT TGCTTGCCCA CCTGACTGAG CGTGTTAGTC GCATGAGCGA CGGCGACATA CGTAACGTTC TTGCCGCATC TCGTGATGGT CCCCCCTATG ATGAGCCCAA ACCACTGCAA TCTAACGTAC TTCAATATCA AGTGTCTCGT CACAACGTCA TTGAAACTAC GGCAGCCCTC GTCGACCGTG GAGCCAATGG AGGTCTTGCC GGCAGTGATG TCATGGTCTT GCACAAAACA GGTCGTTCTG CAACCATCAC AGGCATCAAC GATCATACCT TGTCCGATTT GGACATTGTC ACCGCTGCTG GCTACACTGA ATCCCAAAAT GGCCCCATCA TTCTCATTAT GAACCAATAC GCCCATTTGG GACAGGGTAA AACTATCCAC TCCAGTGCAC AGCTTGAACA CTATCGCAAC CATGTCGAAG ACCGTTCCCG TACCGTAGGA GTTAACCAGC GAATTGTAAC ATTGGACGAC TACATCATCC CATTGCACAT TCGACAAGGA CTCGCGTATA TGGATATGCG GCGCCCTACC GACAAGGAAC TTGCGTCCCT TCCACACGTT GTCCTAACCT CCGACGTCGA CTGGGATCCC TCCGTACTTG ACCACGAAAT TGATCTCGCG ACCTCTTGGT ATGATGACAT ATACGATTTG CCTCAATCAC CTTACGTCGA ACCATGTTTT GACCATACAG GCAAATACCT CCATCGTCAC ATTTCCTTTT GCAACCATCG CGATGACGCC GTTGACCGTG TCTTATATTG CCAACAGCAC CTCGTCACGA AAAATGTGCA AGATTATGAG GCCCTTCGTC CGTGTTTTGG ATGGGTCTCT GCTGAAACCG TTCGCAAGAC CATCATGGCG ACCACGCAGC ATGCACGCGA AGTATATAAC GCTCCGTTAC GCAAACATTT TAAGTCTCGC TTTCCCGCTC TAAATGTACA CCGTCGTAAT GAACCAGTTG CTACCGATAC CATTTGGTCC GACACCCCTG CTGTCGATAA TGGTGCTAAA TTTGCACAAC TTTTCGTTGG TCGACGGTCC CTTGTCACCG ACGCTTACCC CATGAAAACT GATAAAGAGT TTGTCAATAC CCTTGAGGAC CATATCCGTT ACCGGGGTGC CATGGACAAA TTGATTAGCA ATCGTGCCCA GGTTGAAATC AGCAAAAAGG TCACCGATAT TACACGCGCA TATAATATCG ACCAGTGGCA AAGTGAACCA AACCATCAAC ACCAAAACTT TGCCGAACGT CGTATCGCCA CTATCGAGGC TAATACCAAC AACATTCTCA ATCTTTCCGG TGCCCCTGAT TCCGCCTGGT TACTTTGCGT GACATATGTT TGTTATGTTT TCAACCATTT GGCACATGAT TCACTAGATA ACCGCACTCC CCTTGAAGTC CCCACCGGCT CCACGCCTGA TATCAGTGTT CTCCTTCAGT TTCATTTTTG GGAACCGGTC TATTATAAGC TCGAAAATGC GACATTTCCT TCTGGTGGTA CTGAACAACA AGGACGTTTT GTTGGCATCG CCGACTCCGT CGGCGACGCT CTCACTTATA AGATCCTTAC CCACACCACC AATCGCATTC TTCATTGCTC TAGTGTCCGT TCTGCGACCA TTCCCGGACA AACCAACCTA CGCCTTACGC CACAGGATGG GGAGAGTGGT CCTAAACCCA TCAACTTTAT CAAGTCGCGT AGAACCGAAA ACAAAAATTC CTATGCCATT AAGGAGTTGC CTGGTTTCAC ACCTGATGAC CTTATAGGTT GTACGTTCCT CACCGACACT CGGGATGATG GGGAGCGTTT GAAGGCACGA ATCACGCGGA AAATATTGGA CCCAGACAAG CCCTCGGATG TAAAGGTCCT TGTCGAAATC AATGATGGTG AATATGACGA GATTCTAGCA TACAACGAAA TTCTAG
|
Protein sequence | MVPATRQMTS AAVYAHLLDN VLLPQGHPIR LSFEQQGYES ADDLLCIFEN ELESLGYTPS VLPDGPENPP TIPLLMAHRQ IIRHFLRWQA SLEQQKGTPL KNSELVALNN EDFVIYRRSA LGQVSTATAP VNASPTVQSP IGKTHLAVED FKRGIKRDKT HYPVLKDDRY WDNFYRSFVV TAVTHNVDKV LDPTYIPTNP LEKSLFEEQN KFVYSALEHT LQTDMGKNIV REHSFDFNAQ EVFRKVVKHY TESASAKISS STTLGYLTTA KYGSSWTGTA EGFILHWKNH LRIYNDTVPA GEQLPQQLCL SLLENAVHDV PELRQVKITA TLDLAKGGNP ISYDGYLSLL LASASLYDNG NNLSNSRSGK NKRNIYANEL EYNPMDFESK PDVDYDIDVS PTAIYKANAH ARNSSSRSRT PAANRERPYI PREMWNLLFD DAKAILQGLK APGKQAPLNN SSPHQSLQTN THDTIGAEQI TTDTFHDCAP ETELLAHLTE RVSRMSDGDI RNVLAASRDG PPYDEPKPLQ SNVLQYQVSR HNVIETTAAL VDRGANGGLA GSDVMVLHKT GRSATITGIN DHTLSDLDIV TAAGYTESQN GPIILIMNQY AHLGQGKTIH SSAQLEHYRN HVEDRSRTVG VNQRIVTLDD YIIPLHIRQG LAYMDMRRPT DKELASLPHV VLTSDVDWDP SVLDHEIDLA TSWYDDIYDL PQSPYVEPCF DHTGKYLHRH ISFCNHRDDA VDRVLYCQQH LVTKNVQDYE ALRPCFGWVS AETVRKTIMA TTQHAREVYN APLRKHFKSR FPALNVHRRN EPVATDTIWS DTPAVDNGAK FAQLFVGRRS LVTDAYPMKT DKEFVNTLED HIRYRGAMDK LISNRAQVEI SKKVTDITRA YNIDQWQSEP NHQHQNFAER RIATIEANTN NILNLSGAPD SAWLLCVTYV CYVFNHLAHD SLDNRTPLEV PTGSTPDISV LLQFHFWEPV YYKLENATFP SGGTEQQGRF VGIADSVGDA LTYKILTHTT NRILHCSSVR SATIPGQTNL RLTPQDGESG PKPINFIKSR RTENKNSYAI KELPGFTPDD LIGCTFLTDT RDDGERLKAR ITRKILDPDK PSDVKHTTKF
|
| |