Gene Information |
Locus tag | PHATRDRAFT_981 |
Symbol | |
ID | 7199047 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 210152 |
End bp | 211938 |
Gene Length | 1787 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185150 |
Protein GI | 219129972 |
COG category | |
COG ID | |
TIGRFAM ID | |
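The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length of 1787 bp), and the function names are hypothetical, not part of any database API. It also compares the gene span with the nucleotides needed to encode the 467 aa protein. The difference is what one would expect to be non-coding (for example intronic) sequence in a spliced gene, assuming the span contains nothing but coding exons and introns.

```python
# Values copied from the Gene Information table above.
START_BP = 210152
END_BP = 211938
PROTEIN_LENGTH_AA = 467


def inclusive_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein, not counting a stop codon."""
    return 3 * protein_length_aa


gene_length = inclusive_span_length(START_BP, END_BP)
coding_length = coding_nucleotides(PROTEIN_LENGTH_AA)
# Remainder of the span not accounted for by codons (assumed intronic here).
non_coding_length = gene_length - coding_length

print(gene_length, coding_length, non_coding_length)  # 1787 1401 386
```

The 386 bp remainder agrees with the "Is gene spliced | Yes" field, although the record itself does not list intron coordinates.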
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0232666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCACTCGGCA CGGTGCAAGC CCAGGACGTT GGGAAGATTG TCCAAGTTTC GGGGACCGTC GTCCGCGCGA GTCCCGTACA AATGTACGAA TCGGCGAGGA CTTTCCAGTG TCGGGGATCG CAGGGTTGCC AGCGTATCGT CCGGGTCCAC GCCGATCTGG AACAACGACA CAACGCTTTG GTGACCCCGA CACGCTGTCC GCTACTCGGG AACGACAACG GCGTCCGCTG TCGGGGAACC AACCTACAGG TCGTCGACGG CGGCTCCGTT CATACCGATT ATCAAGAAGT CAAAATACAG GAAGCTGCAG CGCGACTCGG GGTCGGTCAC ATTCCGCGCA GTTTGTTGAT TAAACTGCAG CACGATTTGG TCGATCAGGT CCAGCCGGGT GACGAGGTCA TCGTCGTCGG TAGTCTACTC GCACAGTGGC ACCAACCTAA CGTACAGCCC GACGTGGAAT GTCACGTCGG AATCGCCATG ACGGCACATT CAATCCGGGT GGTGGCCGAA AAAAACTCGT CGGCGTGGAA GAATGCAGGA ACTGGGGGCG GCCACGGCGT CGGCGAACTG GACAAATTCC GCAAAGAGTT TGACACCTAC TGGAGTGAAC CGTCCCACCA GAAACAACCC GTTGCGGCCC GTGACTTTAT TTGCAAGGCC GTTTGTCCGA AACTGTATGG TCTGCAAGTG ATCAAGTTGG CCCTCCTACT GACCTTGACC GGGGGTGTTT CGTCCGACGC GTACCCTACC AGTTCTGCCG ACTCGTCGAC GGAGTCCGCG TCGACATCGA AACCCGCCCA AACGGACGCC CCTGAACCCT TTCAAACAAT CACCAATGAG GCCAGCCATC CATCCTCGCG ATCCGCCACT TATTTTGGCG GTGACACCGC CGCGGCATCC AAACCGCCGA AAAAGCAGGA GCAAGCAGTC CACACGAGAC GACGGGATCA ATCGCATTTG CTGCTGGTCG GGGATCCCGG TAAGTTCCGT CGTTAAAAGT TGAATTCTCT TTTGTGTGCC GCTTTATACT CACCCTGTCC TCTGACGGTA TGCTGGATCC ACACGTTACC GTCAAAGGGA CGGGCAAATC TCAGTTCCTC CGCTTTGCCG CTGCCCTGTG TCCCCGGTCG GTGCTAACGA CGGGTGTAGG GACCACATCG GCGGGTCTCA CATGCGCTGC CGTACGCGAA GGGAGTGGGA AGGAATTCTC CTTGGAAGCG GGAGCTCTGG TCTTGGCCGA CAAGGGCGTG TGCTGTATCG ACGAATTCGG TTGCATTCAG GAAAAAGATC GGACCACCAT CCACGAGGCC ATGGAACAGC AAACGCTATC CGTCGCCAAG GCCGGCATCG TGTGCAAGCT CAATTGCCGG GCGACCATTA TTGCCGTCAT GAATCCCCGC GACTGCCTAT ACGACAACCA CGCCAGTCTC TCCTACAACA CCGGTCTCGG TACGCCGCTC CTTTCCCGTT TCGATCTTAT TTTCAAGCTG GTCGACACGT CCGACGCCGA GCGCGACAGC AAGGTGACAA CGTATTTGTT GAATCGGGCC ATCCAAGGGG CAGGTTTCGA CGTTGCGGAT GCGGGCGACG ACGCCAATCT GGAAGAACCT TGGACGATGG AAAAGTTACG GGCCTATATT GCGGTCGTGA AAGAACGTTT CCTGCCGGTT ATTAGCGACG AGGCGGCCAC TTTACTGGAA CGGCACTACG AAAAGTGTCG GTCATCGCAA AGTAACACCA TTCCCGTGAC GGTTCGTTTT CTCGAGTCTC TCATTCGTTT GTCGCAA
|
Protein sequence | SLGTVQAQDV GKIVQVSGTV VRASPVQMYE SARTFQCRGS QGCQRIVRVH ADLEQRHNAL VTPTRCPLLG NDNGVRCRGT NLQVVDGGSV HTDYQEVKIQ EAAARLGVGH IPRSLLIKLQ HDLVDQVQPG DEVIVVGSLL AQWHQPNVQP DVECHVGIAM TAHSIRVVAE KNSSAWKNAG TGGGHGVGEL DKFRKEFDTY WSEPSHQKQP VAARDFICKA VCPKLYGLQV IKLALLLTLT GGEQAVHTRR RDQSHLLLVG DPGTGKSQFL RFAAALCPRS VLTTGVGTTS AGLTCAAVRE GSGKEFSLEA GALVLADKGV CCIDEFGCIQ EKDRTTIHEA MEQQTLSVAK AGIVCKLNCR ATIIAVMNPR DCLYDNHASL SYNTGLGTPL LSRFDVADAG DDANLEEPWT MEKLRAYIAV VKERFLPVIS DEAATLLERH YEKCRSSQSN TIPVTVRFLE SLIRLSQ