Gene Information |
Locus tag | PHATRDRAFT_50796 |
Symbol | |
ID | 7197659 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 461132 |
End bp | 463168 |
Gene Length | 2037 bp |
Protein Length | 457 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178515 |
Protein GI | 219115439 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
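The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the protein length excludes the stop codon. The function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 461132
END_BP = 463168
PROTEIN_LENGTH_AA = 457

gene_length = feature_length(START_BP, END_BP)

# Assumption: one stop codon follows the 457 coding codons.
coding_bp = (PROTEIN_LENGTH_AA + 1) * 3

# Whatever remains is non-coding within the gene span
# (the record marks the gene as spliced, so this includes intron sequence).
non_coding_bp = gene_length - coding_bp

print(gene_length, coding_bp, non_coding_bp)
```

Under these assumptions the computed gene length agrees with the "Gene Length" field, and the difference between the gene span and the coding length is consistent with the record describing the gene as spliced.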
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTAATCACGA ACTTGCCCGT TTGTCGCTTT ACCGTCTGGA GTAATTTCGA CGTTCTTGGA AATTCCGAAC TCCTCCTTCC GCTCTATCGT TTCGTGGTTG CGAACGTCGC GAGGGCCAAG CAACCCACAG AACAGCACGG GGAACATTAG GTATATCACT GTCAGTCCAC CTTTCCGGTC TTGTCCTGCC TTTTGCCCGT GTGCATCCTC CGCTCGTGTT TGGTGTGAGG GCTTCCGGTT CTATCTCCAA CGGGTCGGCT TTCGTTTCGA CGTCCATCCG AAAGAGTATA ACAGTGCCGA GCGCAAACCG CATCCTTCGC CCAAAGCGTT TTGATTGATT GTGATAGCCA CTTGCGCGAG ACAAAAACAG ACCGCATTTC TCGTCGATCG GTGTGTTTCC GTTTGCAAGC CTTGTTTATA GGGTAGCGTC CACTTTCAGC AATCCAAAAC CCCGAGTACA GAAGGCCGTA GCCGTAGCGT CCCCTTCCGT AGGACTTTGA TCCAGCAGCA CAGAACTACC TCACTATTCA CCATGGGCGA CAATAAAGAT GAACTCGACC AGCAGATTGA AATGTTCAAG GTCAAAAAGC TCATGAAGAA TTTAGAAGCC GCACGCGGAA ACGGAACGTC CATGATTTCC TTGATTATTC CCCCGGGAGA CCAGATTTCC CGGGTGAACA AGATGCTTTC CGACGAGTAC GGGACGGCCT CCAACATCAA GTCGCGAGTC AATCGGTTGT CGGTACTCTC CGCCATTACC AGCACACAGC AGCGTCTGAA GCTTTACAAC AAGTGTCCCA AAAATGGTGC GTTGGTGCGG TGGAGGACAG CGGAACGCGG CGTGCGGAAA GCAAAAGTGT GCTCACATCC GTGTCTTCTT TTCTGATTTG GAACAGGACT CGTAATTTAC TGTGGAACAG TGATTACTGA GGATGGGAAA GAACGTAAGG TCAATATAGA CTTTGAACCT TTTAAACCCA TCAACACCTC ACTTTACCTC TGTGACAACA AGTTTCACAC GGAGGACCTT CAGGAGTTGT TGATGGACGA CGAAGCATTT GGCTTTCTTG TAATGGACGG TAACGGCTGT CTTTACGGAA CCGTACAGGG CTCGAATCGT GAAATTCTAC ACAAGTTTAG TGTAGATTTG CCCAAAAAGC ACGGACGCGG TGGACAGTCC GCCTTGCGTT TTGCTCGTTT GCGTCTGGAA AAGCGACACA ACTACGTCCG CAAAGTGGCG GAACTAGCCA CACAACTTTT TGTAACGGAT GGACAGAGAC CTAACGTACA AGGTCTCGTT TTAGCGGGTT CGGCTGATTT CAAGAGTGAG CTGATGCGCA GTGACTTGTT CGATCAGCGG CTATCCAAGA TCGTAGTCAA GATGGTGGAT GTTTCGTACG GCGGTGAGCA AGGCTTTAAC CAAGCTATTG AAATGTCCGC CGACGCGCTG GCCAACGTCA AGCTAATGAA GGAAAAGAAA CTTTTACAAA AGTATATGGA CGAAATCAGC CAGGATACCG GAAAGTTTTG TTTCATGGTA GAGGACACAC TCAAGGCCTT GGATCTTGGT GCTGTGGAAG ACCTGATTAT TTGGGACAAC TTAGAGGTGA TGCGATACGT TCTGCGGAAT CAGACAACGG GCGAAGAGAA GGTGATCCAT TTGACCCAAG AGCAAGAGAG TGACGATTCC CACTTTCGAG ATTCCGAAAC AGGAACGGAA TTGGAGACGG TGGAAAAGGA AACGTTTGTG GAGTGGATGG CGAACAACTA TAAGAGCTTT GGTTGTAATC TGGAATTCGT 
GACTGACCGT TCCGGCGAAG GAACTCAATT TGTGAAAGGC TTTGGAGGGA TCGGTGGGAT TCTGCGATGG AAGGTTGATT TCGTTGAGCT GAACAACTTT GAAGAGGCTA ACCAGCTCGA CGCGGAATCC AACGAAGACG ATGCTTCCGA CCAGGAATCT GAATATGGCT TTGACGACGG TGACTTTGGA TTCTAAGCTT TTTCGCGCGT ATTTAAGTTA ACTTCGTCTT TGTAGCA
|
Protein sequence | MGDNKDELDQ QIEMFKVKKL MKNLEAARGN GTSMISLIIP PGDQISRVNK MLSDEYGTAS NIKSRVNRLS VLSAITSTQQ RLKLYNKCPK NGLVIYCGTV ITEDGKERKV NIDFEPFKPI NTSLYLCDNK FHTEDLQELL MDDEAFGFLV MDGNGCLYGT VQGSNREILH KFSVDLPKKH GRGGQSALRF ARLRLEKRHN YVRKVAELAT QLFVTDGQRP NVQGLVLAGS ADFKSELMRS DLFDQRLSKI VVKMVDVSYG GEQGFNQAIE MSADALANVK LMKEKKLLQK YMDEISQDTG KFCFMVEDTL KALDLGAVED LIIWDNLEVM RYVLRNQTTG EEKVIHLTQE QESDDSHFRD SETGTELETV EKETFVEWMA NNYKSFGCNL EFVTDRSGEG TQFVKGFGGI GGILRWKVDF VELNNFEEAN QLDAESNEDD ASDQESEYGF DDGDFGF
|
| |
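As a quick consistency check between the two sequences above, the first codons after the start codon in the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, assuming the standard genetic code (the "Translation table" field in the record is blank) and using a deliberately partial codon table that covers only the codons in this fragment. The names `PARTIAL_CODON_TABLE` and `translate` are hypothetical.

```python
# Partial codon table (standard genetic code assumed); only the codons
# that occur in the fragment below are included.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "GGC": "G", "GAC": "D", "AAT": "N", "AAA": "K",
    "GAT": "D", "GAA": "E", "CTC": "L", "CAG": "Q",
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; raises KeyError on unknown codons."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(PARTIAL_CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases of the coding region, copied from the gene sequence above
# (starting at the ATG in "...CCATGGGCGA CAATAAAGAT GAACTCGACC AG...").
cds_start = "ATGGGCGA CAATAAAGAT GAACTCGACC AG"

# First 10 residues of the protein sequence above.
protein_start = "MGDNKDELDQ"

peptide = translate(cds_start)
print(peptide, peptide == protein_start)
```

The fragment translates to the same ten residues that open the protein sequence, which suggests the gene sequence is displayed in the coding orientation even though the record lists the gene on the minus strand. A full check would need a complete codon table and the exon boundaries, which the record does not list.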