Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32173 |
Symbol | |
ID | 7196238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1878654 |
End bp | 1880381 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177392 |
Protein GI | 219111281 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.623051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGATT GTGGAGCTAC AGGGTCGGCT TTGGATCGAG TTGCACGATC GCTCTTGGGC GGTTCCCAAG CTTCCGACGG CAATATTTCA AAGCAAGTGG CTTCGCTAAT GGGACACGCT GCCGCAACGG GAAACATTGC GTTGTTGCAG ACGCCCATGT CACAGTCAGC ACCGCGACAA CTAACAATGC CTCAAGCATC TGTGGCTACG ACATTCACGG TGAAACAGCC TGCGCAAGCT TCGGCGCAGC AGCAACAACC GATAGCAATT GGGACTCCTT CGGAGCTGGT TTCTCAACAA CCGTCATTCT ACGCCACTAC GCAATATTCA CAGCATCAAA ATGATGCTAT GATTCTTCAA CATAATTTTC ATCAGCATCC AATTATGGCC ATGCAACAAC AGCAGCAGCA AGTTATGATG CAAATGGCCC ATCAACAGCA ACAGATGGCA CATATGGTAA AACAGCAACA GCTTATGATT CGTGCTCAAA ATCAGGTATC CGAACACCAT CATCATCGAA TAAGAGAGAG TAATGAACAA AAATCAGTCT CCTTGGACAA TTGGCAAAAT GGCGTTGACA AAGAATTTGG GCAGCAAGTG TCCGATATGG CTCCAGTAGG AATGCACGAG GGGGTGACAC AAGGCGTCTC GATGGAAGAA TTGGCGGCGG CATGGGCCGA GGTTGCTGAC GACAACATTA CGGTAGGACA CGATGGACTT GCACAAGGTG CTACAATAGA GGAACTGGCA GCTGCTTGGG CCCAAGCCGA GGCCGAATAC GACTCCGTTG ATGCAGCTAC CAATCTTTGG AACGACACCA ATGATCCCGT ATACGAGTTT CTCAACACGG AGAAACCGGA ACGCGTGGAT CAGCAGGACT GGATGGAGCA GGGATTGCGA GAATTCAATG CAGGAAATCT GAAAGAAGCT GTGAAAGCAT TTGAAATTGA ATTGCAATAC TGCAATGGAG ATAATTCAGC GGCGTGGAAA ATGCTGGGTC GCTGTCACGC CGAGAACGAT ATGGATAGAG AAGCCATTGT GTGCTTGGAA CAAGCTGTGG ATCGCGATCC GTACTCTCCC GAAGCGCTAC TATTGCTGGG AGTGAGCTAC GTCAACGAAC TTAATCATGC CAAAGCACTA AAGAATCTGA AAGCCTGGAT TACGCACAAT CCCAAATTTG CAGGTATGGA GCTGCAGGTA GATATGTACC GGGATTCATT GGTTGACCAA GAATCAGCAT TTGACGAAGT ACAACGACTG TTGGTACAAG CACTAGAGTA CGATCCGGTT GATGCATCAG ACGTATTGGA AGCCATGGGA GTTGTGTACA ACGTAAGTCG AGACTACGTA GCTGCTGGGG GTGCATTTCG CAGGGCCCTA GACGCTCGAC CAGACGATTA TCAGTTATGG AACAAACTTG GCGCAACGTT GGCAAATGGA AATCAAAGTC AAGAGGCTTT GCCGGCTTAC CATAAAGCAT TACAACTGAA ACCAAAGTAC GCAAGAGCGT GGTTAAACAT GGCTATCTCC CATTCCAATC TTCAAAACTA CGATGAGGCA GCTCGGTGTT ACCTTCAAAC GTTGAGCCTC AATCCTGCAG CGATTCATTG CTGGAGTTAT CTCCGTATTG CGTTGTCCTG CTCGGAGCGC TGGGATTTGA TTCAGCATGC GGCATCCCAA AATCTTGAGG CGTTCAAAGA TCTTTTTGAT TTCGTCATTT ATTCTTGA
|
Protein sequence | MADCGATGSA LDRVARSLLG GSQASDGNIS KQVASLMGHA AATGNIALLQ TPMSQSAPRQ LTMPQASVAT TFTVKQPAQA SAQQQQPIAI GTPSELVSQQ PSFYATTQYS QHQNDAMILQ HNFHQHPIMA MQQQQQQVMM QMAHQQQQMA HMVKQQQLMI RAQNQVSEHH HHRIRESNEQ KSVSLDNWQN GVDKEFGQQV SDMAPVGMHE GVTQGVSMEE LAAAWAEVAD DNITVGHDGL AQGATIEELA AAWAQAEAEY DSVDAATNLW NDTNDPVYEF LNTEKPERVD QQDWMEQGLR EFNAGNLKEA VKAFEIELQY CNGDNSAAWK MLGRCHAEND MDREAIVCLE QAVDRDPYSP EALLLLGVSY VNELNHAKAL KNLKAWITHN PKFAGMELQV DMYRDSLVDQ ESAFDEVQRL LVQALEYDPV DASDVLEAMG VVYNVSRDYV AAGGAFRRAL DARPDDYQLW NKLGATLANG NQSQEALPAY HKALQLKPKY ARAWLNMAIS HSNLQNYDEA ARCYLQTLSL NPAAIHCWSY LRIALSCSER WDLIQHAASQ NLEAFKDLFD FVIYS
|
| |