Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49942 |
Symbol | |
ID | 7198543 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 366871 |
End bp | 368549 |
Gene Length | 1679 bp |
Protein Length | 446 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184790 |
Protein GI | 219129215 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATTCGGGT CGGACAAGGG GAGAAGATGA CCCCCGTTGG CTTGGGTATT TTGTACATAT ATACACACAC TATACACACA CACACACGTG TATACACACT ACACACACGC CATGCGTGAG GATCCTTTAG AGTCGAGCAC ACGGTCGGTG GTTTCGGTAC GTTCGCGGGA ACGACGGAAA CCGGGGTTGA CCAACGCTCG GAATCGGACG TCCCGTTGCA TGACGGGTCT CCTCGTCGGA CTCGCCGGAC TAATGCTTCT TTCTCTGACA CAAATGGTAC GCTTACACCG GGAACCGTTG GTAGGCATGG GGGGCCTTTC CTCTCTGCCT TGGAGCAACG GTGAATGGAA GCTAGGGCAA CAACAACAAC AACAACAACA ACAACAACAA CGCCATCGAA TCGGCGACGA CAATCACACG GCACACAACA ACGACCAAAC GGCACTGCGC CTCCCGCGTC TGGCCTCTCC CAACACGACC ACGAATCACA GCGATACGAT GATCGTTCTG GATCAGCAGC TATCCTTTCC GGAAGACAAG CGTGTCATTC TGGAGGTACT CCGCGCGGCT GGCGTCACCT CGATCCAGGC CCCGTCCTGG GAACTCCTCC CCGATACCCG CACACTCACC ACACTCTACG GTCCCATTGA TCAACCCATT GTCCTAGGTC TTGAAACATG TCAAGCCTTC CGCGACGCCA TACCCCAGGC GGAACGGTAC GTCGCACCCG CCGGAATGTT CAACACCGGA ACCAACGCCT TGGAACGACA CTTGACCAAC AACGTTTTCG GTGTACAAAA AGCCTGGCAA GTTCCGTGGG GCAAACACCG GACCGAAAGC AAACGATTGC ATCACGCCGC CGTGTCCCTG GAAGGAATCA ATCAAACGGC AGTCTTGCCT ATTGTCGTTA TACGGGATCC ATTCTTTTGG ATGCAAAGGT GCGTACAAGT GGTCACGGGG AATCGGCCCG GGTGCGAGGC GACTACCTGT GCACCGCCGA CATGCAAAGC GCGTTCCTCA CACACAGCCA TTATGTTCAA ACATTCGGAA TGGACATACT ACTTGACAAA TTAGCATGGT ACGTGGTTAA TGTCGTCTTG GCCATGGTCT GCTCCATTCC CATACCATAC CGCCCACACG AATCCTTCTG GTTTACTCTC CCGGTACAGT GCAAACATCC CTACGCCGCC AAATGGGCCC GCGGCGCGCA ACGCTGTCCG GGTTTAAAAA CGCTGCCTCG AGATTTCAGA CGCTTTAGCG CCGATAAGCT GCCCCGCAAT CAAACCAGCT TTCGCGTCAA GGTCATTTTT AGCGCCACCG ATGTACAATT TTGGGATTCG CTCGTACATT TATGGAGTCG ATGGTACCGC GATTACTGGG AGGCACCCTA CCCACGTTTG ATGATCCGTT TCGAAGACTT GCTGCTGCAT TCCGACGACA TTGTCCAAAG CATTGCGGAA TGCGTGGGCG GTACCGCCAA CCGCAACCAC GTGGTGGAAA CGGGGACGAG CAAGAACCAC GGTAGCGGCG CGGACTTTGT CAAGGCCGTG ATCAAAACGG GAGACTTGGG AATGCGGCTC AAACACTTGA CCCAACCCGA TCTTCACTAC GCGACCGAAC ACTTGGATGC CGAACTGATG CAAGCATTTC GATACAATCT TTCCACAGTC AATAGATAG
|
Protein sequence | MREDPLESST RSVVSVRSRE RRKPGLTNAR NRTSRCMTGL LVGLAGLMLL SLTQMVRLHR EPLVGMGGLS SLPWSNGEWK LGQQQQQQQQ QQQRHRIGDD NHTAHNNDQT ALRLPRLASP NTTTNHSDTM IVLDQQLSFP EDKRVILEVL RAAGVTSIQA PSWELLPDTR TLTTLYGPID QPIVLGLETC QAFRDAIPQA ERYVAPAGMF NTGTNALERH LTNNVFGVQK AWQVPWGKHR TESKRLHHAA VSLEGINQTA VLPIVVIRDP FFWMQSMCKH PYAAKWARGA QRCPGLKTLP RDFRRFSADK LPRNQTSFRV KVIFSATDVQ FWDSLVHLWS RWYRDYWEAP YPRLMIRFED LLLHSDDIVQ SIAECVGGTA NRNHVVETGT SKNHGSGADF VKAVIKTGDL GMRLKHLTQP DLHYATEHLD AELMQAFRYN LSTVNR
|
| |