Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49047 |
Symbol | TDH |
ID | 7195421 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 426512 |
End bp | 428573 |
Gene Length | 2062 bp |
Protein Length | 606 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | l-threonine ammonia-lyase |
Protein accession | XP_002183744 |
Protein GI | 219127023 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.866709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCCTTTTCG TCCCAACAGT GCATTACAAC TATTTTGGTA TCGCAAACCT TTGCTCTGGT TTCGCAATCG ACGAAGAATT CTAGTGCGTC GCGTCCCATA GACTCTTATA ACGTATCCTC CTTTCACGCT CCCCCCGTAT GATATCCTCT AGTATTCGTC TCGTGACTCG CCAGGGAGGA GCAGCCCGTG GAGCCGTCAC TGGACCTCCC CGGCTCTTGG CGTGTCGCAT GATTCCCATG GCATCCCAGT CGACGACAGC CGAACAAGTC GAAGACGAAC CGGCGATAAA GTACGCCTTG AATCCCCAAG GCGAGGTCGT CAAGGATGCG GAACTGATTG ACTTTCCAGG AAGCAGCCCG GGCCAGAAAC CCATTCTCCT CAACGCCAAG GAACACGTGG TAGGATACCT GAGCAAGATT TTGAACGCAC GGGTGTACGA TGTGGCTATT GAATCCGAAC TACAACACGC CAAGAATCTC AGCGCGGTAC GTACGCCGTG TCGAAGTGAT CGATGATTGA TTCCTAGTCT TCTGGAAAAC GTTGGTCCGG AAGGCGAGAC ACTCACTGAT ATCCTTTCGT ACGAATTAGC ATTTGAAAAA TACCGTTCTA CTCAAACGTG AAGATACGCA ACCGGTATTT TCGTTCAAAA TTCGTGGCGC CTACAACAAA ATGTCGCACT TGAGTCGAGA CTTGTTGGAC AAGGGCGTGG TGTGTTGTTC CGCCGGCAAC CACGCCCAAG GTGTCGCGCT GTCGGCGAAA ATGTTGGGTT GTCGGGCCGT CATTGTCATG CCGCTGGCGA CACCAGCGAT CAAAGTCAAC GCGGTCCGGA TACACGGCGG GCCCAGTGTC GAAGTACGGC TTTTTGGTAA CAATTACGAC GAAGCCGCCA CGGAAGCCAA GCGGCTGCAG CTCGAAGACG GCATGACCTT TGTGCATCCG TTCGACGATC CACTAGTTAT TGCTGGACAA GGCACGATCG GTATGGAAAT TCTGAAAGAA TGCGTCTCTC GACCATTGGA TGCAATCTTT GTTTGCTGTG GCGGTGGTGG TATGCTGGCC GGCATTGCGG CCTACGTGAA ACGTGTCCGA CCAACGGTCA AGGTGATTGG CGTCGAAGCC GCCGACGCGG CTGGCATGAC GGCGTCGCTG CGGGAAGGCA AACTGGTGAC GTTGGACTCC GTCGGTCTCT TTGCTGACGG CGCGGCGGTC CGGCGTGTCG GTGACGAAAC CTTTCGTGTT TGTAAAACAC TGGTCGACGA CATGATCACA GTTGACACGG ACGAAATATG CAACGCCATT AAATTAACGT ACAACGATGC TCGCGTCGTT TTGGAGCCGG CGGGGGCCCT GGCGGTCGCT GGGATGCGCA AATACGTACA CACCAACGAA CTATCCGGAC AGACTCTAGT GGCTATAACC TCCGGTGCCA ACATGGATTT CGATCGTTTG CGCTTTGTGG CGGAACGTGC CGATGGATCC GAGCGCACAC TAGCCGTTAC CATCCCGGAG AAACCCGGCT CGTTCCGCCA GCTTTATAGT CTTATTTGGC CACGCAATGT AACGGAATTC TCCTACCGTT TTGAGACGGA CGGGGATGCA CACGTACTGA TTTCGTTCCA GCCCGTAATG AATATTGAAA ACGACTTTGA AGGTATTATG CACGAACTCG GCGAGAACGG ATTTGACTGC TTGGATTTGA GTCACAACGA ACTTGCCAAA GTGCACGTTC GACACTTAGC CGGCGGTCGG TCAATTGTTA ACAATGAACG AGTCTTTCGG TTTGACTTTC CAGAGTCCCC CGGGGCACTT CAACGCTTCC TGTTGAGTTT GGACATGGGA TGGAACGTCA GCCTTTTCCA TTACCGAAAC CACGGCGACG ATTTTGGACG AGTTTTGGTT GGCATTCAAG TCACCGAGTC AGACGACAAG AAACTGAAGT CATTCCTTGG CAATCTCGGA TATCGATACG AAGAAGAAAC GAACAACCCT GTCTACCGGG CTTTCTTGTG TCAAACTAAT GGAAAGGGTT TGAACGATTT GTCAAACAAC AGCAGCAAAT AG
|
Protein sequence | MISSSIRLVT RQGGAARGAV TGPPRLLACR MIPMASQSTT AEQVEDEPAI KYALNPQGEV VKDAELIDFP GSSPGQKPIL LNAKEHVVGY LSKILNARVY DVAIESELQH AKNLSAHLKN TVLLKREDTQ PVFSFKIRGA YNKMSHLSRD LLDKGVVCCS AGNHAQGVAL SAKMLGCRAV IVMPLATPAI KVNAVRIHGG PSVEVRLFGN NYDEAATEAK RLQLEDGMTF VHPFDDPLVI AGQGTIGMEI LKECVSRPLD AIFVCCGGGG MLAGIAAYVK RVRPTVKVIG VEAADAAGMT ASLREGKLVT LDSVGLFADG AAVRRVGDET FRVCKTLVDD MITVDTDEIC NAIKLTYNDA RVVLEPAGAL AVAGMRKYVH TNELSGQTLV AITSGANMDF DRLRFVAERA DGSERTLAVT IPEKPGSFRQ LYSLIWPRNV TEFSYRFETD GDAHVLISFQ PVMNIENDFE GIMHELGENG FDCLDLSHNE LAKVHVRHLA GGRSIVNNER VFRFDFPESP GALQRFLLSL DMGWNVSLFH YRNHGDDFGR VLVGIQVTES DDKKLKSFLG NLGYRYEEET NNPVYRAFLC QTNGKGLNDL SNNSSK
|
| |