Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1802 |
Symbol | |
ID | 6316372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1865992 |
End bp | 1867368 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642644176 |
Product | tyrosine phenol-lyase |
Protein accession | YP_001917962 |
Protein GI | 188586417 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | [TIGR02618] tyrosine phenol-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.106432 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00367591 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAAGAG AGTTTCAAGC AGAACCTTTC AGAATTAAAA CAGTAGAGCC AATCAAAATG ACTACAAAAG AAGAGCGAGA AGAGGCTATA CAAAAAGCAG GGTTCAACAC TTTCCTATTA GATTCTAAAG ATGTTTACAT CGATTTATTA ACTGACAGCG GAACTAATGC CATGAGTGAT TATCAATGGG CAGGTCTTAT GATGGGTGAT GAGGCTTATG CCGGAAGTAA AAACTGGTAT AACTTAGAAA AAGCAGTACA AGAATGTTTT GGTTTCGAAT ATGTAGTTCC AACTCATCAA GGTAGGGGCG CAGAAAACAT TTTATCTCAG ATCATGATTA AAGAGGGAGA TTACATCCCA GGAAATATGT ATTTTACAAC TACTAAGGCC CATCAAGAAT TTGCCGGAGG TACTTTTAGA GATGTGATAA TTGACAAGGC ACATGATTCA CAAGCTGAGC ATCCTTTTAA GGGAAATGTA GACCTAGAAA AATATCAAGC TTTGATTGAT GAAGTAGGTC CGGAACGAAT TCCATATATC TGTGTTGCAG TTACTGTAAA TATGGCGGGT GGACAGCCTG TTAGCATGGA AAATTTACGC AAGGTTAAAG AAATATCTGA TAAATATGGT ATAAAAGTAA TGTTTGATGC TACTCGTTGT GTGGAAAATG CTTACTTTAT CAAAGAAAGA GAAGAAGGGT ATCAGGATAA ATCCATTGCT GAAATTTTAA AAGAAATGAT GAGTTATGGT GACGGAGCTA CTATGAGTGG TAAAAAAGAT CCCTTAGTAA ATATTGGTGG TTTCTTGGCT ATGAATGACG AAGAGCTATA TCAAAGAGCT ACTCAACTAG TTGTTGTGTA CGAGGGAATG CCCACTTATG GTGGAATGGC AGGAAGAGAT ATGGAAGCCA TGGCGAGAGG AATATATGAA TCACTAGATT ATCACTACAT TAGTCACAGA GTCAATCAGG TAAGGTATTT AGGAGAGAAG CTAGAGGCTG GCGGAGTTCC AATTGTCAAA CCTATCGGTG GTCATGCAAT ATTTGTAGAT GCCGAGAGAT TCTTGTCCCA TTTACCTAGA GAAGAATTTC CAGCCCAGGC CTTGGCTGCT AATATTTATA GAGATTCTGG AGTAAGGACT ATGGAAAGAG GTACTGTTTC TGCTGGAAGA GATAAAGATG GCAACAATAT TTTCCCCAAA TTAGAAACAG TTAGATTAAC TATTCCTAGA AGAGTTTACA CTTATGCGCA TATGGACGTT GTTGCGGACT CGATTATCAA TCTTTATGAA AATAGAGAAA AAATCAAAGG TTTACGATTT GCTTATGAAC CACCAGTATT AAGATTCTTT ACCGCAAAAT TTGAAGAAAT AAAATAA
|
Protein sequence | MSREFQAEPF RIKTVEPIKM TTKEEREEAI QKAGFNTFLL DSKDVYIDLL TDSGTNAMSD YQWAGLMMGD EAYAGSKNWY NLEKAVQECF GFEYVVPTHQ GRGAENILSQ IMIKEGDYIP GNMYFTTTKA HQEFAGGTFR DVIIDKAHDS QAEHPFKGNV DLEKYQALID EVGPERIPYI CVAVTVNMAG GQPVSMENLR KVKEISDKYG IKVMFDATRC VENAYFIKER EEGYQDKSIA EILKEMMSYG DGATMSGKKD PLVNIGGFLA MNDEELYQRA TQLVVVYEGM PTYGGMAGRD MEAMARGIYE SLDYHYISHR VNQVRYLGEK LEAGGVPIVK PIGGHAIFVD AERFLSHLPR EEFPAQALAA NIYRDSGVRT MERGTVSAGR DKDGNNIFPK LETVRLTIPR RVYTYAHMDV VADSIINLYE NREKIKGLRF AYEPPVLRFF TAKFEEIK
|
| |