Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_1739 |
Symbol | |
ID | 4116639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 1768858 |
End bp | 1770480 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638036537 |
Product | histidine ammonia-lyase |
Protein accession | YP_644511 |
Protein GI | 108804574 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase [TIGR01226] phenylalanine ammonia-lyase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0504276 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGAGA CCCTGCCTGC GACCGGCTAC GGGCTAGAGC TGGACGGGCG CTCGCTGGGT CTTGAGGACG TGGTCGCGGT GGCCCGGGGG GAGGCCGGCG AGTGCGTCCT ATCCGGCGCG GCGGCGGAGA GGGTAGAGGA GGCGAACCGT CTGAAGCGGG AGCTCATCGC CTCCGAGCGT CCCATCTACG GGGTGACCAC CGGCTTCGGG GACAGCGCGC ACCGCCAGAT CTCGCCCGCC AGGACGGCCG AGCTGCAGAA GAACATCCTG CGCTTTCTGG GCAACGGGAT CGGGCCGCTG GCCCCGCCCG AGGTCGTGCG GGCCACCATG CTGCTCCGGG CCAACTGCAT GGCCCGGGGC AATTCCGGGG TGCGCCGGGA GCTGGTGGAG CTGCTGCTGG CGTTCGTCAA CCACGACGTG CTGCCGCCCA TCCCCGAGCG TGGTTCCTGC GGGGCGAGCG GGGATCTCGT CCCGCTCTCC TATCTGGGCT CCGCGCTCAC CGGGCACGGC GAGGTGCTCC ACCGCGGGGA GTGGCGGCCG GTGGGGGAGG TGCTCGAGGA GCTCGGGCTC GCGCCGCTCG AGCTGGAGGC CAAGGAGGGG CTCGCCATAA CCAACGGCAC CTCCTTCATG AGCGCCTTCG CCGCGCTCGC CGTGTGGGAC GCCGGGGAGC TGGCCTTCGT GTGCGACCTG TGCACGGCCA TGGCCTCCGA GGCGCTGCTC GGCAACCGGG CGCACTTCCA CCCCTTCATC CACGAGAACA AGCCGCACCC CGGGCAGGTG GAGAGCGCGC GCGTCATCCG CGGGCTGCTC GAGGGCTCCG GGCTCTCCAC CGAGATAGAC CAGGTGCTCT CCGGGGACGG CCTCGGGGGG AGGGGCTACC GGGAGCTGGA GCGCAACATC CAGGACAAGT ACTCCATACG CTGCGCGCCG CACGTGAACG GTGTGCTCCG GGACACCCTC GGCTGGGTCC GGCGGTGGGT GGAGGTCGAG ATGAACTCCT CCGACGACAA CCCCCTCTTC GACGCGGAGG GGCGCGCCGT CCACAGCGGG GGCAACTTCT ACGGCGGGCA CATCGTGCAG GCCATGGACT CCCTGAAGGT CGCGCTCGCC AGCGTCGCCG ACCTTATGGA CCGGCAGCTG GAGCTCGTGG TAGACGAGAA GTTCAACAAC GGGCTCACCC CCAACCTCAT CCCGTTCTTC GACCCCGAGG GGCCGCAGGC GGGGCTGCAC CACGGCTTCA AGGGGATGCA GCTCGCCTGC TCCTCGCTGG TGGCCGAGGC CTGCAAGCTG TCCAGCCCGG TGAGCGTCCA CTCCCGCTCC ACAGAGGCGC ACAACCAGGA CAAGGTCAGC ATGGGGACCA TCGCGGCGCG CGACGCCAGG ACCATCGTGG AGCTCGCGCA GAACGTGGCG GCCATCCACC TCATCGCCGT CTGCCAGGCG CTGGATCTGA GGGGCACGCA GAGCATGGCG CCGAGGACGC GGGAGGCCCA CCGGCTGGTG CGCGAGCGGG TGCCCTTCCT CGACGCGGAC CGGCGGATGG AGGAGGACAT CCGCCGGGTG GTGGAGATGA TCAAAGCCCG GGAGCTCTCC CGGGCGCTGG GGTACCAGGA TGCCTCTGCC TGA
|
Protein sequence | MRETLPATGY GLELDGRSLG LEDVVAVARG EAGECVLSGA AAERVEEANR LKRELIASER PIYGVTTGFG DSAHRQISPA RTAELQKNIL RFLGNGIGPL APPEVVRATM LLRANCMARG NSGVRRELVE LLLAFVNHDV LPPIPERGSC GASGDLVPLS YLGSALTGHG EVLHRGEWRP VGEVLEELGL APLELEAKEG LAITNGTSFM SAFAALAVWD AGELAFVCDL CTAMASEALL GNRAHFHPFI HENKPHPGQV ESARVIRGLL EGSGLSTEID QVLSGDGLGG RGYRELERNI QDKYSIRCAP HVNGVLRDTL GWVRRWVEVE MNSSDDNPLF DAEGRAVHSG GNFYGGHIVQ AMDSLKVALA SVADLMDRQL ELVVDEKFNN GLTPNLIPFF DPEGPQAGLH HGFKGMQLAC SSLVAEACKL SSPVSVHSRS TEAHNQDKVS MGTIAARDAR TIVELAQNVA AIHLIAVCQA LDLRGTQSMA PRTREAHRLV RERVPFLDAD RRMEEDIRRV VEMIKARELS RALGYQDASA
|
| |