Gene Rxyl_1739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_1739 
Symbol 
ID4116639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp1768858 
End bp1770480 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content70% 
IMG OID638036537 
Producthistidine ammonia-lyase 
Protein accessionYP_644511 
Protein GI108804574 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase
[TIGR01226] phenylalanine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0504276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGAGA CCCTGCCTGC GACCGGCTAC GGGCTAGAGC TGGACGGGCG CTCGCTGGGT 
CTTGAGGACG TGGTCGCGGT GGCCCGGGGG GAGGCCGGCG AGTGCGTCCT ATCCGGCGCG
GCGGCGGAGA GGGTAGAGGA GGCGAACCGT CTGAAGCGGG AGCTCATCGC CTCCGAGCGT
CCCATCTACG GGGTGACCAC CGGCTTCGGG GACAGCGCGC ACCGCCAGAT CTCGCCCGCC
AGGACGGCCG AGCTGCAGAA GAACATCCTG CGCTTTCTGG GCAACGGGAT CGGGCCGCTG
GCCCCGCCCG AGGTCGTGCG GGCCACCATG CTGCTCCGGG CCAACTGCAT GGCCCGGGGC
AATTCCGGGG TGCGCCGGGA GCTGGTGGAG CTGCTGCTGG CGTTCGTCAA CCACGACGTG
CTGCCGCCCA TCCCCGAGCG TGGTTCCTGC GGGGCGAGCG GGGATCTCGT CCCGCTCTCC
TATCTGGGCT CCGCGCTCAC CGGGCACGGC GAGGTGCTCC ACCGCGGGGA GTGGCGGCCG
GTGGGGGAGG TGCTCGAGGA GCTCGGGCTC GCGCCGCTCG AGCTGGAGGC CAAGGAGGGG
CTCGCCATAA CCAACGGCAC CTCCTTCATG AGCGCCTTCG CCGCGCTCGC CGTGTGGGAC
GCCGGGGAGC TGGCCTTCGT GTGCGACCTG TGCACGGCCA TGGCCTCCGA GGCGCTGCTC
GGCAACCGGG CGCACTTCCA CCCCTTCATC CACGAGAACA AGCCGCACCC CGGGCAGGTG
GAGAGCGCGC GCGTCATCCG CGGGCTGCTC GAGGGCTCCG GGCTCTCCAC CGAGATAGAC
CAGGTGCTCT CCGGGGACGG CCTCGGGGGG AGGGGCTACC GGGAGCTGGA GCGCAACATC
CAGGACAAGT ACTCCATACG CTGCGCGCCG CACGTGAACG GTGTGCTCCG GGACACCCTC
GGCTGGGTCC GGCGGTGGGT GGAGGTCGAG ATGAACTCCT CCGACGACAA CCCCCTCTTC
GACGCGGAGG GGCGCGCCGT CCACAGCGGG GGCAACTTCT ACGGCGGGCA CATCGTGCAG
GCCATGGACT CCCTGAAGGT CGCGCTCGCC AGCGTCGCCG ACCTTATGGA CCGGCAGCTG
GAGCTCGTGG TAGACGAGAA GTTCAACAAC GGGCTCACCC CCAACCTCAT CCCGTTCTTC
GACCCCGAGG GGCCGCAGGC GGGGCTGCAC CACGGCTTCA AGGGGATGCA GCTCGCCTGC
TCCTCGCTGG TGGCCGAGGC CTGCAAGCTG TCCAGCCCGG TGAGCGTCCA CTCCCGCTCC
ACAGAGGCGC ACAACCAGGA CAAGGTCAGC ATGGGGACCA TCGCGGCGCG CGACGCCAGG
ACCATCGTGG AGCTCGCGCA GAACGTGGCG GCCATCCACC TCATCGCCGT CTGCCAGGCG
CTGGATCTGA GGGGCACGCA GAGCATGGCG CCGAGGACGC GGGAGGCCCA CCGGCTGGTG
CGCGAGCGGG TGCCCTTCCT CGACGCGGAC CGGCGGATGG AGGAGGACAT CCGCCGGGTG
GTGGAGATGA TCAAAGCCCG GGAGCTCTCC CGGGCGCTGG GGTACCAGGA TGCCTCTGCC
TGA
 
Protein sequence
MRETLPATGY GLELDGRSLG LEDVVAVARG EAGECVLSGA AAERVEEANR LKRELIASER 
PIYGVTTGFG DSAHRQISPA RTAELQKNIL RFLGNGIGPL APPEVVRATM LLRANCMARG
NSGVRRELVE LLLAFVNHDV LPPIPERGSC GASGDLVPLS YLGSALTGHG EVLHRGEWRP
VGEVLEELGL APLELEAKEG LAITNGTSFM SAFAALAVWD AGELAFVCDL CTAMASEALL
GNRAHFHPFI HENKPHPGQV ESARVIRGLL EGSGLSTEID QVLSGDGLGG RGYRELERNI
QDKYSIRCAP HVNGVLRDTL GWVRRWVEVE MNSSDDNPLF DAEGRAVHSG GNFYGGHIVQ
AMDSLKVALA SVADLMDRQL ELVVDEKFNN GLTPNLIPFF DPEGPQAGLH HGFKGMQLAC
SSLVAEACKL SSPVSVHSRS TEAHNQDKVS MGTIAARDAR TIVELAQNVA AIHLIAVCQA
LDLRGTQSMA PRTREAHRLV RERVPFLDAD RRMEEDIRRV VEMIKARELS RALGYQDASA