Gene Rmet_5047 details

Gene Information

Locus tag: Rmet_5047
Symbol: hutH
ID: 4041909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 1734019
End bp: 1735575
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 67%
IMG OID: 637980468
Product: histidine ammonia-lyase
Protein accession: YP_587178
Protein GI: 94313969
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
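
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function name `check_record` is hypothetical and not part of any database API.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # complete codons, stop included
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_codons": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields for Rmet_5047
result = check_record(1734019, 1735575, 1557, 518)
print(result)
```

For this record all three checks hold: 1735575 - 1734019 + 1 = 1557 bp, and 1557 / 3 = 519 codons, which is 518 residues plus one stop codon.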


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.114235
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTACC AAGAAAATCA GCCAGTCTTC GCCCTGCATC TGCAACCCGG TGAGCTCACG 
CTCGCGCAAC TGCGTGCCGT GCATCAGCAG CCCGTCAAGA TCACGCTCGA CGAGCGCGCC
TTCCCCGCCA TCGACCGCAG CGTGGCGTGT GTCGAAAACA TCATCGCCGA GGGCCGCACG
GCCTACGGCA TCAACACCGG ATTCGGCCTG CTGGCACAGA CCCGCATTGC CCGCGAGGAC
CTGGAGAACC TGCAGCGTTC GCTGGTGCTT TCCCACGCGG CAGGTGTGGG CGAGCCGATC
GACGACGCGC TGGTGCGCCT GATCATGGTG CTCAAGATCA ACAGCCTGGC GCGTGGCCTG
TCTGGCATCC GCCGCAAGGT GATCAGCGCG CTGATCGCGC TGGTGAACGC CGAGGTGTAC
CCGTGCATTC CGCTGAAGGG CTCCGTTGGC GCGTCGGGCG ATCTGGCGCC GCTGGCCCAT
ATGTCGTTGC TGCTGCTGGG CGAAGGCCGG GCACGCCATC GCGGCGAATG GCTGTCGGCT
CGCGAAGCGC TGGCCATTGC CGATTTGCAG CCGCTGACGC TCGCCGCCAA GGAAGGGCTG
GCGCTGCTGA ATGGCACGCA GGTTTCCACC GCGTATGCGC TGCAGGGTTT GTTCCAGGCG
GAAGACCTGT ACGCGGCGGC CAGCGTTTGC GGCGCGCTGA CCGTCGAGGC GACGCTCGGC
TCGCGCGCTC CGTTCGATCC GCGCATTCAC GCCGCCCGTG GCCAGCGCGG CCAGATCGAC
GCGGCGGCGG TCTATCGTCA CCTGCTGGGG GAGACCAGCC AGCTCGGCCA ATCGCACGCG
CATTGCGACA AGGTGCAGGA CCCGTACTCG CTGCGCTGCC AGCCCCAGGT CATGGGCGCC
TGCCTGACGC AGATCCGCAA TGCGGCGGAT GTGCTGGGCG TCGAAGCGAA CTCGGTTTCC
GACAATCCGC TGGTATTCGC GCAGGAAGGC GATATCATCT CGGGCGGCAA TTTCCACGCG
GAACCGGTGG CAATGGCCGC GGACAACCTC GCGCTGGCGC TGGCCGAGAT CGGCTCGCTC
TCCGAGCGGC GTGTGTCGCT GATGATGGAC CAGCACCTGT CGCAGTTGCC GCCGTTCCTG
GTCGCCAATG GCGGCGTGAA CTCCGGCTTC ATGATCGCGC AGGTCACGGC CGCCGCGCTG
GCATCGGACA ACAAAGCGCT TGCGCATCCG GCCAGTGTCG ATAGCCTGCC TACCTCGGCG
AACCAGGAAG ACCATGTGTC GATGGCGCCC AACGCTGGCA AGCGTCTCTG GGAGATGGCC
AGCAACGTCA AGGGCATCGT GGCGATCGAA TGGCTGGCGG CCTGCCAGGG CATGGACTTC
CGTGAAGGCG GCAAGACCAC CGAAGCGCTG GAGCGCGCGC GCGGCCTGCT GCGTCAGTCG
GTGCCGTTCT ACGACAAGGA CCGGTACTTC GCGCCGGATA TCGAAGACGC CAGCGTGCTG
ATCGCCGAGC GTCACCTGAC GGCCCTGCTG CCTGCAGGCA TCCTGCCGAG CGTCTGA
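
The gene sequence is printed in blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_percent` are illustrative, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as formatted in the record
first_line = "ATGTCGTACC AAGAAAATCA GCCAGTCTTC GCCCTGCATC TGCAACCCGG TGAGCTCACG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence would give values to compare with the Gene Length and GC content fields.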
 
Protein sequence
MSYQENQPVF ALHLQPGELT LAQLRAVHQQ PVKITLDERA FPAIDRSVAC VENIIAEGRT 
AYGINTGFGL LAQTRIARED LENLQRSLVL SHAAGVGEPI DDALVRLIMV LKINSLARGL
SGIRRKVISA LIALVNAEVY PCIPLKGSVG ASGDLAPLAH MSLLLLGEGR ARHRGEWLSA
REALAIADLQ PLTLAAKEGL ALLNGTQVST AYALQGLFQA EDLYAAASVC GALTVEATLG
SRAPFDPRIH AARGQRGQID AAAVYRHLLG ETSQLGQSHA HCDKVQDPYS LRCQPQVMGA
CLTQIRNAAD VLGVEANSVS DNPLVFAQEG DIISGGNFHA EPVAMAADNL ALALAEIGSL
SERRVSLMMD QHLSQLPPFL VANGGVNSGF MIAQVTAAAL ASDNKALAHP ASVDSLPTSA
NQEDHVSMAP NAGKRLWEMA SNVKGIVAIE WLAACQGMDF REGGKTTEAL ERARGLLRQS
VPFYDKDRYF APDIEDASVL IAERHLTALL PAGILPSV
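
The protein sequence corresponds to the gene sequence read codon by codon. A minimal translation sketch follows, assuming the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 also permits alternative start codons; that case is not handled here, which is acceptable for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGTCGTACC AAGAAAATCA GCCAGTCTTC"))  # MSYQENQPVF
```

The output matches the first 10 residues of the protein sequence, and the last four codons of the gene (CCG AGC GTC TGA) translate to PSV followed by a stop.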