Gene Rmet_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_0231 
Symbol 
ID4037017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007973 
Strand
Start bp246473 
End bp248074 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content67% 
IMG OID637975604 
Producthistidine ammonia-lyase 
Protein accessionYP_582386 
Protein GI94309176 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCACG CCCATCCGGC CGACATCGAC GGCCACCACC TGACCCCCGA CACCGTCGCC 
GCCATCGCAC GCGGCCAGCG CGCCGCCATC GTCCCGGAGC CCGTCCTCGG CAAGGTTGCC
GATGCCCGCG CCCGCTTCGA GCAGGTGGCT GCGGCCAATG TGCCGATCTA CGGCGTCTCC
ACGGGCTTTG GCGAACTGGT ACACAACTGG GTCGACATCG AACATGGCCG TGCGCTGCAG
GAGAACCTGT TGCGCAGCCA TTGCGCGGGT GTGGGTCCGC TGTTTTCGCG CGACGAGGTC
CGCGCGATGA TGGTCGCGCG TGCCAATGCG TTGGCACGCG GATACTCGGC GGTGCGGCCG
GCCGTTATCG AACAACTGCT GAAGTATCTG GAAGCCGGCA TCACGCCAGC CGTGCCGCAG
GTGGGTTCGC TCGGGGCCAG CGGTGATCTC GCGCCTCTGT CGCACGTCGC CATCACGCTG
ATCGGCGAAG GCAAGGTGCT GACCGAGGAT GGCGGTACGG CACCCACGGC CGAAGTCCTG
CGCGAGCGTG GCATCACGCC GCTCGCGCTG GCGTACAAGG AAGGGCTGGC GCTGATCAAC
GGGACATCGG CCATGACCGG GGTGTCGTGC CTGTTGCTGG AGACGCTGCG CGCGCAGGTC
CAGCAGGCCG AGATCATCGC GGCGCTGGCG CTCGAAGGAC TATCCGCCTC GGCCGATGCC
TTCATGGCCC ATGGGCACGA CATCGCCAAA CCGCATCCGG GACAGATCCG CTCGGCGGCG
AACATGCGCG CGCTGCTGGC CGATTCGGCA CGGCTCTCCG GACATGGCGA ACTGTCCGCC
GAGATGAAGA CACGCGCGGG CGAGGCCAAG AACACCGGCA CTGGCGTGTT CATCCAGAAG
GCCTACACGC TGCGCTGCAT TCCGCAGGTG CTTGGCGCGG TGCGCGATAC GCTCGACCAT
TGCGCCACCG TGGTCGAGCG CGAACTGAAT TCATCGAATG ACAATCCGCT GTTCTTCGAA
GACGGCGAGC TGTTCCACGG CGGCAACTTC CACGGCCAGC AGGTGGCATT CGCAATGGAC
TTCCTGGCCA TCGCCGCCAC GCAACTGGGC GTGGTGTCGG AGCGCCGCCT GAACCGCCTG
CTGAGCCCGC ATCTGAACAA CAATCTGCCG GCGTTCCTGG CGGCGGCGAA CGAGGGGTTG
TCGTGCGGGT TTGCCGGGGC ACAGTATCCG GCCACGGCGT TGATTGCCGA GAACCGCACG
ATCTGCAGCC CGGCGAGCAT CCAGAGTGTG CCGTCGAACG GCGACAACCA GGATGTGGTC
AGCATGGGGC TGATCGCTGC CCGCAATGCC CGCCGCATTC TCGACAATAA CCAGTACATC
CTCGCGCTGG AGTTGCTGGC GTCATGTCAG GCCGCCGAAC TCGCGGGCGC GGTCGAGCAA
CTGGCGCCGG CAGGCCGCGC CGTGTTCGCG TTCGTGCGGG AGCGCGTGCC GTTCCTGTCG
ATCGATCGCT ATATGACCGA CGACATCGAG GCTATGGCGG CGCTGCTCCG TCAGGGCGCG
CTGGTTGAGG TCGTGCGTGG CGCGGGCATC GAACTGGCCT GA
 
Protein sequence
MPHAHPADID GHHLTPDTVA AIARGQRAAI VPEPVLGKVA DARARFEQVA AANVPIYGVS 
TGFGELVHNW VDIEHGRALQ ENLLRSHCAG VGPLFSRDEV RAMMVARANA LARGYSAVRP
AVIEQLLKYL EAGITPAVPQ VGSLGASGDL APLSHVAITL IGEGKVLTED GGTAPTAEVL
RERGITPLAL AYKEGLALIN GTSAMTGVSC LLLETLRAQV QQAEIIAALA LEGLSASADA
FMAHGHDIAK PHPGQIRSAA NMRALLADSA RLSGHGELSA EMKTRAGEAK NTGTGVFIQK
AYTLRCIPQV LGAVRDTLDH CATVVERELN SSNDNPLFFE DGELFHGGNF HGQQVAFAMD
FLAIAATQLG VVSERRLNRL LSPHLNNNLP AFLAAANEGL SCGFAGAQYP ATALIAENRT
ICSPASIQSV PSNGDNQDVV SMGLIAARNA RRILDNNQYI LALELLASCQ AAELAGAVEQ
LAPAGRAVFA FVRERVPFLS IDRYMTDDIE AMAALLRQGA LVEVVRGAGI ELA