Gene Smed_4658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4658 
Symbol 
ID5318821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1168408 
End bp1169943 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content65% 
IMG OID640776456 
Producthistidine ammonia-lyase 
Protein accessionYP_001313388 
Protein GI150376792 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0234611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0199218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCA TTCTCAGGCC CGGCTCGGTT CCGCTCAGCG ATCTGGAAAC GATATACTGG 
ACTGGCGCGC CGGCGCGCCT CGACCCTGCC TTCGATGCTG GTGTGGCCAA GGCTGCAGCG
CGGATTGCCG AGATCGTCGC GGGCAATGCG CCCGTCTACG GCATCAATAC AGGTTTCGGC
AAACTGGCTT CGATCAAGAT CGACAGCGCC GACGTGGAAA CGTTGCAGCG CAATCTGATC
CTCTCCCATT GCTGCGGCGT CGGCCAGCCG CTCACGGAAA ACATCGTGCG GCTGATCATG
GCACTGAAGC TGATCTCTCT CGGCCGCGGC GCCTCCGGTG TGCGGCTCGA ACTCGTCCGG
CTCCTCGAAG CGATGCTGGA CAAGGGCGTG ATCCCGCTCA TCCCGGAGAA AGGCTCCGTA
GGCGCGTCCG GAGACCTCGC GCCGCTTGCG CACATGGCCG CGGTGATGAT GGGCCACGGC
GAGGCCTTCT ATGCCGGCGA ACGCATGGCG GGTGCAGCGG CGCTGCGGGC TGCGGGGCTT
TCTCCCGTCA CGCTTGCCGC CAAAGAGGGC CTCGCCTTGA TCAACGGCAC CCAGGTCTCG
ACGGCTCTCG CCCTTGCCGG GCTCTTCCGC GCCCACCGCG CCGGCCAGGC GGCACTTATC
ACCGGCGCCC TTTCGACCGA CGCGGCCATG GGCTCTTCCG CCCCCTTCCA TCCGGATATT
CATACGCTTC GCGGCCATAA AGGCCAGATC GACACGGCCG CCGCCTTACG GCACCTGCTG
ACTGGCTCCC CGATTCGCCA AAGCCATATC GAGGGCGACG AGCGCGTGCA GGATCCCTAT
TGCATCCGCT GCCAGCCACA GGTCGACGGC GCCTGCCTCG ACCTCCTGCG TTCCGTCGCA
GCCACCTTGA CGATCGAAGC CAACGCCGTC ACCGACAATC CGCTGGTGCT TTCGGACAAT
TCCGTCGTCT CGGGCGGCAA TTTCCATGCC GAACCGGTAG CCTTTGCCGC CGACCAGATC
GCGCTTGCGG TGTGCGAAAT CGGCGCCATT GCCCAGCGCC GCATCGCCCT TCTGGTCGAC
CCCGCGCTCA GCTACGGCCT GCCGGCTTTC CTCGCCAAGA AACCGGGTCT CAATTCCGGA
CTGATGATTG CGGAGGTCAC GTCGGCGGCG TTGATGTCGG AAAACAAGCA GCTCTCCCAT
CCAGCCTCCG TCGACTCGAC GCCCACGTCT GCAAATCAGG AAGACCACGT GTCCATGGCC
TGCCACGGTG CGCGCCGACT TCTGCAGATG ACGGACAACC TCTTTGCGAT CGTCGGCATC
GAGGCGCTCG CTGCGGTGCA GGGTATCGAG TTCCGCGCGC CGCTCACCAC CAGCCCGGAA
CTTCAGAAGG CCGCCGCTGC CGTGCGCAGC ATCTCGCCCA GCATCGAGGA AGATCGCTAC
ATGGCCGACG ACCTGAAGGC CGCGGCCTAT CTCGTGGCGT CGGGTCAGCT CGCCGCCGCC
GTCTCCGCCG GCATTCTTCC CAAACTGGAG AACTGA
 
Protein sequence
MTIILRPGSV PLSDLETIYW TGAPARLDPA FDAGVAKAAA RIAEIVAGNA PVYGINTGFG 
KLASIKIDSA DVETLQRNLI LSHCCGVGQP LTENIVRLIM ALKLISLGRG ASGVRLELVR
LLEAMLDKGV IPLIPEKGSV GASGDLAPLA HMAAVMMGHG EAFYAGERMA GAAALRAAGL
SPVTLAAKEG LALINGTQVS TALALAGLFR AHRAGQAALI TGALSTDAAM GSSAPFHPDI
HTLRGHKGQI DTAAALRHLL TGSPIRQSHI EGDERVQDPY CIRCQPQVDG ACLDLLRSVA
ATLTIEANAV TDNPLVLSDN SVVSGGNFHA EPVAFAADQI ALAVCEIGAI AQRRIALLVD
PALSYGLPAF LAKKPGLNSG LMIAEVTSAA LMSENKQLSH PASVDSTPTS ANQEDHVSMA
CHGARRLLQM TDNLFAIVGI EALAAVQGIE FRAPLTTSPE LQKAAAAVRS ISPSIEEDRY
MADDLKAAAY LVASGQLAAA VSAGILPKLE N