Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4658 |
Symbol | |
ID | 5318821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1168408 |
End bp | 1169943 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640776456 |
Product | histidine ammonia-lyase |
Protein accession | YP_001313388 |
Protein GI | 150376792 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0234611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0199218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCA TTCTCAGGCC CGGCTCGGTT CCGCTCAGCG ATCTGGAAAC GATATACTGG ACTGGCGCGC CGGCGCGCCT CGACCCTGCC TTCGATGCTG GTGTGGCCAA GGCTGCAGCG CGGATTGCCG AGATCGTCGC GGGCAATGCG CCCGTCTACG GCATCAATAC AGGTTTCGGC AAACTGGCTT CGATCAAGAT CGACAGCGCC GACGTGGAAA CGTTGCAGCG CAATCTGATC CTCTCCCATT GCTGCGGCGT CGGCCAGCCG CTCACGGAAA ACATCGTGCG GCTGATCATG GCACTGAAGC TGATCTCTCT CGGCCGCGGC GCCTCCGGTG TGCGGCTCGA ACTCGTCCGG CTCCTCGAAG CGATGCTGGA CAAGGGCGTG ATCCCGCTCA TCCCGGAGAA AGGCTCCGTA GGCGCGTCCG GAGACCTCGC GCCGCTTGCG CACATGGCCG CGGTGATGAT GGGCCACGGC GAGGCCTTCT ATGCCGGCGA ACGCATGGCG GGTGCAGCGG CGCTGCGGGC TGCGGGGCTT TCTCCCGTCA CGCTTGCCGC CAAAGAGGGC CTCGCCTTGA TCAACGGCAC CCAGGTCTCG ACGGCTCTCG CCCTTGCCGG GCTCTTCCGC GCCCACCGCG CCGGCCAGGC GGCACTTATC ACCGGCGCCC TTTCGACCGA CGCGGCCATG GGCTCTTCCG CCCCCTTCCA TCCGGATATT CATACGCTTC GCGGCCATAA AGGCCAGATC GACACGGCCG CCGCCTTACG GCACCTGCTG ACTGGCTCCC CGATTCGCCA AAGCCATATC GAGGGCGACG AGCGCGTGCA GGATCCCTAT TGCATCCGCT GCCAGCCACA GGTCGACGGC GCCTGCCTCG ACCTCCTGCG TTCCGTCGCA GCCACCTTGA CGATCGAAGC CAACGCCGTC ACCGACAATC CGCTGGTGCT TTCGGACAAT TCCGTCGTCT CGGGCGGCAA TTTCCATGCC GAACCGGTAG CCTTTGCCGC CGACCAGATC GCGCTTGCGG TGTGCGAAAT CGGCGCCATT GCCCAGCGCC GCATCGCCCT TCTGGTCGAC CCCGCGCTCA GCTACGGCCT GCCGGCTTTC CTCGCCAAGA AACCGGGTCT CAATTCCGGA CTGATGATTG CGGAGGTCAC GTCGGCGGCG TTGATGTCGG AAAACAAGCA GCTCTCCCAT CCAGCCTCCG TCGACTCGAC GCCCACGTCT GCAAATCAGG AAGACCACGT GTCCATGGCC TGCCACGGTG CGCGCCGACT TCTGCAGATG ACGGACAACC TCTTTGCGAT CGTCGGCATC GAGGCGCTCG CTGCGGTGCA GGGTATCGAG TTCCGCGCGC CGCTCACCAC CAGCCCGGAA CTTCAGAAGG CCGCCGCTGC CGTGCGCAGC ATCTCGCCCA GCATCGAGGA AGATCGCTAC ATGGCCGACG ACCTGAAGGC CGCGGCCTAT CTCGTGGCGT CGGGTCAGCT CGCCGCCGCC GTCTCCGCCG GCATTCTTCC CAAACTGGAG AACTGA
|
Protein sequence | MTIILRPGSV PLSDLETIYW TGAPARLDPA FDAGVAKAAA RIAEIVAGNA PVYGINTGFG KLASIKIDSA DVETLQRNLI LSHCCGVGQP LTENIVRLIM ALKLISLGRG ASGVRLELVR LLEAMLDKGV IPLIPEKGSV GASGDLAPLA HMAAVMMGHG EAFYAGERMA GAAALRAAGL SPVTLAAKEG LALINGTQVS TALALAGLFR AHRAGQAALI TGALSTDAAM GSSAPFHPDI HTLRGHKGQI DTAAALRHLL TGSPIRQSHI EGDERVQDPY CIRCQPQVDG ACLDLLRSVA ATLTIEANAV TDNPLVLSDN SVVSGGNFHA EPVAFAADQI ALAVCEIGAI AQRRIALLVD PALSYGLPAF LAKKPGLNSG LMIAEVTSAA LMSENKQLSH PASVDSTPTS ANQEDHVSMA CHGARRLLQM TDNLFAIVGI EALAAVQGIE FRAPLTTSPE LQKAAAAVRS ISPSIEEDRY MADDLKAAAY LVASGQLAAA VSAGILPKLE N
|
| |