Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6923 |
Symbol | |
ID | 8022951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 375509 |
End bp | 377044 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644833784 |
Product | histidine ammonia-lyase |
Protein accession | YP_002984918 |
Protein GI | 241666834 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.19647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.781218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCA CGCTCCACCC GGGGTCCGTC TCGCTCAAGG ATCTCGAAAC CGTCTACTGG ACCGGCGTGC CGGCAAGGCT CGATCCCGCC TTCGATGCCG GGATCGCCAA GGCCGCTGCC CGTATCGCCG AGATCGCCGC CGGCAACGCG CCGGTCTACG GCATCAATAC CGGCTTCGGC AAACTCGCCT CGATCAAGAT CGACAGCGCC GATGTGACCA CCTTGCAGCG CAATCTCATC CTGTCGCATT GCTGCGGCGT CGGCGCGCCA CTGCCTGAGA ATATCGCGCG GCTGATCATG GCGCTCAAGC TGGTCTCGCT TGGGCGCGGT GCCTCCGGCG TGCGGCTGGA GCTGGTGCGG CTGATCGAAG GCATGCTGGA AAAGGGCGTC ATTCCGCTGA TTCCGGAAAA GGGCTCTGTC GGCGCCTCCG GCGATCTTGC CCCGCTTGCC CATATGGCGG CGGTGATGAT GGGCGAGGCC GAAGCCTTCT TCGCCGGCGA ACGCCTCCCG GGCGCGCAAG CCCTCGAAAG GGCCGGGCTG AAACCAGTGG TGCTCGCCGC CAAGGAGGGT CTCGCTCTCA TCAACGGCAC CCAGACCTCG ACGGCGCTGG CACTTGCCGG CCTCTTCCGC GCCCATCGCG CCGCGCAGGC GGCTCTGATT ACCGGCGCCA TGTCCACCGA TGCCGCCATG GGCTCTTCGG CGCCCTTTCA TCCGGATATT CACACACTGC GTGGCCACAA GGGCCAGATC GACACGGCCG CCGCACTTCG GGCGCTCCTC GAAAACTCCA TCATTCGCCA GAGCCACATC GAGGGCGACG AGCGCGTGCA GGATCCCTAT TGCATCCGCT GCCAGCCGCA GGTCGACGGT GCCTGCCTCG ATCTGTTGCG CTCGGTTGCC CGCACCCTCG AAATCGAGGC CAACGCGGTC ACCGACAATC CGCTGGTGCT TTCGGACAAT TCCGTCGTCT CCGGCGGCAA TTTCCACGCC GAACCCGTCG CCTTCGCCGC CGACCAGATC GCGCTCGCCG TCTGCGAGAT CGGCGCGATT TCGCAGCGCC GCATCGCGCT GCTCGTCGAT CCGACCCTCT CCTACGGCCT GCCGGCCTTT CTCGCCAAGA AGCCGGGCCT GAACTCCGGC CTGATGATCG CCGAGGTGAC GTCAGCGGCG CTGATGTCCG AAAACAAGCA GATGTCGCAC CCCGCCTCGG TCGATTCGAC CCCGACATCG GCCAATCAGG AAGACCATGT CTCCATGGCC TGCCATGGCG CCCGCCGCCT GCTCGCCATG ACCGAGAACC TGTTCGGCAT CATCGGCATC GAGGCGCTGA CCGCCGCCCA AGGCGTCGAA CTGCGCGCGC CGCTGTCGAC CAGCCCGGAG CTTGGCAAGG CGATCACGGC CATCCGCACC AAGGTGGCGA GCCTCGACGT TGACCGCTAC ATGGCAAACG ACCTCGCCGC CGCCGCCGAA CTGGTGGCGA CGGGTGCGCT GAACGCCTCG GTTTCTTCTG GGATCTTGCC GGTTCTGGAG AGCTGA
|
Protein sequence | MTITLHPGSV SLKDLETVYW TGVPARLDPA FDAGIAKAAA RIAEIAAGNA PVYGINTGFG KLASIKIDSA DVTTLQRNLI LSHCCGVGAP LPENIARLIM ALKLVSLGRG ASGVRLELVR LIEGMLEKGV IPLIPEKGSV GASGDLAPLA HMAAVMMGEA EAFFAGERLP GAQALERAGL KPVVLAAKEG LALINGTQTS TALALAGLFR AHRAAQAALI TGAMSTDAAM GSSAPFHPDI HTLRGHKGQI DTAAALRALL ENSIIRQSHI EGDERVQDPY CIRCQPQVDG ACLDLLRSVA RTLEIEANAV TDNPLVLSDN SVVSGGNFHA EPVAFAADQI ALAVCEIGAI SQRRIALLVD PTLSYGLPAF LAKKPGLNSG LMIAEVTSAA LMSENKQMSH PASVDSTPTS ANQEDHVSMA CHGARRLLAM TENLFGIIGI EALTAAQGVE LRAPLSTSPE LGKAITAIRT KVASLDVDRY MANDLAAAAE LVATGALNAS VSSGILPVLE S
|
| |