Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4004 |
Symbol | |
ID | 8014813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4080813 |
End bp | 4082426 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826573 |
Product | histidine ammonia-lyase |
Protein accession | YP_002977784 |
Protein GI | 241206688 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase [TIGR01226] phenylalanine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.187247 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.249864 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCA TAATTCTAGA CGGTGACAGC CTGACGATCA AAGATACCGT TCGCATCGCG CGCCAGGGCG CCAAGGTCGC GCTCGCCGAT GCAGCCCGCG CCGAAATCAT CAAGGTGAGA AACTATATCG AGGAAAACTG GCTGACCGAA AACGCGCCGC CGACCTACGG TTTCAATACC GGCGTCGGCA AGCTCAAGGA TTATGCCATC AACCAGGCCG ATAACGACCG CTTCCAGCGC AATATCGTGC TCTCTCATTG CTCCGGCATC GGAGAGCCGG CGTCGGAAGA AATCGTCCGC GCCATGATGG CCGTCCGCAT CAACGCCTTC TGCCTCGGAG TTTCCGGCCT GCGGATCGAG GTGGTTGATC GTCTTGTTGA GATGTTGAAC CGCGGCGTTC ACCCTGTGGT GCCGATCCAG GGGTCGGTCG GCGCGTCTGG CGATCTGGCG CCGCTCGCGC ACATGGTTTC GGTGCTGATC GGCTATGAGG AGGCGGAAGC CTATTACCAG GGCGAACGCA TGCCGGCGCC GCAGGCGCTG GAAAAAGCCG GCATTTTCCC AATTGCTTTC GATCTCAAGG CGAAGGACTG CCTTGCCCTC ATCAATGGCA ACAGCCTCTG CGCGGCCATG GCGGTTCTCA ACCTCCACGA CGCCGAGATG CTGATGAAGA CAGCCGATGC GGCCGGCGCG CTCAGTCTGG AGGCGATCCG CGGCGAGCAG GCGGCGTTCG ATCCCCGCAT TCATCTTGTG CGCAAGCAGC CCGGGCAGAT CGCAACTGCG GAAAATATCC GTCGCATTAT CGAGGGCAGC CGTCGCACGA CCGAGGCGGC GCGTGCGGTG CGCCTCGAGG ACGATATCCT GCATCCGAAA CACACCGCTC GAATCCAGGA TCAGTATTCC TTCCGTTGCC TGCCGCAGGT GCATGGAAGC TGCCGCGACC AGTTGGAGCA CGCCAAGGAG CTGATCACGC GCGAGCTCAA CGCCGCGACC GATAATCCGC TCGTCTTCTG GAACGAACTC GGCGCGCTGG AATTTCTGTC CGGCGGCAAC TTTCATTGCG AACCCATCGC TTTTGCCATG GACTTGCTGA CCATCGCTTT GGTGGAAATC GGCAATATTT CCGAGCGCCG CCTGTTCTCG CTCTGCGACA CGACATTGAA CTACGGCCTG CCGCCGAACC TTGCCGGCAA GCCGATCGGC CTGAATTACG GCTATGGCAT CATCTCGACG GCTGCGGCGT CCGTCGCATC GGAAAACAAG ACGCTGGCTT TCCCCGCCGT TGCCGATACC ATCCCGACCA AGAGCAGCCA GGAAGACCAT GTTTCGATGG CGACATGGGC ATGCCGCAAG ACGCGTCAGG TGGTCGACAA CATGCCGAAG ATCCTTGGTG TCGAATGCCT GCTTGCGGCC CGCGCCATCT TCCTGACCGA AGAGGCACTC GGCGGCTACA AGCTCGGGAC CGGCAGCCAG GCGCTCTATG ACGCGCTTCG CGACGCGATC CCGTTCCAGC AGGAGGACAG CTACATGCCC AAGCAGACCA CACCGGCTCT CGAGATCGTG CGGTCCGGCG CATTTCTCGA GACCATCGAG AACAAGATCG GCGCCCTGAA ATAG
|
Protein sequence | MNAIILDGDS LTIKDTVRIA RQGAKVALAD AARAEIIKVR NYIEENWLTE NAPPTYGFNT GVGKLKDYAI NQADNDRFQR NIVLSHCSGI GEPASEEIVR AMMAVRINAF CLGVSGLRIE VVDRLVEMLN RGVHPVVPIQ GSVGASGDLA PLAHMVSVLI GYEEAEAYYQ GERMPAPQAL EKAGIFPIAF DLKAKDCLAL INGNSLCAAM AVLNLHDAEM LMKTADAAGA LSLEAIRGEQ AAFDPRIHLV RKQPGQIATA ENIRRIIEGS RRTTEAARAV RLEDDILHPK HTARIQDQYS FRCLPQVHGS CRDQLEHAKE LITRELNAAT DNPLVFWNEL GALEFLSGGN FHCEPIAFAM DLLTIALVEI GNISERRLFS LCDTTLNYGL PPNLAGKPIG LNYGYGIIST AAASVASENK TLAFPAVADT IPTKSSQEDH VSMATWACRK TRQVVDNMPK ILGVECLLAA RAIFLTEEAL GGYKLGTGSQ ALYDALRDAI PFQQEDSYMP KQTTPALEIV RSGAFLETIE NKIGALK
|
| |