Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5927 |
Symbol | |
ID | 6977314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 345297 |
End bp | 346832 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643393380 |
Product | histidine ammonia-lyase |
Protein accession | YP_002278198 |
Protein GI | 209546308 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.14133 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTA CGCTCCATCC GGGCTCCGTC TCGCTCAAGG ATCTCGAGAC CATCTATTGG ACCGGCGCGC CGGCCAGGCT CGATCCCGCC TTCGATGCCG GCATCGCCAA GGCCGCCGCC CGCATCGCCG AGATCGCCGC CGCCAATGCG CCGGTTTACG GCATCAATAC CGGCTTCGGC AAACTCGCCT CGATCAAGAT CGACAGCGCC GACGTCGCCA CGCTGCAGCG CAATCTCATC CTGTCGCATT GCTGCGGCAT CGGCGCGCCG CTGCCGGAAA ATATCGTCCG GCTGATCATG GCGCTGAAGC TGGTTTCGCT CGGGCGTGGC GCCTCCGGTG TGCGGCTGGA GCTGGTGCGG CTGATCGAAG GCATGCTGGA TAAGGGCGTC ATTCCGCTGA TCCCGGAAAA GGGCTCGGTC GGCGCCTCCG GCGATCTTGC CCCGCTTGCC CATATGGCCG CCGTGATGAT GGGCGAGGCC GAAGCCTTCT TCGCCGGCGA ACGTCTGACT GGTGCCGAAG CCCTGGAAAG GGCCGGGCTG AAACCGGTCG TGCTTGCCGC CAAGGAGGGT CTGGCGCTGA TCAACGGCAC CCAGACCTCG ACGGCCCTGG CGCTTGCCGG CCTCTTCCGC GCCCATCGCG CCGCACAGGC GGCTCTCATC ACCGGCGCCA TGTCCACAGA TGCCGCCATG GGCTCGTCGG CGCCTTTCCA TCCGGATATT CATACGCTCC GCGGCCACAA GGGCCAGATC GACACGGCAT CCGCGCTGCG CGCCCTGCTC GAACAATCGG TCATCCGCCA GAGCCATATC GAAGGCGATG AGCGCGTTCA GGATCCCTAC TGCATCCGCT GCCAGCCGCA GGTCGACGGC GCTTGCCTCG ATATCTTGCG CTCGGTCGCC CGCACGCTTG AAATCGAGGC CAATGCGGTC ACCGACAATC CGCTGGTGCT GTCGGACAAT TCCGTCGTCT CCGGCGGCAA TTTCCACGCC GAACCCGTCG CCTTCGCCGC CGACCAGATC GCCCTTGCCG TCTGCGAGAT CGGCGCGATT TCCCAGCGCC GCATCGCGCT GCTGGTCGAC CCGGTGCTTT CCTACGGCCT GCCGGCCTTC CTCGCCAAGA AGCCGGGCCT GAACTCCGGC CTGATGATCG CCGAGGTCAC CTCGGCGGCG CTGATGTCGG AGAACAAGCA GATGTCGCAC CCGGCCTCGG TGGATTCGAC CCCGACGTCG GCGAACCAGG AAGACCATGT CTCGATGGCC TGCCACGGCG CCCGCCGCCT GCTCGGCATG ACCGAGAACC TGTTCGGCAT CATCGGCATC GAGGCGCTGA CCGCTGCCCA GGGCGTCGAA CTGCGCGCGC CTTTGTCGAC CAGCCCGGAG CTTTTGAAGG CGATCGCCGC AATCCGCAGC AAGGTGCCGA GCCTCGACAG CGACCGCTAT ATGGCGGGCG ATCTCGCCGC CGCCGCCGAA CTGGTCGCGA CGGGCGCCCT GAACGCCGCC GTTTCCTCGG GCATTCTGCC GGTTTTGGAG GGCTGA
|
Protein sequence | MTITLHPGSV SLKDLETIYW TGAPARLDPA FDAGIAKAAA RIAEIAAANA PVYGINTGFG KLASIKIDSA DVATLQRNLI LSHCCGIGAP LPENIVRLIM ALKLVSLGRG ASGVRLELVR LIEGMLDKGV IPLIPEKGSV GASGDLAPLA HMAAVMMGEA EAFFAGERLT GAEALERAGL KPVVLAAKEG LALINGTQTS TALALAGLFR AHRAAQAALI TGAMSTDAAM GSSAPFHPDI HTLRGHKGQI DTASALRALL EQSVIRQSHI EGDERVQDPY CIRCQPQVDG ACLDILRSVA RTLEIEANAV TDNPLVLSDN SVVSGGNFHA EPVAFAADQI ALAVCEIGAI SQRRIALLVD PVLSYGLPAF LAKKPGLNSG LMIAEVTSAA LMSENKQMSH PASVDSTPTS ANQEDHVSMA CHGARRLLGM TENLFGIIGI EALTAAQGVE LRAPLSTSPE LLKAIAAIRS KVPSLDSDRY MAGDLAAAAE LVATGALNAA VSSGILPVLE G
|
| |