Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0827 |
Symbol | hutH |
ID | 6794060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 820313 |
End bp | 821833 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642775104 |
Product | histidine ammonia-lyase |
Protein accession | YP_002145747 |
Protein GI | 197251912 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.294508 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCA TGACACTCAC TCCGGGGCAG TTAAGCCTCT CTCAACTGTA CGATGTCTGG CGTCATCCGG TACAGCTTCG GCTGGACGCC AACGCCATTG ACGGCATTAA CGCCAGCGTC GCGTGCGTCA ATGATATCGT TGCCGAAGGG CGTACCGCCT ACGGCATCAA CACCGGTTTC GGCCTGCTGG CGCAGACGCG CATCGCTGAT GAAGACTTGC AAAATCTACA GCGATCGTTG GTGCTGTCTC ACGCCGCGGG CGTAGGCGAC CCACTGGATG ACGCGATGGT GCGTCTGATC ATGGTATTGA AAATCAATAG TCTGGCGCGC GGGTTTTCCG GGATTCGGTT GAGCGTGATA GAGGCGCTCA TCGCTCTGGT GAATGCCGGC GTGTATCCGC TGATTCTGGC AAAAGGCTCG GTTGGCGCGT CGGGCGACTT AGCGCCGCTG GCGCATCTGT CGCTGACGCT GCTTGGCGAA GGAAAAGCGC GCTGGCAGGG TGAATGGCTT CCGGCGCAGA CGGCGCTAAA AAAGGCCGGA CTGGAGCCTG TCGCGCTGGC GGCCAAAGAG GGGCTGGCGC TGCTTAACGG CACCCAGGCG TCGACGGCCT TCGCGCTGCG CGGTTTGTTT GAGGCACAGG AGTTATTTGC CTCCGCCGTC GTCTGCGGCG CGCTGACCAC CGAAGCCGTA CTGGGATCGC GCCGTCCTTT TGATGCGCGC ATTCATGCCG CGCGCGGACA ACAGGGGCAA ATTGACGTCG CGCGGTTGTT CCGGCATCTG CTGACCGATA CCAGTGCTAT TGCTGAATCA CATCACCACT GTCATAAAGT TCAGGACCCG TATTCCCTTC GCTGCCAGCC GCAGGTAATG GGCGCTTGTC TGACCCAGTT ACGTCAGACG AAAGAGGTAC TGCTGGCTGA GGCCAACGCG GTGTCCGATA ATCCGCTGGT CTTTGCCGAT GCGGGGGAGG TGATCTCCGG CGGTAACTTC CATGCTGAAC CGGTTGCCAT GGCGGCGGAT AATCTGGCGC TGGCGATAGC GGAAATCGGT GCGCTATCGG AGCGGCGCAT CGCGCTGATG ATGGATAAAC ATATGTCGCA GTTACCGCCG TTCCTGGTAA AAAATGGCGG CGTTAATTCC GGTTTTATGA TTGCCCAGGT CACCGCCGCG GCGCTCGCCA GCGAGAATAA AGCGCTGGCG CACCCGCACA GCGTGGATAG CCTGCCGACC TCGGCGAATC AGGAAGATCA TGTCTCAATG GCCCCGGCGG CAGGACGGCG ACTCTGGGAA ATGGCGGCGA ATACCCGCGG CATCATTGCC GTGGAGTGGC TGGCTGCCTG TCAGGGGATA GATTTACGGG AAGGGTTAAC CTCCAGCCCG TTACTGGAAC AGGCGCGGCA GACACTGCGC GAGCAGGTGG CGCACTATAC GCAGGATCGT TTTTTCGCGC CCGATATTGA GTGTGCGACG GCGCTGCTGG CGCAAGGCGC GTTACAGCGT CTGGTGCCGG ACTTTATGTG A
|
Protein sequence | MNTMTLTPGQ LSLSQLYDVW RHPVQLRLDA NAIDGINASV ACVNDIVAEG RTAYGINTGF GLLAQTRIAD EDLQNLQRSL VLSHAAGVGD PLDDAMVRLI MVLKINSLAR GFSGIRLSVI EALIALVNAG VYPLILAKGS VGASGDLAPL AHLSLTLLGE GKARWQGEWL PAQTALKKAG LEPVALAAKE GLALLNGTQA STAFALRGLF EAQELFASAV VCGALTTEAV LGSRRPFDAR IHAARGQQGQ IDVARLFRHL LTDTSAIAES HHHCHKVQDP YSLRCQPQVM GACLTQLRQT KEVLLAEANA VSDNPLVFAD AGEVISGGNF HAEPVAMAAD NLALAIAEIG ALSERRIALM MDKHMSQLPP FLVKNGGVNS GFMIAQVTAA ALASENKALA HPHSVDSLPT SANQEDHVSM APAAGRRLWE MAANTRGIIA VEWLAACQGI DLREGLTSSP LLEQARQTLR EQVAHYTQDR FFAPDIECAT ALLAQGALQR LVPDFM
|
| |