Gene Information |
Locus tag | SeD_A0886 |
Symbol | hutH |
ID | 6871050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 880837 |
End bp | 882357 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642784081 |
Product | histidine ammonia-lyase |
Protein accession | YP_002214756 |
Protein GI | 198242428 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
| |
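The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 880837
end_bp = 882357


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1521
print(protein_length(length_bp))  # 506
```

Both printed values agree with the Gene Length and Protein Length fields of the record.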
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCA TGACACTCAC TCCGGGGCAG TTAAGCCTCT CTCAACTGTA CGATGTCTGG CGTCATCCGG TACAGCTTCG GCTGGACGCC AGCGCCATTG ACGGCATTAA CGCCAGCGTC GCGTGCGTCA ATGATATCGT TGCCGAAGGG CGTACCGCCT ACGGCATCAA CACCGGTTTC GGCCTGCTGG CGCAGACGCG CATCGCTGAT GAAGACTTGC AAAATCTACA GCGATCGTTG GTGCTGTCTC ACGCCGCGGG CGTAGGCGAC CCACTGGATG ACGCGATGGT GCGTCTGATC ATGGTATTGA AAATCAATAG CCTGGCGCGC GGGTTTTCCG GAATTCGGTT GAGCGTGATA GAAGCGCTCA TCGCTCTGGT GAATGCCGGC GTGTATCCGC TGATTCCGGC AAAAGGCTCG GTTGGCGCGT CGGGCGACTT AGCGCCGCTG GCGCATCTGT CGCTGACGCT GCTTGGCGAA GGAAAAGCGC GCTGGCAGGG CGAATGGCTT CCGGCGCAGA CGGCGCTAAA AAAGGCCGGA CTGGAGCCTG TCGCGCTGGC GGCCAAAGAG GGGCTGGCGC TGCTTAATGG CACCCAGGCA TCAACGGCCT TCGCGCTGCG CGGTTTGTTT GAGGCACAGG AGTTATTTGC CTCCGCCGTT GTCTGCGGCG CGCTGACTAC CGAAGCCGTA CTGGGATCGC GCCGTCCTTT TGATGCGCGC ATTCATGCCG CGCGCGGACA ACAGGGGCAA ATTGACGTCG CGCGGTTGTT CCGGCACCTG CTGACCGATA CCAGCGCCAT TGCTGAATCA CATCACCACT GTCATAAAGT TCAGGACCCG TATTCCCTTC GCTGCCAGCC GCAGGTAATG GGCGCTTGTC TGACCCAGTT ACGTCAGACG AAAGAGGTAC TGCTGGCTGA GGCCAACGCG GTATCCGATA ATCCGCTGGT CTTTGCCGAT GCGGGGGAGG TGATCTCCGG CGGTAACTTC CATGCTGAAC CGGTTGCCAT GGCGGCGGAT AATTTGGCGC TGGCGATAGC GGAAATCGGC GCGCTATCGG AGCGGCGCAT CGCGCTGATG ATGGATAAAC ATATGTCGCA GTTACCGCCG TTCCTGGTGA AAAATGGCGG CGTTAATTCC GGTTTTATGA TTGCCCAGGT CACCGCCGCG GCGCTCGCCA GCGAGAATAA AGCGCTGGCG CACCCGCACA GCGTGGATAG CCTGCCGACC TCGGCGAATC AGGAAGATCA TGTCTCAATG GCCCCGGCGG CAGGGCGGCG ACTCTGGGAA ATGGCGGCGA ATACCCGCGG CATCATTGCC GTGGAGTGGC TGGCGGCCTG TCAGGGGATA GATTTACGGG AAGGGTTAAC CTCCAGCCCG TTACTGGAAC AGGCGCGGCA GACACTGCGC GAGCAGGTGG CGCACTATAC GCAGGATCGT TTTTTCGCGC CCGATATTGA GTGTGCGACG GCGCTGCTGG CGCAAGGCGC GTTACAGCGT CTGGTGCCGG ACTTTATGTG A
|
Protein sequence | MNTMTLTPGQ LSLSQLYDVW RHPVQLRLDA SAIDGINASV ACVNDIVAEG RTAYGINTGF GLLAQTRIAD EDLQNLQRSL VLSHAAGVGD PLDDAMVRLI MVLKINSLAR GFSGIRLSVI EALIALVNAG VYPLIPAKGS VGASGDLAPL AHLSLTLLGE GKARWQGEWL PAQTALKKAG LEPVALAAKE GLALLNGTQA STAFALRGLF EAQELFASAV VCGALTTEAV LGSRRPFDAR IHAARGQQGQ IDVARLFRHL LTDTSAIAES HHHCHKVQDP YSLRCQPQVM GACLTQLRQT KEVLLAEANA VSDNPLVFAD AGEVISGGNF HAEPVAMAAD NLALAIAEIG ALSERRIALM MDKHMSQLPP FLVKNGGVNS GFMIAQVTAA ALASENKALA HPHSVDSLPT SANQEDHVSM APAAGRRLWE MAANTRGIIA VEWLAACQGI DLREGLTSSP LLEQARQTLR EQVAHYTQDR FFAPDIECAT ALLAQGALQR LVPDFM
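The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped):
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence above, used only as a short demo.
demo = clean_sequence("ATGAACACCA TGACACTCAC TCCGGGGCAG")
print(len(demo))  # 30
print(round(gc_percent(demo), 1))
```

Passing the full Gene sequence value through `clean_sequence` and then `gc_percent` would give a figure to compare against the rounded percentage in the record.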