Gene SeD_A0886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0886 
SymbolhutH 
ID6871050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp880837 
End bp882357 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content60% 
IMG OID642784081 
Producthistidine ammonia-lyase 
Protein accessionYP_002214756 
Protein GI198242428 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCA TGACACTCAC TCCGGGGCAG TTAAGCCTCT CTCAACTGTA CGATGTCTGG 
CGTCATCCGG TACAGCTTCG GCTGGACGCC AGCGCCATTG ACGGCATTAA CGCCAGCGTC
GCGTGCGTCA ATGATATCGT TGCCGAAGGG CGTACCGCCT ACGGCATCAA CACCGGTTTC
GGCCTGCTGG CGCAGACGCG CATCGCTGAT GAAGACTTGC AAAATCTACA GCGATCGTTG
GTGCTGTCTC ACGCCGCGGG CGTAGGCGAC CCACTGGATG ACGCGATGGT GCGTCTGATC
ATGGTATTGA AAATCAATAG CCTGGCGCGC GGGTTTTCCG GAATTCGGTT GAGCGTGATA
GAAGCGCTCA TCGCTCTGGT GAATGCCGGC GTGTATCCGC TGATTCCGGC AAAAGGCTCG
GTTGGCGCGT CGGGCGACTT AGCGCCGCTG GCGCATCTGT CGCTGACGCT GCTTGGCGAA
GGAAAAGCGC GCTGGCAGGG CGAATGGCTT CCGGCGCAGA CGGCGCTAAA AAAGGCCGGA
CTGGAGCCTG TCGCGCTGGC GGCCAAAGAG GGGCTGGCGC TGCTTAATGG CACCCAGGCA
TCAACGGCCT TCGCGCTGCG CGGTTTGTTT GAGGCACAGG AGTTATTTGC CTCCGCCGTT
GTCTGCGGCG CGCTGACTAC CGAAGCCGTA CTGGGATCGC GCCGTCCTTT TGATGCGCGC
ATTCATGCCG CGCGCGGACA ACAGGGGCAA ATTGACGTCG CGCGGTTGTT CCGGCACCTG
CTGACCGATA CCAGCGCCAT TGCTGAATCA CATCACCACT GTCATAAAGT TCAGGACCCG
TATTCCCTTC GCTGCCAGCC GCAGGTAATG GGCGCTTGTC TGACCCAGTT ACGTCAGACG
AAAGAGGTAC TGCTGGCTGA GGCCAACGCG GTATCCGATA ATCCGCTGGT CTTTGCCGAT
GCGGGGGAGG TGATCTCCGG CGGTAACTTC CATGCTGAAC CGGTTGCCAT GGCGGCGGAT
AATTTGGCGC TGGCGATAGC GGAAATCGGC GCGCTATCGG AGCGGCGCAT CGCGCTGATG
ATGGATAAAC ATATGTCGCA GTTACCGCCG TTCCTGGTGA AAAATGGCGG CGTTAATTCC
GGTTTTATGA TTGCCCAGGT CACCGCCGCG GCGCTCGCCA GCGAGAATAA AGCGCTGGCG
CACCCGCACA GCGTGGATAG CCTGCCGACC TCGGCGAATC AGGAAGATCA TGTCTCAATG
GCCCCGGCGG CAGGGCGGCG ACTCTGGGAA ATGGCGGCGA ATACCCGCGG CATCATTGCC
GTGGAGTGGC TGGCGGCCTG TCAGGGGATA GATTTACGGG AAGGGTTAAC CTCCAGCCCG
TTACTGGAAC AGGCGCGGCA GACACTGCGC GAGCAGGTGG CGCACTATAC GCAGGATCGT
TTTTTCGCGC CCGATATTGA GTGTGCGACG GCGCTGCTGG CGCAAGGCGC GTTACAGCGT
CTGGTGCCGG ACTTTATGTG A
 
Protein sequence
MNTMTLTPGQ LSLSQLYDVW RHPVQLRLDA SAIDGINASV ACVNDIVAEG RTAYGINTGF 
GLLAQTRIAD EDLQNLQRSL VLSHAAGVGD PLDDAMVRLI MVLKINSLAR GFSGIRLSVI
EALIALVNAG VYPLIPAKGS VGASGDLAPL AHLSLTLLGE GKARWQGEWL PAQTALKKAG
LEPVALAAKE GLALLNGTQA STAFALRGLF EAQELFASAV VCGALTTEAV LGSRRPFDAR
IHAARGQQGQ IDVARLFRHL LTDTSAIAES HHHCHKVQDP YSLRCQPQVM GACLTQLRQT
KEVLLAEANA VSDNPLVFAD AGEVISGGNF HAEPVAMAAD NLALAIAEIG ALSERRIALM
MDKHMSQLPP FLVKNGGVNS GFMIAQVTAA ALASENKALA HPHSVDSLPT SANQEDHVSM
APAAGRRLWE MAANTRGIIA VEWLAACQGI DLREGLTSSP LLEQARQTLR EQVAHYTQDR
FFAPDIECAT ALLAQGALQR LVPDFM