Gene SNSL254_A2252 details


Gene Information

Locus tag: SNSL254_A2252
Symbol: hisC
ID: 6482338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2160620
End bp: 2161699
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 56%
IMG OID: 642737599
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002041341
Protein GI: 194444826
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
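
The start, end, gene length and protein length fields above are arithmetically related. A minimal sketch of that relationship follows, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length from coordinates (assumed 1-based, inclusive)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length):
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2160620, 2161699)
print(length)                           # 1080
print(expected_protein_length(length))  # 359
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.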


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.124333
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACTG AAAACACTCT CAGCGTCGCT GACTTAGCCC GTGAAAATGT CCGCAACCTG 
GTACCGTATC AGTCCGCCCG CCGTCTGGGC GGTAATGGCG ATGTCTGGCT GAACGCGAAT
GAATTCCCGA CAGCGGTGGA GTTTCAGCTC ACCCAACAAA CGCTTAACCG CTACCCGGAA
TGCCAGCCAA AGGCCGTGAT TGAAAACTAC GCGCAATATG CTGGCGTAAA GCCGGAGCAG
GTGCTGGTCA GCCGCGGCGC GGATGAAGGG ATCGAACTGG TGATCCGCGC CTTCTGCGAA
CCGGGGAAAG ACGCCATTCT CTACTGCCCG CCCACTTACG GTATGTACAG CGTCAGCGCC
GAAACCATTG GCGTAGAGCG CCGGACGGTT CCCGCGCTTG AAAACTGGCA GCTGGATCTA
CAGGGGATTT CCGACAACCT TGACGGCGCA AAAGTGGTGT TCGTTTGTAG CCCCAATAAC
CCCACCGGGC AACTTATCAA CCCGCAGGAT CTACGCACGC TGCTGGAGTT GACACGCGGT
AAAGCGATAG TCGTCGCCGA CGAAGCTTAT ATTGAGTTTT GCCCGCAGGC CACGCTGACA
GGCTGGCTGG TTGAATATCC TCATCTGGTT ATCCTGCGCA CATTGTCGAA AGCTTTTGCG
CTGGCGGGTC TGCGCTGCGG CTTTACGCTG GCTAATGAAG AGGTGATCAA CCTGCTGTTA
AAAGTGATCG CCCCTTATCC GCTTTCTACG CCAGTGGCGG ATATCGCCGC CCAGGCGCTG
AGCCCGCAGG GAATAAACGC AATGCGCGAT CGCGTGGCGC AGACAGTGCA GGAACGTCAG
TATCTGGTGA ATGCCCTGCA ACAGACCGCC TGCGTAGAAC ACGTCTTTGA CTCTGAAACC
AACTATATTC TGGCGCGGTT TACCGCCTCC AGCAGCGTGT TTAAATCCTT ATGGGATCAG
GGCATTATCT TACGCGATCA GAATAAACAA CCTTCTTTAA GCGGCTGCCT GCGGATTACG
GTCGGCACCC GCCAGGAAAA CCAGCGCGTC ATTGACGCCT TACGTGCGGA GCCAGTATGA
 
Protein sequence
MSTENTLSVA DLARENVRNL VPYQSARRLG GNGDVWLNAN EFPTAVEFQL TQQTLNRYPE 
CQPKAVIENY AQYAGVKPEQ VLVSRGADEG IELVIRAFCE PGKDAILYCP PTYGMYSVSA
ETIGVERRTV PALENWQLDL QGISDNLDGA KVVFVCSPNN PTGQLINPQD LRTLLELTRG
KAIVVADEAY IEFCPQATLT GWLVEYPHLV ILRTLSKAFA LAGLRCGFTL ANEEVINLLL
KVIAPYPLST PVADIAAQAL SPQGINAMRD RVAQTVQERQ YLVNALQQTA CVEHVFDSET
NYILARFTAS SSVFKSLWDQ GIILRDQNKQ PSLSGCLRIT VGTRQENQRV IDALRAEPV
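
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one way to do this, not the database's own method. It strips the spaces between 10-character blocks, computes the GC percentage, and translates the DNA until the first stop codon. The helper names are hypothetical. The codon table is the standard genetic code, used here on the assumption that translation table 11 assigns the same amino acid to every codon and differs only in the start codons it permits.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGAGCACTG AAAACACTCT CAGCGTCGCT"))  # MSTENTLSVA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) checks the whole record in the same way, and `gc_percent` on the full gene sequence can be compared with the GC content field.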