Gene EcHS_A2160 details

Gene Information

Locus tag: EcHS_A2160
Symbol: hisC
ID: 5594693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2138253
End bp: 2139323
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 55%
IMG OID: 640921293
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001458832
Protein GI: 157161514
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
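
The start, end, gene length, and protein length fields in this record are related by simple arithmetic, which the minimal Python sketch below checks. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2138253
end_bp = 2139323

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1071 356
```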


Plasmid Coverage information

Num covering plasmid clones: 62
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCG TGACTATCAC CGATTTAGCG CGTGAAAACG TCCGCAACCT GACGCCGTAT 
CAGTCGGCGC GTCGTCTGGG CGGTAACGGC GACGTCTGGC TGAACGCCAA CGAATACCCC
ACAGCCGTGG AGTTTCAGCT TACTCAGCAA ACGCTCAACC GCTACCCGGA ATGTCAGCCG
AAAGCGGTGA TTGAAAATTA CGCGCAGTAT GCAGGCGTAA AACCGGAGCA GGTGCTGGTC
AGCCGTGGCG CGGACGAAGG TATTGAACTA CTGATTCGCG CTTTTTGCGA ACCAGGTAAA
GACGCCATCC TCTACTGCCC GCCAACGTAC GGCATGTACA GCGTCAGCGC TGAAACCATT
GGCGTCGAGT GCCGCACAGT GCCGACGCTG GACAACTGGC AACTGGACTT ACAGGGCATT
TCCGACAAGC TGGACGGCGT AAAAGTGGTC TATGTTTGCA GCCCCAACAA CCCGACCGGG
CAACTGATCA ACCCGCAGGA TTTTCGCACT CTGCTGGAGT TAACCCGCGG TAAAGCGATT
GTGGTTGCCG ATGAAGCCTA TATCGAGTTT TGCCCGCAGG CATCGTTGGC TGGCTGGCTG
GCGGAATATC CGCACCTGGC TATTTTGCGC ACACTGTCGA AAGCTTTTGC TCTGGCGGGC
CTTCGTTGCG GATTTACGCT GGCAAACGAA GAAGTCATCA ACCTGCTGAT GAAAGTGATC
GCCCCCTACC CGCTCTCGAC GCCGGTTGCC GACATTGCGG CCCAGGCGTT AAGCCCGCAG
GGGATCGTCG CCATGCGCGA ACGGGTGGCG CAAATTATTG CAGAGCGCGA ATACCTGATT
GCCGCACTGA AAAAAATCTC CTGCGTGGAG CAGGTTTTTG ACTCTGAAAC CAACTACATT
CTGGCGCGCT TTAAAGCCTC CAGCGCAGTG TTTAAATCTT TGTGGGATCA GGGCATTATC
TTACGTGATC AGAATAAACA ACCCTCTTTA AGCGGCTGCC TGCGAATTAC CGTCGGAACC
CGTGAAGAAA GCCAGCGCGT CATTGACGCC TTACGTGCGG AGCAAGTTTA A
 
Protein sequence
MSTVTITDLA RENVRNLTPY QSARRLGGNG DVWLNANEYP TAVEFQLTQQ TLNRYPECQP 
KAVIENYAQY AGVKPEQVLV SRGADEGIEL LIRAFCEPGK DAILYCPPTY GMYSVSAETI
GVECRTVPTL DNWQLDLQGI SDKLDGVKVV YVCSPNNPTG QLINPQDFRT LLELTRGKAI
VVADEAYIEF CPQASLAGWL AEYPHLAILR TLSKAFALAG LRCGFTLANE EVINLLMKVI
APYPLSTPVA DIAAQALSPQ GIVAMRERVA QIIAEREYLI AALKKISCVE QVFDSETNYI
LARFKASSAV FKSLWDQGII LRDQNKQPSL SGCLRITVGT REESQRVIDA LRAEQV
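
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The Python sketch below shows one way to strip the spacing and translate the DNA into protein. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two are understood to differ in permitted start codons, not in amino acid assignments), and the helper names `clean` and `translate` are hypothetical. Only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record (60 bases).
first_line = "ATGAGCACCG TGACTATCAC CGATTTAGCG CGTGAAAACG TCCGCAACCT GACGCCGTAT"
print(translate(clean(first_line)))  # MSTVTITDLARENVRNLTPY
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence through the same two helpers should give a string that can be compared against the cleaned protein sequence.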