Gene ECH74115_2954 details


Gene Information

Locus tag: ECH74115_2954
Symbol: hisC
ID: 6968328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2729298
End bp: 2730368
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 55%
IMG OID: 643386794
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002271262
Protein GI: 209396323
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
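
The start, end, gene length and protein length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2729298
end_bp = 2730368

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1071 356
```

Under these assumptions the computed values agree with the reported gene length of 1071 bp and protein length of 356 aa.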


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0000000139891
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGAGCACCG TGACTATTAC CGATTTAGCG CGTGAAAACG TCCGCAACCT GACGCCGTAT 
CAGTCGGCGC GTCGTCTGGG CGGTAATGGC GACGTCTGGC TGAACGCCAA CGAATATCCC
ACAGCCGTGG AGTTTCAGCT TACTCAGCAA ACGCTCAACC GCTACCCGGA ATGCCAGCCG
AAAGCGGTGA TTGAAAATTA CGCGCAATAT GCAGGCGTAA AACCGGAACA GGTGCTGGTC
AGCCGTGGCG CGGACGAAGG TATTGAACTA CTGATTCGCG CTTTTTGCGA ACCGGGTAAA
GACGCCATCC TCTACTGCCC GCCAACGTAC GGCATGTACA GCGTCAGCGC TGAAACCATT
GGCGTCGAGT GCCGCACAGT GCCGACGCTG GAAAACTGGC AACTGGACTT ACAGGGCATT
TCCGACAAGC TGGACGGCGT AAAAGTGGTC TATGTTTGCA GCCCCAACAA CCCAACCGGG
CAACTGATCA ATCCGCAGGA TTTTCGCACC CTGCTGGAGT TAACGCGCGG TAAGGCGATT
GTGGTTGCCG ATGAAGCCTA TATCGAGTTT TGCCCGCAGG CATCGCTGGC TGGCTGGCTG
GCGGAATATC CGCACCTGGC TATTTTGCGC ACACTGTCGA AAGCTTTTGC TCTGGCGGGC
CTTCGTTGCG GATTTACGCT GGCAAACGAA GAAGTCATCA ACCTGCTGAT GAAAGTGATT
GCCCCCTACC CGCTCTCGAC GCCGGTTGCC GACATTGCGG CCCAGGCGTT AAGCCCGCAG
GGAATTATCG CCATGCGCGA ACGGGTGGCG CAAATTATTG CAGAACGCGA ATACCTGATT
GCCGCACTGA AAGAGATCCC CTGCGTGGAG CAGGTTTTTG ACTCTGAAAC CAACTACATT
CTGGCGCGCT TTAAAGCCTC CAGTGCAGTG TTTAAATCTT TGTGGGATCA GGGCATTATC
TTACGTGATC AGAATAAACA ACCCTCTTTA AGCGGCTGCC TGCGAATTAC CGTCGGAACC
CGTGAAGAAA GCCAGCGCGT CATTGACGCC TTACGTGCGG AGCAAGTTTG A
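
The record reports a GC content of 55% for this gene. A figure of that kind could be recomputed from the sequence rows along the lines of the sketch below. The function name `gc_content` is hypothetical, and the sketch assumes the sequence contains only A, C, G and T separated by whitespace, as in the rows above. Only the first row is used here for brevity, so the printed value describes that row rather than the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "ATGAGCACCG TGACTATTAC CGATTTAGCG CGTGAAAACG TCCGCAACCT GACGCCGTAT"
print(round(gc_content(first_row), 1))
```

Passing all rows of the gene sequence joined together would give the value to compare against the reported 55%.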
 
Protein sequence
MSTVTITDLA RENVRNLTPY QSARRLGGNG DVWLNANEYP TAVEFQLTQQ TLNRYPECQP 
KAVIENYAQY AGVKPEQVLV SRGADEGIEL LIRAFCEPGK DAILYCPPTY GMYSVSAETI
GVECRTVPTL ENWQLDLQGI SDKLDGVKVV YVCSPNNPTG QLINPQDFRT LLELTRGKAI
VVADEAYIEF CPQASLAGWL AEYPHLAILR TLSKAFALAG LRCGFTLANE EVINLLMKVI
APYPLSTPVA DIAAQALSPQ GIIAMRERVA QIIAEREYLI AALKEIPCVE QVFDSETNYI
LARFKASSAV FKSLWDQGII LRDQNKQPSL SGCLRITVGT REESQRVIDA LRAEQV
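
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first 30 bases. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted, not in the assignments). The names `CODON_TABLE` and `translate` are illustrative, and the function assumes an in-frame sequence of A, C, G and T.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCACCG TGACTATTAC CGATTTAGCG"))  # prints: MSTVTITDLA
```

The result matches the first ten residues of the protein sequence, MSTVTITDLA.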