Gene ECH74115_2952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2952 
SymbolhisG 
ID6969917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2727092 
End bp2727991 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content55% 
IMG OID643386792 
ProductATP phosphoribosyltransferase 
Protein accessionYP_002271260 
Protein GI209396922 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0040] ATP phosphoribosyltransferase 
TIGRFAM ID[TIGR00070] ATP phosphoribosyltransferase
[TIGR03455] ATP phosphoribosyltransferase, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.837237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00000000491094 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGACA ACACTCGTTT ACGCATAGCT ATGCAGAAAT CCGGCCGTTT AAGTGATGAC 
TCTCGCGAAT TGCTGGCGCG CTGTGGCATT AAAATCAATC TTCACACCCA GCGCCTGATC
GCGATGGCAG AAAACATGCC GATTGATATT CTGCGCGTGC GTGACGACGA CATTCCCGGT
CTGGTAATGG ACGGCGTGGT AGATCTTGGG ATTATCGGCG AAAACGTGCT GGAAGAAGAG
CTGCTTAACC GCCGCGCCCA GGGTGAAGAT CCACGCTACT TTACCCTGCG TCGTCTGGAT
TTCGGCGGCT GCCGTCTTTC GCTGGCAACG CCGGTTGATG ATGCCTGGGA CGGCCCGCTC
TCCTTAAACG GTAAACGTAT CGCCACCTCT TATCCTCACC TGCTCAAGCG TTATCTCGAC
CAGAAAGGCA TCTCTTTTAA ATCCTGCTTA CTGAACGGTT CTGTTGAAGT CGCCCCGCGT
GCCGGACTGG CGGATGCGAT TTGCGATCTG GTTTCCACCG GTGCCACGCT GGAAGCAAAC
GGCCTGCGCG AAGTCGAAGT TATCTATCGC TCGAAAGCCT GCCTGATACA ACGCGATGGC
GAAATGGAAG AATCCAAACA GCAACTGATC GACAAGCTGC TGACCCGTAT TCAGGGTGTG
ATCCAGGCGC GCGAATCAAA ATACATCATG ATGCACGCAC CGACCCAACG TCTGGATGAA
GTCATCGCCC TGCTGCCAGG TGCCGAACGC CCAACCATTC TGCCGCTGGC GGGTGATCAA
CAGCGCGTAG CGATGCACAT GGTCAGTAGC GAAACCCTGT TCTGGGAAAC GATGGAAAAA
CTGAAAGCGC TGGGTGCCAG TTCAATCCTG GTCCTGCCGA TTGAGAAGAT GATGGAGTGA
 
Protein sequence
MTDNTRLRIA MQKSGRLSDD SRELLARCGI KINLHTQRLI AMAENMPIDI LRVRDDDIPG 
LVMDGVVDLG IIGENVLEEE LLNRRAQGED PRYFTLRRLD FGGCRLSLAT PVDDAWDGPL
SLNGKRIATS YPHLLKRYLD QKGISFKSCL LNGSVEVAPR AGLADAICDL VSTGATLEAN
GLREVEVIYR SKACLIQRDG EMEESKQQLI DKLLTRIQGV IQARESKYIM MHAPTQRLDE
VIALLPGAER PTILPLAGDQ QRVAMHMVSS ETLFWETMEK LKALGASSIL VLPIEKMME