Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2158 |
Symbol | hisG |
ID | 5594906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2136047 |
End bp | 2136946 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921291 |
Product | ATP phosphoribosyltransferase |
Protein accession | YP_001458830 |
Protein GI | 157161512 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0040] ATP phosphoribosyltransferase |
TIGRFAM ID | [TIGR00070] ATP phosphoribosyltransferase [TIGR03455] ATP phosphoribosyltransferase, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 71 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGACA ACACTCGTTT ACGCATAGCT ATGCAGAAAT CCGGCCGTTT AAGTGATGAC TCACGTGAAT TGCTGGCGCG CTGTGGCATT AAAATCAATC TTCACACCCA GCGCCTGATC GCGATGGCAG AAAACATGCC GATTGATATT CTGCGCGTGC GTGACGACGA CATTCCCGGT CTGGTGATGG ACGGCGTGGT AGACCTTGGG ATTATCGGCG AAAACGTGCT GGAAGAAGAA CTGCTTAACC GCCGCGCCCA GGGTGAGGAT CCACGCTACT TTACCCTGCG TCGTCTGGAT TTCGGCGGCT GTCGCCTTTC GCTGGCAACG CCGGTTGATG AAGCCTGGGA CGGTCCGCTC TCCTTAAACG GTAAACGTAT CGCCACCTCT TATCCTCACC TGCTCAAGCG TTATCTCGAC CAGAAAGGTA TCTCTTTTAA ATCCTGCTTA CTGAACGGTT CTGTTGAAGT TGCGCCGCGT GCCGGACTGG CGGATGCGAT TTGCGATTTG GTTTCCACCG GTGCCACGCT GGAAGCTAAC GGCCTGCGCG AAGTCGAAGT TATCTACCGC TCGAAAGCCT GCCTTATCCA GCGCGATGGT GAAATGGAAG AATCCAAACA GCAGCTGATC GACAAACTGC TTACCCGTAT TCAGGGTGTG ATCCAGGCGC GCGAATCAAA ATACATCATG ATGCACGCAC CGACCGAACG CCTGGATGAA GTCATCGCCC TGCTGCCAGG TGCCGAACGC CCAACTATTC TACCGCTGGC GGGTGATCAA CAACGCGTAG CGATGCACAT GGTCAGTAGC GAAACCCTGT TCTGGGAAAC GATGGAAAAA CTGAAAGCGC TGGGTGCCAG TTCAATCCTG GTCCTGCCGA TTGAGAAGAT GATGGAGTGA
|
Protein sequence | MTDNTRLRIA MQKSGRLSDD SRELLARCGI KINLHTQRLI AMAENMPIDI LRVRDDDIPG LVMDGVVDLG IIGENVLEEE LLNRRAQGED PRYFTLRRLD FGGCRLSLAT PVDEAWDGPL SLNGKRIATS YPHLLKRYLD QKGISFKSCL LNGSVEVAPR AGLADAICDL VSTGATLEAN GLREVEVIYR SKACLIQRDG EMEESKQQLI DKLLTRIQGV IQARESKYIM MHAPTERLDE VIALLPGAER PTILPLAGDQ QRVAMHMVSS ETLFWETMEK LKALGASSIL VLPIEKMME
|
| |