Gene EcolC_1623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1623 
SymbolhisG 
ID6065987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1804283 
End bp1805182 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content55% 
IMG OID641601038 
ProductATP phosphoribosyltransferase 
Protein accessionYP_001724608 
Protein GI170019654 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0040] ATP phosphoribosyltransferase 
TIGRFAM ID[TIGR00070] ATP phosphoribosyltransferase
[TIGR03455] ATP phosphoribosyltransferase, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.412547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACA ACACTCGTTT ACGCATAGCT ATGCAGAAAT CCGGCCGTTT AAGTGATGAC 
TCACGCGAAT TGCTGGCGCG CTGTGGCATT AAAATCAATC TTCACACCCA GCGCCTGATC
GCGATGGCAG AAAACATGCC GATTGATATT CTGCGCGTGC GTGACGACGA CATTCCCGGT
CTGGTAATGG ACGGCGTGGT AGACCTTGGG ATTATCGGCG AAAACGTGCT GGAAGAAGAG
CTGCTTAACC GCCGCGCCCA GGGTGAAGAT CCACGCTACT TTACCCTGCG TCGTCTGGAT
TTCGGCGGCT GCCGCCTTTC GCTGGCAACG CCGGTTGATG AAGCCTGGGA CGGCCCACTC
TCCTTAAACG GTAAACGTAT CGCCACCTCT TATCCTCACC TGCTCAAGCG TTATCTCGAC
CAGAAAGGCA TCTCTTTTAA ATCCTGCTTA CTGAACGGTT CTGTTGAAGT CGCCCCGCGT
GCCGGACTTG CGGATGCGAT TTGCGATCTG GTTTCCACCG GTGCCACGCT GGAAGCTAAC
GGCCTGCGCG AAGTCGAAGT TATCTACCGC TCGAAAGCCT GCCTGATCCA GCGCGATGGC
GAAATGGAAG AATCCAAACA GCAACTGATC GACAAACTGC TGACCCGTAT TCAGGGTGTG
ATCCAGGCGC GCGAATCAAA ATACATCATG ATGCACGCAC CGACCGAACG TCTGGATGAA
GTCATCGCCC TGCTGCCAGG TGCCGAACGC CCAACTATTC TACCGCTGGC GGGTGATCAA
CAGCGCGTAG CGATGCACAT GGTCAGTAGC GAAACCCTGT TCTGGGAAAC CATGGAAAAA
CTGAAAGCGC TGGGTGCCAG TTCAATCCTG GTCCTGCCGA TTGAGAAGAT GATGGAGTGA
 
Protein sequence
MTDNTRLRIA MQKSGRLSDD SRELLARCGI KINLHTQRLI AMAENMPIDI LRVRDDDIPG 
LVMDGVVDLG IIGENVLEEE LLNRRAQGED PRYFTLRRLD FGGCRLSLAT PVDEAWDGPL
SLNGKRIATS YPHLLKRYLD QKGISFKSCL LNGSVEVAPR AGLADAICDL VSTGATLEAN
GLREVEVIYR SKACLIQRDG EMEESKQQLI DKLLTRIQGV IQARESKYIM MHAPTERLDE
VIALLPGAER PTILPLAGDQ QRVAMHMVSS ETLFWETMEK LKALGASSIL VLPIEKMME