Gene EcSMS35_1041 details

Gene Information

Locus tag: EcSMS35_1041
Symbol: hisG
ID: 6143183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1062009
End bp: 1062908
Gene Length: 900 bp
Protein Length: 299 aa
Translation table: 11
GC content: 54%
IMG OID: 641615928
Product: ATP phosphoribosyltransferase
Protein accession: YP_001743120
Protein GI: 170682673
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0040] ATP phosphoribosyltransferase
TIGRFAM ID: [TIGR00070] ATP phosphoribosyltransferase
            [TIGR03455] ATP phosphoribosyltransferase, C-terminal domain
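
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1062009
end_bp = 1062908

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should consist of whole codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 900 299
```

Under these assumptions the result agrees with the listed Gene Length (900 bp) and Protein Length (299 aa).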


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.175546
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGACA ACACTCGTTT ACGCATAGCT ATGCAGAAAT CCGGCCGTTT AAGTGATGAT 
TCACGCGAAT TGCTGGCGCG CTGTGGCATT AAAATCAATC TTCACACCCA GCGCCTGATC
GCGATGGCAG AAAACATGCC GATTGATATT CTGCGCGTGC GTGACGACGA CATTCCCGGT
CTGGTAATGG ATGGCGTGGT AGACCTTGGG ATTATCGGCG AAAACGTGCT GGAAGAAGAG
CTGCTTAACC GCCGCGCCCA GGGTGAAGAT CCGCGCTACT TTACCCTGCG TCGTCTGGAT
TTTGGCGGCT GCCGACTTTC ACTGGCAACG CCGGTTGATG AAGCCTGGGA CGGCCCGCTC
TCCTTAAACG GTAAACGTAT CGCCACCTCT TATCCTCACC TGCTCAAGCG TTATCTCGAC
CAGAAAGGCA TCTCTTTTAA ATCCTGCTTA CTGAACGGTT CTGTTGAAGT CGCCCCGCGT
GCCGGACTGG CGGATGCGAT TTGCGATCTG GTTTCCACCG GTGCCACGCT GGAAGCAAAC
GGCCTGCGCG AAGTCGAAGT TATCTACCGC TCGAAAGCTT GCCTGATCCA ACGCGATGGC
GAAATGGAAG AATCCAAACA GCAACTGATC GACAAACTGC TGACCCGTAT TCAGGGTGTG
ATCCAGGCGC GCGAATCAAA ATACATCATG ATGCACGCAC CGACCGAACG TCTGGATGAA
GTCATCGCCC TGCTGCCAGG TGCCGAACGC CCAACTATTC TGCCGCTGGC GGGTGATCAA
CAGCGCGTAG CGATGCACAT GGTCAGCAGC GAAACCCTGT TCTGGGAAAC CATGGAAAAA
CTGAAAGCGC TGGGTGCCAG TTCAATCCTG GTCCTGCCGA TTGAGAAGAT GATGGAGTGA
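
A GC content figure like the one reported for this gene is the fraction of G and C bases in the sequence. The sketch below shows one way to compute it, assuming the space-separated 10-base blocks used in this listing; `gc_fraction` is a hypothetical helper name. For brevity it is applied only to the first 60-base row, so its result is not expected to equal the whole-gene value of 54%.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGACAGACA ACACTCGTTT ACGCATAGCT ATGCAGAAAT CCGGCCGTTT AAGTGATGAT"

gc_first_row = gc_fraction(first_row)
print(f"{gc_first_row:.1%}")
```

Passing all fifteen rows, joined into one string, would give the GC fraction for the full 900 bp gene.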
 
Protein sequence
MTDNTRLRIA MQKSGRLSDD SRELLARCGI KINLHTQRLI AMAENMPIDI LRVRDDDIPG 
LVMDGVVDLG IIGENVLEEE LLNRRAQGED PRYFTLRRLD FGGCRLSLAT PVDEAWDGPL
SLNGKRIATS YPHLLKRYLD QKGISFKSCL LNGSVEVAPR AGLADAICDL VSTGATLEAN
GLREVEVIYR SKACLIQRDG EMEESKQQLI DKLLTRIQGV IQARESKYIM MHAPTERLDE
VIALLPGAER PTILPLAGDQ QRVAMHMVSS ETLFWETMEK LKALGASSIL VLPIEKMME
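
The protein sequence can be related to the gene sequence by translating it codon by codon. Below is a minimal sketch that uses the standard codon assignments, which translation table 11 shares for sense and stop codons; the helper name `translate` and the choice to stop at the first stop codon are assumptions made for illustration. It translates the first and last rows of the gene sequence so they can be compared with the start and end of the protein sequence above.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence; each row is 60 bases,
# so every row starts on a codon boundary.
first_row = "ATGACAGACA ACACTCGTTT ACGCATAGCT ATGCAGAAAT CCGGCCGTTT AAGTGATGAT"
last_row = "CTGAAAGCGC TGGGTGCCAG TTCAATCCTG GTCCTGCCGA TTGAGAAGAT GATGGAGTGA"

print(translate(first_row))  # MTDNTRLRIAMQKSGRLSDD
print(translate(last_row))   # LKALGASSILVLPIEKMME
```

Both fragments match the corresponding ends of the listed protein sequence, and the final TGA codon accounts for the gene being one codon longer than the 299-residue protein.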