Gene ECH74115_5474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5474 
SymbolpurD 
ID6967135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5116559 
End bp5117848 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content58% 
IMG OID643389121 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_002273522 
Protein GI209396846 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.13476 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAT TAGTGATTGG TAACGGCGGG CGCGAGCACG CGCTGGCCTG GAAAGCGGCC 
CAGTCGCCGC TGGTTGAGAC TGTTTTTGTT GCTCCGGGTA ACGCAGGCAC AGCGCTGGAA
CCCACGCTGC AAAACGTCGC TATCGGCGTG ACCGATATCC CGGCGCTGCT GGATTTCGCG
CAAAACGAAA AGGTAGATCT AACCATCGTC GGCCCGGAAG CGCCGCTGGT GAAAGGCGTG
GTCGATACCT TCCGCGCCGC CGGGATGAAA ATCTTCGGCC CAACCGCAGG CGCCGCACAG
CTGGAAGGCT CGAAAGCGTT TACTAAAGAT TTCCTGGCCC GCCATAACAT TCCTACGGCG
GAATATCAGA ACTTCACCGA GGTAGAACCT GCGCTGGCGT ATCTGCGTGA GAAAGGCGCG
CCAATCGTCA TTAAAGCGGA CGGTCTGGCT GCCGGGAAAG GCGTTATCGT GGCGATGACG
CTGGAAGAAG CGGAAGCGGC TGTTCACGAT ATGCTGGCGG GCAACGCGTT TGGCGACGCG
GGTCATCGCA TCGTTATCGA AGAGTTCCTC GACGGCGAAG AAGCGAGCTT TATCGTGATG
GTGGACGGCG AGCATGTGCT GCCGATGGCC ACCAGCCAGG ATCACAAACG CGTAGGCGAT
AAAGATACCG GGCCGAACAC GGGCGGAATG GGCGCTTACT CTCCTGCGCC GGTAGTGACA
GATGACGTTC ATCAGCGCAC GATGGAACGT ATCATCTGGC CAACCGTGAA AGGCATGGCG
TCGGAAGGCA ACACCTACAC CGGTTTTCTC TACGCGGGCC TGATGATCGA CAAACAGGGC
AATCCGAAGG TTATCGAATT TAACTGCCGC TTTGGCGATC CAGAAACCCA GCCGATTATG
CTGCGCATGA AGTCCGATCT TGTTGAACTC TGCCTGGCGG CCTGTGAAGG CAAACTAGAC
GAGAAAACGT CAGAGTGGGA TGAACGTGCT TCTCTCGGCG TGGTGATGGC TGCGGGTGGA
TATCCGGGCG ATTACCGCAC CGGTGACGTG ATCCACGGCC TGCCGCTGGA AGAAGTGGAA
GACGGCAAAG TGTTCCACGC GGGCACAAAA CTGGCGGATG ACGAGCAGGT GGTAACCAGC
GGCGGGCGCG TACTGTGCGT CACCGCGCTG GGTCATACCG TAGCAGAAGC ACAGAAACGC
GCCTATGCCT TAATGACCGA TATCCACTGG GACGACTGCT TCTGCCGGAA AGATATCGGC
TGGCGCGCTA TCGAACGCGA GCAGAACTAA
 
Protein sequence
MKVLVIGNGG REHALAWKAA QSPLVETVFV APGNAGTALE PTLQNVAIGV TDIPALLDFA 
QNEKVDLTIV GPEAPLVKGV VDTFRAAGMK IFGPTAGAAQ LEGSKAFTKD FLARHNIPTA
EYQNFTEVEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA
GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGD KDTGPNTGGM GAYSPAPVVT
DDVHQRTMER IIWPTVKGMA SEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM
LRMKSDLVEL CLAACEGKLD EKTSEWDERA SLGVVMAAGG YPGDYRTGDV IHGLPLEEVE
DGKVFHAGTK LADDEQVVTS GGRVLCVTAL GHTVAEAQKR AYALMTDIHW DDCFCRKDIG
WRAIEREQN