Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5474 |
Symbol | purD |
ID | 6967135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5116559 |
End bp | 5117848 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643389121 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_002273522 |
Protein GI | 209396846 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.13476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAT TAGTGATTGG TAACGGCGGG CGCGAGCACG CGCTGGCCTG GAAAGCGGCC CAGTCGCCGC TGGTTGAGAC TGTTTTTGTT GCTCCGGGTA ACGCAGGCAC AGCGCTGGAA CCCACGCTGC AAAACGTCGC TATCGGCGTG ACCGATATCC CGGCGCTGCT GGATTTCGCG CAAAACGAAA AGGTAGATCT AACCATCGTC GGCCCGGAAG CGCCGCTGGT GAAAGGCGTG GTCGATACCT TCCGCGCCGC CGGGATGAAA ATCTTCGGCC CAACCGCAGG CGCCGCACAG CTGGAAGGCT CGAAAGCGTT TACTAAAGAT TTCCTGGCCC GCCATAACAT TCCTACGGCG GAATATCAGA ACTTCACCGA GGTAGAACCT GCGCTGGCGT ATCTGCGTGA GAAAGGCGCG CCAATCGTCA TTAAAGCGGA CGGTCTGGCT GCCGGGAAAG GCGTTATCGT GGCGATGACG CTGGAAGAAG CGGAAGCGGC TGTTCACGAT ATGCTGGCGG GCAACGCGTT TGGCGACGCG GGTCATCGCA TCGTTATCGA AGAGTTCCTC GACGGCGAAG AAGCGAGCTT TATCGTGATG GTGGACGGCG AGCATGTGCT GCCGATGGCC ACCAGCCAGG ATCACAAACG CGTAGGCGAT AAAGATACCG GGCCGAACAC GGGCGGAATG GGCGCTTACT CTCCTGCGCC GGTAGTGACA GATGACGTTC ATCAGCGCAC GATGGAACGT ATCATCTGGC CAACCGTGAA AGGCATGGCG TCGGAAGGCA ACACCTACAC CGGTTTTCTC TACGCGGGCC TGATGATCGA CAAACAGGGC AATCCGAAGG TTATCGAATT TAACTGCCGC TTTGGCGATC CAGAAACCCA GCCGATTATG CTGCGCATGA AGTCCGATCT TGTTGAACTC TGCCTGGCGG CCTGTGAAGG CAAACTAGAC GAGAAAACGT CAGAGTGGGA TGAACGTGCT TCTCTCGGCG TGGTGATGGC TGCGGGTGGA TATCCGGGCG ATTACCGCAC CGGTGACGTG ATCCACGGCC TGCCGCTGGA AGAAGTGGAA GACGGCAAAG TGTTCCACGC GGGCACAAAA CTGGCGGATG ACGAGCAGGT GGTAACCAGC GGCGGGCGCG TACTGTGCGT CACCGCGCTG GGTCATACCG TAGCAGAAGC ACAGAAACGC GCCTATGCCT TAATGACCGA TATCCACTGG GACGACTGCT TCTGCCGGAA AGATATCGGC TGGCGCGCTA TCGAACGCGA GCAGAACTAA
|
Protein sequence | MKVLVIGNGG REHALAWKAA QSPLVETVFV APGNAGTALE PTLQNVAIGV TDIPALLDFA QNEKVDLTIV GPEAPLVKGV VDTFRAAGMK IFGPTAGAAQ LEGSKAFTKD FLARHNIPTA EYQNFTEVEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGD KDTGPNTGGM GAYSPAPVVT DDVHQRTMER IIWPTVKGMA SEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM LRMKSDLVEL CLAACEGKLD EKTSEWDERA SLGVVMAAGG YPGDYRTGDV IHGLPLEEVE DGKVFHAGTK LADDEQVVTS GGRVLCVTAL GHTVAEAQKR AYALMTDIHW DDCFCRKDIG WRAIEREQN
|
| |