Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4506 |
Symbol | purD |
ID | 6491456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 4386561 |
End bp | 4387850 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642744580 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_002048160 |
Protein GI | 194450383 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.0278032 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTT TAGTCATTGG TAACGGCGGG CGCGAACACG CGCTGGCCTG GAAAGCCGCA CAGTCGCCGT TGGTTGATAC CGTTTTTGTC GCACCGGGTA ACGCCGGTAC CGCGCTGGAG CCAGCGTTGC AGAACGTGGC TATCGGCGTC ACCGATATTC CGGCGCTGCT GAGCTTTGCC CAGAACGAGA AGATAGATCT GACCATCGTT GGCCCGGAAG CGCCGCTGGT GATTGGTGTG GTGGATGAGT TCCGCGCGGC GGGTCTGAAG ATCTTTGGCC CAACCGAAGG GGCCGCCCAA CTGGAAGGCT CCAAAGCGTT CACCAAAGAT TTCCTCGCTC GTCACCAGAT TCCGACGGCG GAATACCAGA ATTTCACCGA GATTGAGCCA GCCCTGGCTT ATCTGCGTGA GAAAGGCGCG CCGATCGTCA TCAAAGCTGA CGGTCTGGCT GCCGGTAAAG GCGTTATCGT GGCGATGACG CTGGAAGAAG CCGAAGCTGC CGTTCATGAC ATGCTGGCGG GTAACGCTTT TGGTGATGCG GGACATCGTA TCGTCATCGA AGAGTTCCTC GACGGCGAAG AGGCAAGCTT TATCGTGATG GTCGACGGCG AGCACGTGCT GCCGATGGCC ACCAGCCAGG ACCACAAACG CGTAGGCAAC GGCGATACCG GCCCGAACAC CGGCGGCATG GGGGCTTACT CTCCGGCTCC AGTGGTAACC GATGAAGTGC ATCAGCGCAC CATGGAACGC ATCATTTGGC CAACCGTGAA AGGCATGGCG GCGGAAGGTA ACACGTACAC CGGCTTCCTG TACGCGGGTC TGATGATCGA CAAGCAGGGT AATCCGAAGG TTATCGAGTT CAACTGCCGC TTCGGCGATC CGGAAACCCA GCCGATCATG TTGCGCATGA AGTCGGACCT GGTGGATCTT TGCCTGGCCG CCTGCGAAGG CAAGCTGGAT GAGAAAACCT CCGAGTGGGA CGAGCGCGCT TCATTAGGCG TGGTGATCGC CGCGGGCGGT TATCCGGGCA ACTACAACAC TGGCGATGAG ATCCACGGCC TGCCGCTGGA AGAAGTGGCT GACGGTAAGG TTTTCCACGC GGGCACCAAA CTCGCCGATG ACGACCGTGT GCTGACCAGC GGCGGACGCG TCCTGTGCGC CACCGCGCTG GGCCACACCG TCGCCGAAGC GCAGAAACGC GCTTACGCCC TGATGACCGA CATCCGCTGG GACGGCAGCT TCAGCCGTAA CGACATCGGC TGGCGCGCCA TCGAACGCGA ACAGCGCTAA
|
Protein sequence | MKVLVIGNGG REHALAWKAA QSPLVDTVFV APGNAGTALE PALQNVAIGV TDIPALLSFA QNEKIDLTIV GPEAPLVIGV VDEFRAAGLK IFGPTEGAAQ LEGSKAFTKD FLARHQIPTA EYQNFTEIEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGN GDTGPNTGGM GAYSPAPVVT DEVHQRTMER IIWPTVKGMA AEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM LRMKSDLVDL CLAACEGKLD EKTSEWDERA SLGVVIAAGG YPGNYNTGDE IHGLPLEEVA DGKVFHAGTK LADDDRVLTS GGRVLCATAL GHTVAEAQKR AYALMTDIRW DGSFSRNDIG WRAIEREQR
|
| |