Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4582 |
Symbol | purD |
ID | 6873473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4421597 |
End bp | 4422886 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642787489 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_002218091 |
Protein GI | 198241778 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.213422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTT TAGTCATTGG TAACGGCGGG CGCGAACACG CGCTGGCCTG GAAAGCCGCA CAGTCGCCGC TGGTTGATAC CGTTTTTGTC GCACCGGGTA ACGCCGGTAC CGCGCTGGAG CCAGCGTTGC AGAACGTGGC TATCGGCGTC ACCGATATTC CGGCGCTGCT GAGCTTTGCC CAGAACGAGA AAATAGATCT GACCATCGTC GGCCCGGAAG CGCCGCTGGT GATTGGCGTG GTCAATGCGT TCCGCGCGGC GGGTCTGAAG ATCTTTGGCC CAACCGAAGG GGCCGCCCAA CTGGAAGGCT CCAAAGCCTT CACCAAAGAT TTCCTCGCTC GTCACCAGAT TCCGACGGCG GAATACCAGA ATTTCACCGA GATTGAGCCA GCCCTGGCTT ATCTGCGTGA GAAAGGCGCG CCGATCGTCA TCAAAGCTGA CGGTCTGGCT GCCGGTAAAG GCGTTATCGT GGCGATGACG CTGGAAGAAG CCGAAGCCGC CGTTCATGAC ATGCTGGCAG GCAACGCCTT TGGCGACGCG GGCCACCGTA TCGTGATTGA GGAGTTCCTC GACGGCGAAG AAGCGAGCTT TATCGTGATG GTCGACGGCG AGCACGTGCT GCCGATGGCC ACCAGCCAGG ATCACAAACG CGTAGGCAAC GGCGATACCG GCCCGAACAC CGGCGGCATG GGGGCTTACT CTCCGGCTCC AGTGGTGACC GATGAAGTCC ACCAGCGCAC CATGGAACGG ATCATCTGGC CAACCGTGAA AGGCATGGCG GCAGAAGGGA ATACCTATAC CGGCTTCCTG TACGCGGGTC TGATGATCGA CAAGCAGGGC AACCCAAAAG TGATCGAGTT CAACTGCCGC TTCGGCGATC CGGAAACCCA GCCGATCATG CTGCGCATGA AGTCGGATCT GGTGGATCTT TGCCTGGCCG CCTGCGACGG CAAGCTGGAT GAGAAAACCT CCGAGTGGGA CGAACGCGCT TCATTAGGCG TGGTGATCGC CGCGGGCGGT TATCCGGGCA ACTACAACAC TGGCGATGAG ATCCACGGCC TGCCGCTGGA AGAAGTGGCT GACGGTAAGG TTTTCCACGC GGGCACCAAA CTCGCCGATG ACGACCGTGT GCTGACCAGC GGCGGTCGCG TACTGTGCGC CACCGCGCTG GGCCACACCG TGGCTGAGGC GCAGAAACGC GCTTACGCCT TGATGACCGA CATCCGCTGG GACGGCAGCT TCAGCCGTAA CGACATCGGC TGGCGCGCCA TTGAGCGTGA GCAAAACTAA
|
Protein sequence | MKVLVIGNGG REHALAWKAA QSPLVDTVFV APGNAGTALE PALQNVAIGV TDIPALLSFA QNEKIDLTIV GPEAPLVIGV VNAFRAAGLK IFGPTEGAAQ LEGSKAFTKD FLARHQIPTA EYQNFTEIEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGN GDTGPNTGGM GAYSPAPVVT DEVHQRTMER IIWPTVKGMA AEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM LRMKSDLVDL CLAACDGKLD EKTSEWDERA SLGVVIAAGG YPGNYNTGDE IHGLPLEEVA DGKVFHAGTK LADDDRVLTS GGRVLCATAL GHTVAEAQKR AYALMTDIRW DGSFSRNDIG WRAIEREQN
|
| |