Gene Information |
Locus tag | SNSL254_A4508 |
Symbol | purD |
ID | 6482854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4381298 |
End bp | 4382587 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642739737 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_002043423 |
Protein GI | 194445679 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
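
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table above.
length = gene_length(4381298, 4382587)
print(length)                           # 1290
print(expected_protein_length(length))  # 429
```

Under these assumptions the computed values agree with the listed Gene Length (1290 bp) and Protein Length (429 aa).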
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.0778373 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTT TAGTCATTGG TAACGGCGGG CGCGAACACG CGCTGGCCTG GAAAGCCGCA CAGTCGCCGC TGGTTGATAC CGTTTTTGTC GCACCGGGTA ACGCCGGTAC CGCGCTGGAG CCTGCGTTGC AGAACGTGGC TATCGGCGTC ACCGATATTC CAGCGCTGCT GAGCTTTGCC CAGAACGAGA AGATTGATCT GACCATCGTC GGCCCGGAAG CGCCGCTGGT GATTGGCGTG GTCGATGCGT TCCGCGCGGC GGGCCTGAAG ATCTTCGGCC CAACCGAAGG CGCCGCGCAA CTGGAAGGCT CCAAAGCCTT CACCAAAGAT TTCCTCGCTC GTCACCAGAT TCCGACGGCG GAATACCAGA ATTTCACCGA GATTGAGCCT GCCCTGGCTT ATCTGCGTGA GAAAGGCGCG CCGATCGTCA TCAAGGCCGA TGGTCTGGCC GCCGGTAAAG GCGTTATCGT GGCGATGACG CTGGAAGAAG CAGAAGCTGC CGTACACGAT ATGCTGGCCG GTAACGCCTT TGGCGACGCG GGTCACCGCA TCGTGATTGA AGAGTTCCTC GACGGCGAGG AAGCGAGCTT TATCGTGATG GTCGACGGCG AGCACGTTCT GCCGATGGCT ACCAGCCAGG ACCACAAACG CGTGGGCAAT GGCGATACCG GCCCGAATAC CGGCGGTATG GGGGCCTACT CACCGGCGCC GGTGGTGACT GATGAAGTGC ACCAGCGCAC CATGGAACGC ATCATCTGGC CAACCGTGAA AGGCATGGCA GCAGAAGGTA ACACGTACAC CGGCTTCCTG TATGCGGGTC TGATGATCGA CAAGCAGGGC AACCCGAAAG TTATCGAGTT CAACTGCCGC TTCGGCGATC CGGAAACCCA GCCGATCATG CTGCGCATGA AATCGGATCT GGTGGATCTT TGCCTGGCCG CCTGCGACGG CAAGCTGGAT GAGAAAACCT CCGAGTGGGA CGAACGCGCT TCATTAGGCG TGGTGATCGC CGCGGGCGGT TATCCGGGCA ACTACAACAC TGGCGATGAG ATCCACGGCC TGCCGCTGGA AGAAGTGGCT GACGGTAAGG TTTTCCACGC GGGCACCAAA CTCGCCGATG ACGACCGTGT GCTGACCAGC GGCGGACGCG TCCTGTGCGC CACCGCGCTG GGCCACACCG TCGCCGAAGC GCAGAAACGC GCTTACGCCC TGATGACCGA CATCCGCTGG GACGGCAGCT TCAGCCGTAA CGACATCGGC TGGCGCGCCA TCGAACGCGA ACAGCGCTAA
|
Protein sequence | MKVLVIGNGG REHALAWKAA QSPLVDTVFV APGNAGTALE PALQNVAIGV TDIPALLSFA QNEKIDLTIV GPEAPLVIGV VDAFRAAGLK IFGPTEGAAQ LEGSKAFTKD FLARHQIPTA EYQNFTEIEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGN GDTGPNTGGM GAYSPAPVVT DEVHQRTMER IIWPTVKGMA AEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM LRMKSDLVDL CLAACDGKLD EKTSEWDERA SLGVVIAAGG YPGNYNTGDE IHGLPLEEVA DGKVFHAGTK LADDDRVLTS GGRVLCATAL GHTVAEAQKR AYALMTDIRW DGSFSRNDIG WRAIEREQR
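
The gene sequence above is shown in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch, assuming the codon assignments of translation table 11, which match the standard genetic code for sense and stop codons. The helpers `translate` and `gc_percent` are hypothetical names introduced for illustration; both strip the spaces that separate the 10-base blocks.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G), as used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
print(translate("ATGAAAGTTT TAGTCATTGG TAACGGCGGG"))  # MKVLVIGNGG
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows the result to be compared with the Protein sequence and GC content fields of this record.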