Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4496 |
Symbol | purD |
ID | 6270763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4205478 |
End bp | 4206767 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641728289 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_001882691 |
Protein GI | 187730149 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0000225394 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTAT TAGTGATTGG TAACGGCGGG CGCGAGCACG CGCTGGCCTG GAAAGCGGCC CAGTCGCCGC TGGTTGAGAC TGTTTTTGTT GCTCCGGGTA ATGCAGGCAC TGCGCTGGAA CCCGCGCTGC AAAACGTTGC TATTGGCGTG ACCGATATCC CGGCGCTGTT GGATTTCGCA CAAAACGAAA AGATTGATCT GACCATCGTC GGCCCGGAAG CGCCGCTGGT GAAAGGCGTG GTCGATACCT TCCGCGCCGC CGGGCTGAAA ATCTTCGGCC CAACCGCAGG TGCGGCCCAA CTGGAAGGCT CAAAAGCGTT TACCAAAGAT TTCCTGGCCC GCCATAAGAT CCCTACGGCG GAATACCAGA ACTTCACCGA GGTAGAACCT GCGCTGGCGT ATCTGCGTGA GAAAGGCGCG CCAATCGTCA TTAAAGCGGA CGGTCTGGCT GCCGGGAAAG GCGTTATCGT GGCGATGACG CTGGAAGAAG CGGAAGCGGC TGTTCACGAT ATGCTGGCGG GCAACGCTTT TGGCGACGCG GGTCATCGCA TCGTTATCGA AGAGTTCCTC GATGGCGAAG AAGCGAGCTT TATCGTGATG GTGGACGGCG AGCATGTGCT GCCGATGGCT ACCAGCCAGG ATCACAAACG CGTAGGCGAT AAAGATACCG GACCAAACAC CGGCGGGATG GGCGCTTACT CCCCCGCGCC GGTAGTAACC GATGACGTTC ATCAGCGCAC CATGGAACGT ATTATCTGGC CAACCGTGAA AGGCATGGCG GCGGAAGGCA ACACCTACAC CGGTTTTCTC TACGCGGGCC TGATGATCGA CAAACAGGGC AATCCGAAGG TTATTGAATT TAACTGCCGC TTTGGCGATC CGGAAACCCA GCCGATTATG CTGCGCATGA AGTCCGATCT GGTTGAGCTC TGCCTAGCGG CCTGTGAAAG CAAACTGGAC GAGAAAACGT CCGAGTGGGA TGAACGCGCT TCTCTCGGCG TGGTGATGGC TGCGGGTGGA TATCCGGGTG ATTACCGCAC CGGTGACGTG ATCCACGGCC TGCCGCTGGA AGAAGTGGCA GGCGGCAAAG TGTTCCACGC GGGCACAAAA CTGGCGGATG ACGAGCAGGT AGTGACCAAC GGCGGGCGCG TACTGTGCGT CACCGCGCTG GGTCATACCG TGGCAGAAGC GCAGAAACGC GCCTATGCCT TAATGACCGA TATCCACTGG GACGACTGCT TCTGCCGGAA AGATATCGGC TGGCGTGCTA TCGAACGCGA GCAGAACTAA
|
Protein sequence | MKVLVIGNGG REHALAWKAA QSPLVETVFV APGNAGTALE PALQNVAIGV TDIPALLDFA QNEKIDLTIV GPEAPLVKGV VDTFRAAGLK IFGPTAGAAQ LEGSKAFTKD FLARHKIPTA EYQNFTEVEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGD KDTGPNTGGM GAYSPAPVVT DDVHQRTMER IIWPTVKGMA AEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM LRMKSDLVEL CLAACESKLD EKTSEWDERA SLGVVMAAGG YPGDYRTGDV IHGLPLEEVA GGKVFHAGTK LADDEQVVTN GGRVLCVTAL GHTVAEAQKR AYALMTDIHW DDCFCRKDIG WRAIEREQN
|
| |