Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4708 |
Symbol | purA |
ID | 6272445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4400237 |
End bp | 4401535 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641728473 |
Product | adenylosuccinate synthetase |
Protein accession | YP_001882868 |
Protein GI | 187732758 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0104] Adenylosuccinate synthase |
TIGRFAM ID | [TIGR00184] adenylosuccinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAACA ACGTCGTCGT ACTGGGCACC CAATGGGGTG ACGAAGGTAA AGGTAAGATC GTCGATCTTC TGACTGAACG GGCTAAATAT GTTGTACGCT ACCAGGGCGG TCACAACGCA GGCCATACTC TCGTAATCAA CGGTGAAAAA ACCGTTCTCC ATCTTATTCC ATCAGGTATT CTCCGCGAGA ATGTAACCAG CATCATCGGT AACGGTGTTG TGCTGTCTCC GGCCGCGCTG ATGAAAGAGA TGAAAGAACT GGAAGACCGT GGCATCCCCG TTCGTGAGCG TCTGCTGCTG TCTGAAGCAT GTCCGCTGAT CCTTGATTAT CACGTTGCGC TGGATAACGC GCGTGAGAAA GCGCGTGGCG CGAAAGCGAT CGGCACCACC GGTCGTGGTA TCGGGCCTGC TTATGAAGAT AAAGTGGCAC GTCGCGGTCT GCGTGTTGGC GACCTTTTCG ACAAAGAAAC CTTCGCTGAA AAACTGAAAG AAGTGATGGA ATATCACAAC TTCCAGTTGG TTAACTACTA CAAAGCTGAA GCGGTTGATT ACCAGAAAGT TCTGGATGAT ACGATGGCTG TTGCCGACAT CCTGACTTCT ATGGTGGTTG ACGTTTCTGA CCTGCTCGAC CAGGCGCGTC AGCGTGGCGA TTTCGTCATG TTTGAAGGTG CGCAGGGTAC GCTGCTGGAT ATCGACCACG GTACTTATCC GTACGTAACT TCTTCCAACA CCACTGCTGG TGGCGTGGCG ACCGGTTCCG GCCTGGGCCC GCGTTATGTT GATTACGTTC TGGGTATCCT CAAAGCTTAC TCCACTCGTG TGGGTGCAGG TCCTTTCCCG ACCGAACTGT TTGATGAAAC TGGCGAGTTC CTCTGCAAGC AGGGTAACGA ATTCGGCGCA ACGACGGGTC GTCGTCGTCG TACCGGCTGG CTGGACACCG TTGCCGTTCG TCGTGCGGTA CAGCTGAACT CCCTGTCTGG CTTCTGCCTG ACTAAACTGG ACGTTCTGGA TGGCCTGAAA GAGGTTAAAC TCTGCGTGGC TTACCGTATG CCGGATGGTC GCGAAGTGAC TACCACTCCG CTGGCAGCTG ACGACTGGAA AGGTGTAGAG CCGATTTACG AAACCATGCC GGGCTGGTCT GAATCCACTT TCGGCGTGAA AGATCGTAGC GGGCTGCCGC AGGCGGCGCT GAACTACATC AAGCGTATTG AAGAGCTGAC CGGTGTGCCG ATCGATATCA TCTCTACCGG TCCGGATCGT ACTGAAACTA TGATTCTGCG CGACCCGTTC GACGCGTAA
|
Protein sequence | MGNNVVVLGT QWGDEGKGKI VDLLTERAKY VVRYQGGHNA GHTLVINGEK TVLHLIPSGI LRENVTSIIG NGVVLSPAAL MKEMKELEDR GIPVRERLLL SEACPLILDY HVALDNAREK ARGAKAIGTT GRGIGPAYED KVARRGLRVG DLFDKETFAE KLKEVMEYHN FQLVNYYKAE AVDYQKVLDD TMAVADILTS MVVDVSDLLD QARQRGDFVM FEGAQGTLLD IDHGTYPYVT SSNTTAGGVA TGSGLGPRYV DYVLGILKAY STRVGAGPFP TELFDETGEF LCKQGNEFGA TTGRRRRTGW LDTVAVRRAV QLNSLSGFCL TKLDVLDGLK EVKLCVAYRM PDGREVTTTP LAADDWKGVE PIYETMPGWS ESTFGVKDRS GLPQAALNYI KRIEELTGVP IDIISTGPDR TETMILRDPF DA
|
| |