Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4763 |
Symbol | purA |
ID | 6871011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4621611 |
End bp | 4622909 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642787656 |
Product | adenylosuccinate synthetase |
Protein accession | YP_002218250 |
Protein GI | 198242444 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0104] Adenylosuccinate synthase |
TIGRFAM ID | [TIGR00184] adenylosuccinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.707154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAACA ACGTCGTCGT ACTGGGCACC CAATGGGGTG ACGAAGGTAA AGGAAAGATC GTCGATCTTC TGACTGAACG GGCTAAATAT GTTGTACGCT ACCAGGGCGG TCACAACGCA GGCCATACTC TCGTAATCAA CGGTGAAAAA ACCGTTCTCC ATCTTATTCC ATCAGGTATT CTTCGCGAGA ATGTAACCAG CATCATCGGT AACGGTGTTG TGCTGTCTCC GAGCGCGCTG ATGAAAGAGA TGAAAGAACT GGAAGACCGT GGCATCCCCG TTCGTGAGCG TCTGCTGCTG TCTGAAGCCT GTCCGCTGAT CCTTGATTAT CACGTTGCGC TGGATAACGC GCGTGAGAAA GCGCGTGGCG CGAAAGCGAT CGGCACCACC GGGCGTGGAA TCGGGCCTGC TTATGAAGAT AAAGTGGCAC GTCGCGGTCT GCGTGTTGGT GACCTTTTCG ACAAAGAAAC CTTCGCTGAA AAACTGAAAG AAGTGATGGA ATATCACAAC TTCCAGTTGG TTAACTACTA CAAAGTTGAA GCGGTTGATT ACCAGAAAGT TCTGGATGAT ACGATGGCTG TTGCCGACAT CCTGACTTCT ATGGTTGTTG ACGTTTCAGA CCTGCTCGAC CAGGCGCGTC AGCGTGGCGA TTTCGTCATG TTTGAAGGTG CGCAGGGTAC CCTGCTGGAT ATCGACCACG GTACTTATCC GTACGTAACT TCTTCTAACA CCACTGCAGG TGGCGTGGCG ACCGGTTCCG GCCTGGGCCC GCGTTATGTT GATTACGTTC TGGGTATCCT CAAAGCTTAC TCCACTCGTG TAGGTGCAGG TCCGTTCCCG ACCGAACTGT TTGATGAAAC CGGCGAGTTC CTCTGCAAGC AGGGTAACGA ATACGGCGCC ACTACCGGCC GTCGTCGTCG TACCGGCTGG CTGGACACCG TTGCCGTTCG TCGTGCGGTA CAGCTGAACT CCCTGTCTGG CTTCTGCCTG ACCAAACTGG ACGTGCTGGA TGGCCTGAAA GAGGTGAAAC TCTGCGTGGC TTATCGTATG CCGGATGGTC GCGAAGTGAC TACCACTCCG CTGGCAGCTG ACGACTGGAA AGGTGTAGAG CCGATTTACG AAACCATGCC GGGCTGGTCT GAATCCACCT TCGGCGTGAA AGATCGTAGC GGCCTGCCGC AGGCGGCGCT GAACTACATC AAACGTATCG AAGAACTGAC CGGCGTGCCG ATTGATATTA TTTCTACCGG CCCCGATCGT ACTGAGACGA TGATTCTGCG CGACCCGTTC GACGCGTAA
|
Protein sequence | MGNNVVVLGT QWGDEGKGKI VDLLTERAKY VVRYQGGHNA GHTLVINGEK TVLHLIPSGI LRENVTSIIG NGVVLSPSAL MKEMKELEDR GIPVRERLLL SEACPLILDY HVALDNAREK ARGAKAIGTT GRGIGPAYED KVARRGLRVG DLFDKETFAE KLKEVMEYHN FQLVNYYKVE AVDYQKVLDD TMAVADILTS MVVDVSDLLD QARQRGDFVM FEGAQGTLLD IDHGTYPYVT SSNTTAGGVA TGSGLGPRYV DYVLGILKAY STRVGAGPFP TELFDETGEF LCKQGNEYGA TTGRRRRTGW LDTVAVRRAV QLNSLSGFCL TKLDVLDGLK EVKLCVAYRM PDGREVTTTP LAADDWKGVE PIYETMPGWS ESTFGVKDRS GLPQAALNYI KRIEELTGVP IDIISTGPDR TETMILRDPF DA
|
| |