Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4648 |
Symbol | purA |
ID | 6143254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4749469 |
End bp | 4750767 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619464 |
Product | adenylosuccinate synthetase |
Protein accession | YP_001746572 |
Protein GI | 170679879 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0104] Adenylosuccinate synthase |
TIGRFAM ID | [TIGR00184] adenylosuccinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.378806 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAACA ACGTCGTCGT ACTGGGCACC CAATGGGGTG ACGAAGGTAA AGGTAAGATC GTCGATCTTC TGACTGAACG GGCTAAATAT GTTGTACGCT ACCAGGGCGG TCACAACGCA GGCCATACTC TCGTAATCAA CGGTGAAAAA ACCGTTCTCC ATCTTATTCC ATCAGGTATT CTCCGCGAGA ATGTAACCAG CATCATCGGT AACGGTGTTG TGCTGTCTCC GGCTGCGCTG ATGAAAGAGA TGAAAGAACT GGAAGACCGT GGCATCCCCG TTCGTGAGCG TCTGCTGCTG TCTGAAGCAT GTCCGCTGAT CCTTGATTAT CACGTTGCGC TGGATAACGC GCGTGAGAAA GCGCGTGGCG CGAAAGCGAT CGGCACCACC GGTCGTGGTA TCGGGCCTGC TTATGAAGAT AAAGTGGCAC GTCGCGGTCT GCGTGTTGGC GACCTTTTCG ACAAAGAAAC CTTCGCTGAA AAACTGAAAG AAGTGATGGA ATATCACAAC TTCCAGTTGG TTAACTACTA CAAAGCTGAA GCGGTTGATT ACCAGAAAGT TCTGGATGAT ACGATGGCTG TTGCCGACAT CCTGACTTCT ATGGTTGTTG ACGTTTCTGA TCTGCTCGAC CAGGCGCGTC AGCGTGGCGA TTTCGTCATG TTTGAAGGTG CGCAGGGTAC GCTGCTGGAT ATCGACCACG GTACTTATCC GTACGTAACT TCTTCCAACA CCACTGCTGG TGGCGTGGCG ACCGGTTCCG GCCTGGGCCC ACGTTATGTT GATTACGTTC TGGGTATCCT CAAAGCTTAC TCCACTCGTG TGGGTGCAGG TCCGTTCCCG ACTGAACTGT TTGATGAAAC TGGCGAGTTC CTCTGCAAGC AGGGTAACGA ATTCGGCGCA ACTACGGGTC GTCGTCGTCG TACCGGCTGG CTGGACACCG TTGCCGTTCG TCGTGCGGTA CAGCTGAACT CCCTGTCTGG CTTCTGCCTG ACCAAGCTGG ACGTTCTGGA TGGCCTGAAA GAGGTGAAAC TCTGCGTGGC TTACCGTATG CCGGATGGTC GTGAAGTGAC TACCACTCCG CTGGCAGCTG ACGACTGGAA AGGTGTAGAG CCGATTTACG AAACCATGCC GGGCTGGTCT GAATCCACCT TCGGCGTGAA AGATCGTAGC GGCCTGCCGC AGGCGGCGCT GAACTACATC AAGCGTATTG AAGAGCTGAC CGGTGTGCCG ATCGATATCA TCTCTACCGG TCCGGATCGT ACTGAAACCA TGATTCTGCG CGACCCGTTC GACGCGTAA
|
Protein sequence | MGNNVVVLGT QWGDEGKGKI VDLLTERAKY VVRYQGGHNA GHTLVINGEK TVLHLIPSGI LRENVTSIIG NGVVLSPAAL MKEMKELEDR GIPVRERLLL SEACPLILDY HVALDNAREK ARGAKAIGTT GRGIGPAYED KVARRGLRVG DLFDKETFAE KLKEVMEYHN FQLVNYYKAE AVDYQKVLDD TMAVADILTS MVVDVSDLLD QARQRGDFVM FEGAQGTLLD IDHGTYPYVT SSNTTAGGVA TGSGLGPRYV DYVLGILKAY STRVGAGPFP TELFDETGEF LCKQGNEFGA TTGRRRRTGW LDTVAVRRAV QLNSLSGFCL TKLDVLDGLK EVKLCVAYRM PDGREVTTTP LAADDWKGVE PIYETMPGWS ESTFGVKDRS GLPQAALNYI KRIEELTGVP IDIISTGPDR TETMILRDPF DA
|
| |