Gene Information |
Locus tag | ECH74115_1592 |
Symbol | purB |
ID | 6966662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1549016 |
End bp | 1550386 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385554 |
Product | adenylosuccinate lyase |
Protein accession | YP_002270048 |
Protein GI | 209399204 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
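The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal check, not part of the record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a single stop codon; the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1549016
END_BP = 1550386


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1371 456
```

Under these assumptions the result matches the Gene Length (1371 bp) and Protein Length (456 aa) fields.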
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0000440379 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC AGCGCGCTGC GCGGGATTTT CAGCGAATAT GGTTTGCTGA AATTCCGTGT ACAAGTTGAA GTACGTTGGC TGCAAAAACT GGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT GCCGACGCAA TCGGTTACCT TGATGCAATT GTCGCCAATT TCAGCGAAGA AGATGCCGCA CGCATCAAAA CCATCGAGCG TACTACTAAC CACGACGTTA AAGCGGTTGA GTATTTCCTG AAAGAAAAAG TGGCGGAGAT CCCGGAACTG CACGCGGTTT CTGAATTCAT CCACTTTGCC TGTACTTCGG AAGATATCAA TAACCTCTCC CACGCATTAA TGCTGAAAAC CGCGCGTGAT GAAGTGATCC TGCCGTACTG GCGTCAACTA ATTGATGGCA TTAAAGATCT CGCCGTTCAG TATCGCGATA TCCCGCTGCT GTCCCGTACC CACGGTCAGC CAGCCACGCC GTCAACCATC GGTAAAGAGA TGGCAAACGT CGCCTACCGT ATGGAGCGCC AGTACCGCCA GCTTAACCAG GTGGAGATCC TCGGCAAAAT CAACGGCGCG GTCGGTAACT ATAACGCCCA CATCGCCGCT TACCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGTATTCAA TGGAACCCGT ACACTACCCA GATTGAACCG CACGACTACA TTGCCGAACT GTTTGATTGC GTTGCGCGCT TCAACACCAT TCTGATCGAC TTTGACCGTG ACGTCTGGGG TTATATCGCC CTTAACCACT TCAAACAGAA AACCATTGCT GGTGAGATTG GTTCTTCCAC CATGCCGCAT AAAGTTAACC CGATCGACTT CGAAAACTCC GAAGGGAATC TGGGCCTTTC CAACGCGGTA TTGCAGCATC TGGCAAGCAA ACTGCCGGTT TCCCGCTGGC AGCGTGACCT GACCGACTCC ACCGTGCTGC GTAACCTCGG CGTGGGTATC GGTTATGCGC TGATTGCGTA TCAATCCACC CTGAAAGGCG TGAGCAAACT GGAAGTGAAC CGTGACCATC TGCTGGATGA GCTGGATCAC AACTGGGAAG TGCTGGCAGA ACCAATCCAG ACAGTTATGC GTCGCTATGG CATCGAAAAA CCGTACGAGA AGCTGAAAGA GCTGACTCGC GGTAAGCGCG TTGACGCCGA AGGCATGAAG CAGTTTATCG ACGGTCTGGC GTTGCCAGAA GAAGAGAAAG CCCGCCTGAA AGCGATGACG CCGGCTAACT ACATTGGTCG CGCCATCACC ATGGTTGATG AGCTGAAATA A
|
Protein sequence | MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA ADAIGYLDAI VANFSEEDAA RIKTIERTTN HDVKAVEYFL KEKVAEIPEL HAVSEFIHFA CTSEDINNLS HALMLKTARD EVILPYWRQL IDGIKDLAVQ YRDIPLLSRT HGQPATPSTI GKEMANVAYR MERQYRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ WNPYTTQIEP HDYIAELFDC VARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH KVNPIDFENS EGNLGLSNAV LQHLASKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST LKGVSKLEVN RDHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK QFIDGLALPE EEKARLKAMT PANYIGRAIT MVDELK
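The relationship between the two sequences above can be illustrated with a short sketch that strips the 10-character grouping, computes GC content, and translates codons into residues. It is an illustration under stated assumptions rather than the database's own method. The gene sequence is assumed to be shown in coding orientation, since it begins with ATG and ends with TAA even though the gene lies on the minus strand. The amino acid assignments used are those of the standard genetic code, which translation table 11 shares. Alternative start codons, which table 11 permits, are not handled; this gene starts with ATG, so that does not matter here. All function names are hypothetical.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGAATTAT CCTCACTGAC CGCCGTTTCC"))  # MELSSLTAVS
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the GC content field.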