Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1994 |
Symbol | purB |
ID | 6146250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2014524 |
End bp | 2015894 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616870 |
Product | adenylosuccinate lyase |
Protein accession | YP_001744046 |
Protein GI | 170682479 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.922086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC AGCGCGCTGC GCGGGATTTT CAGCGAATAT GGTTTGCTGA AATTCCGTGT ACAAGTTGAA GTACGTTGGC TGCAAAAACT GGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT GCCGACGCAA TCGGTTACCT TGATGCAATT GTCGCCAGTT TCAGCGAAGA AGATGCCGCA CGCATCAAAA CCATCGAGCG TACCACTAAC CACGACGTTA AAGCGGTTGA GTATTTCCTG AAAGAAAAAG TGGCGGAGAT CCCGGAACTG CACGCGGTTT CTGAATTCAT CCACTTTGCC TGTACTTCGG AAGATATCAA TAACCTCTCC CACGCATTAA TGCTGAAAAC CGCGCGTGAT GAAGTGATCC TGCCGTACTG GCGTCAACTG ATTGATGGCA TTAAAGATCT CGCCGCTCAG TACCGCGATA TCCCGCTGCT GTCCCGTACC CACGGTCAGC CAGCCACGCC GTCAACCATC GGTAAAGAGA TGGCTAACGT CGCCTACCGT ATGGAGCGCC AGTACCGCCA GCTTAACCAG GTGGAGATCC TCGGCAAAAT CAACGGTGCG GTCGGTAACT ATAACGCCCA CATCGCCGCT TACCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGTATTCAG TGGAACCCGT ACACCACCCA GATCGAACCG CACGACTACA TTGCCGAACT GTTTGATTGC GTTGCGCGCT TCAACACCAT TCTGATCGAC TTTGACCGTG ACGTCTGGGG TTATATCGCC CTTAACCACT TCAAACAGAA AACCATTGCT GGTGAGATTG GTTCTTCCAC CATGCCGCAT AAAGTTAACC CGATCGACTT CGAAAACTCC GAAGGAAACC TGGGCCTTTC CAACGCGGTA TTGCAGCACC TGGCAAGCAA ACTGCCAGTT TCCCGCTGGC AGCGTGACCT GACCGACTCC ACCGTGCTGC GTAACCTCGG CGTGGGTATC GGTTATGCGC TGATTGCGTA TCAATCCACC CTGAAAGGCG TGAGCAAACT GGAAGTGAAC CGTGGCCATC TGCTGGATGA ACTGGATCAC AACTGGGAAG TGCTGGCTGA GCCAATCCAG ACAGTTATGC GTCGCTATGG CATCGAAAAA CCGTACGAGA AGCTGAAAGA GCTGACTCGC GGTAAGCGCG TTGACGCCGA AGGCATGAAG CAGTTTATCG ACGGTCTGGC GCTGCCGGAA GAAGAGAAAG CCCGCCTTAA AGCGATGACG CCGGCAAACT ACATTGGTCG CGCCATCACC ATGGTTGATG AGCTGAAATA A
|
Protein sequence | MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA ADAIGYLDAI VASFSEEDAA RIKTIERTTN HDVKAVEYFL KEKVAEIPEL HAVSEFIHFA CTSEDINNLS HALMLKTARD EVILPYWRQL IDGIKDLAAQ YRDIPLLSRT HGQPATPSTI GKEMANVAYR MERQYRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ WNPYTTQIEP HDYIAELFDC VARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH KVNPIDFENS EGNLGLSNAV LQHLASKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST LKGVSKLEVN RGHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK QFIDGLALPE EEKARLKAMT PANYIGRAIT MVDELK
|
| |