Gene Information |
Locus tag | EcE24377A_1294 |
Symbol | purB |
ID | 5588089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1296488 |
End bp | 1297858 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640924991 |
Product | adenylosuccinate lyase |
Protein accession | YP_001462400 |
Protein GI | 157154805 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC AGCGCGCTGC GCGGGATTTT CAGCGAATAT GGTTTGCTGA AATTCCGTGT ACAAGTTGAA GTACGTTGGC TGCAAAAACT GGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT GCCGACGCAA TCGGTTACCT TGATGCAATT GTCGCCAATT TCAGCGAAGA AGATGCCGCA CGCATCAAAA CCATCGAGCG TACCACTAAC CACGACGTTA AAGCGGTTGA GTATTTCCTG AAAGAAAAAG TGGCGGAGAT CCCGGAACTG CACGCGGTTT CTGAATTCAT CCACTTTGCC TGTACTTCGG AAGATATCAA TAACCTCTCC CACGCATTAA TGCTGAAAAC CGCGCGTGAT GAAGTGATCC TGCCGTACTG GCGTCAACTG ATTGATGGCA TTAAAGATCT CGCCGTTCAG TACCGCGATA TCCCACTGCT GTCTCGTACC CACGGTCAGC CAGCCACGCC GTCAACCATC GGTAAAGAGA TGGCAAACGT CGCCTACCGT ATGGAGCGCC AGTACCGCCA GCTTAATCAG GTGGAGATCC TCGGCAAAAT CAACGGCGCG GTCGGTAACT ATAACGCCCA TATCGCCGCT TACCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGTATTCAG TGGAACCCGT ACACCACCCA GATCGAACCG CACGACTACA TTGCCGAATT GTTTGATTGC GTTGCGCGCT TCAACACCAT TCTGATCGAC TTTGACCGTG ACGTCTGGGG TTATATCGCC CTTAACCACT TCAAACAGAA AACCATTGCT GGTGAGATTG GTTCTTCCAC CATGCCGCAT AAAGTTAACC CGATTGACTT CGAAAACTCC GAAGGGAATC TGGGCCTTTC CAACGCGGTA TTGCAGCATC TGGCAAGCAA ACTGCCGGTT TCCCGCTGGC AGCGTGACCT GACCGACTCC ACCGTGCTGC GTAACCTCGG CGTGGGTATC GGTTATGCGC TGATTGCGTA TCAATCCACC CTGAAAGGCG TGAGCAAACT GGAAGTAAAC CGTGACCATC TGCTGGATGA GCTGGATCAC AACTGGGAAG TGCTGGCTGA GCCAATCCAG ACAGTTATGC GTCGCTATGG CATCGAAAAA CCGTACGAGA AGCTGAAAGA GCTGACTCGC GGTAAGCGCG TTGACGCCGA AGGCATGAAG CAGTTTATCG ATGGTCTGGC GTTGCCAGAA GAAGAGAAAG CCCGCCTGAA AGCGATGACG CCGGCTAACT ATATTGGTCG AGCTATCACC ATGGTTGATG AGCTGAAATA A
|
Protein sequence | MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA ADAIGYLDAI VANFSEEDAA RIKTIERTTN HDVKAVEYFL KEKVAEIPEL HAVSEFIHFA CTSEDINNLS HALMLKTARD EVILPYWRQL IDGIKDLAVQ YRDIPLLSRT HGQPATPSTI GKEMANVAYR MERQYRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ WNPYTTQIEP HDYIAELFDC VARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH KVNPIDFENS EGNLGLSNAV LQHLASKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST LKGVSKLEVN RDHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK QFIDGLALPE EEKARLKAMT PANYIGRAIT MVDELK
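The length, GC content, and protein fields of this record can be cross-checked against the sequences themselves. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene sequence is listed in coding orientation even though the gene is on the minus strand (it begins with ATG, which fits that reading). The function names `clean`, `span_length`, `gc_fraction`, and `translate` are hypothetical helpers introduced here for illustration. The codon table is the standard assignment, which translation table 11 shares for internal codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    gene_len = span_length(1296488, 1297858)
    print(gene_len)           # 1371, matching the Gene Length field
    print(gene_len // 3 - 1)  # 456 codons before the stop, matching Protein Length
    print(translate("ATGGAATTAT CCTCACTGAC CGCCGTTTCC"))  # MELSSLTAVS
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the Protein sequence and GC content fields above.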