Gene Information |
Locus tag | SbBS512_E1309 |
Symbol | purB |
ID | 6270496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1195524 |
End bp | 1196894 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641725428 |
Product | adenylosuccinate lyase |
Protein accession | YP_001879939 |
Protein GI | 187730123 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
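
The coordinates and lengths listed above are mutually consistent, and a short calculation shows this. The sketch below is a minimal check in Python. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(1195524, 1196894)
print(length)                  # 1371
print(protein_length(length))  # 456
```

Both results match the Gene Length and Protein Length rows, which supports the inclusive-coordinate reading.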
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC AGCGCGCTGC GCGGGATTTT CAGCGAATAT GGTTTGCTGA AATTCCGTGT ACAAGTTGAA GTACGTTGGC TGCAAAAACT GGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT GCCGACGCAA TCGGTTACCT TGATGCAATT GTCGCCAATT TCAGCGAAGA AGATGCCGCA CGCATCAAAA CCATCGAGCG TACCACTAAC CACGACGTTA AAGCGGTTGA GTATTTCCTG AAAGAAAAAG TGGCGGAGAT CCCGGAACTG CACGCGGTTT CTGAATTCAT CCACTTTGCC TGTACTTCGG AAGATATCAA TAACCTCTCC CACGCATTAA TGCTGAAAAC CGCGCGTGAT GAAGTGATCC TGCCGTACTG GCGTCAACTG ATTGATGGCA TTAAAGATCT CGCCGTTCAG TACCGCGATA TCCCGCTGCT GTCTCGTACC CACGGTCAGC CAGCCACGCC GTCAACCATC GGTAAAGAGA TGGCAAACGT CGCCTACCGT ATGGAGCGCC AGTACCGCCA GCTTAACCAG GTGGAGATCC TCGGCAAAAT CAACGGCGCG GTCGGTAACT ATAACGCCCA CATCGCCGCT TACCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGTATTCAG TGGAACCCGT ACACTACCCA GATTGAACCG CACGACTACA TTGCCGAACT GTTTGATTGC GTTGCGCGCT TCAACACCAT TCTGATCGAC TTTGACCGTG ACGTCTGGGG TTATATCGCC CTTAACCACT TCAAACAGAA AACCATTGCT GGTGAGATTG GTTCTTCCAC CATGCCGCAT AAAGTTAACC CGATCGACTT CGAGAACTCC GAAGGGAATC TGGGCCTTTC CAACGCGGTA TTACAGCATC TGGCAAGCAA ACTGCCGGTT TCCCGCTGGC AGCGTGACCT GACCGACTCC ACCGTGCTGC GTAACCTCGG CGTGGGTATC GGTTATGCGC TGATTGCGTA TCAATCCACC CTGAAAGGCG TGAGCAAACT GGAAGTGAAC CGTGACCATC TGCTGGATGA GCTGGATCAC AACTGGGAAG TGCTGGCGGA GCCAATCCAG ACAGTTATGC GTCGCTATGG CATCGAAAAA CCGTACGAGA AGCTGAAAGA GCTGACTCGT GGTAAGCGCG TTGACGCCGA AGGCATGAAG CAGTTTATCG ACGGTCTGGC GTTGCCAGAA GAAGAGAAAG CCCGCCTGAA AGCGATGACG CCGGCAAACT ACATTGGTCG CGCCATCACC ATGGTTGATG AGCTGAAATA A
|
Protein sequence | MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA ADAIGYLDAI VANFSEEDAA RIKTIERTTN HDVKAVEYFL KEKVAEIPEL HAVSEFIHFA CTSEDINNLS HALMLKTARD EVILPYWRQL IDGIKDLAVQ YRDIPLLSRT HGQPATPSTI GKEMANVAYR MERQYRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ WNPYTTQIEP HDYIAELFDC VARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH KVNPIDFENS EGNLGLSNAV LQHLASKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST LKGVSKLEVN RDHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK QFIDGLALPE EEKARLKAMT PANYIGRAIT MVDELK
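
The sequences above are printed in blocks of 10 characters, so the spaces must be removed before any analysis. The sketch below shows one way to do that in plain Python, and then to compute GC content and translate the coding sequence. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAA even though the strand is "-"), so no reverse complement is applied. The codon-to-residue mapping is the one shared by the standard code and translation table 11. All function names are hypothetical.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
# Translation table 11 uses the same codon-to-residue mapping as the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes the sequence starts in frame; an alternative start codon
    (allowed in table 11) would not be converted to M by this sketch.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGGAATTAT CCTCACTGAC CGCCGTTTCC"
print(translate(prefix))    # MELSSLTAVS
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the protein sequence. Running the same functions on the full gene sequence should reproduce the remaining residues and give a GC fraction to compare with the GC content row.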