Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0965 |
Symbol | |
ID | 6143048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 975064 |
End bp | 976068 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615852 |
Product | ADP-ribosylglycohydrolase family protein |
Protein accession | YP_001743044 |
Protein GI | 170681153 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1397] ADP-ribosylglycohydrolase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAG AACGTATTCT CGGTGCTCTT TATGGGCAGG CGTTAGGGGA TGCGATGGGG ATGCCCTCCG AGCTTTGGCC GCGCAGTCGC GTCAAAGCAC ACTTTGGCTG GATTGACCGT TTTCTGCCTG GGCCAAAGGA GAATAACGCA GCCTGTTATT TTAACCGCGC CGAATTCACC GACGATACCT CGATGGCGCT GTGTCTGGCG GATGCGTTAC TGGAACGTGA AGGCAAGATC GATCCGGATC TGATTGGGCG TAATATTCTC GACTGGGCGC TGCGTTTCGA CGCCTTTAAC AAAAACGTAC TAGGTCCGAC CTCGAAGATT GCGCTTAACG CCATTCGCGA CGGTAAACCC GTTGCTGAAC TGGAAAATAA CGGCGTGACC AACGGCGCAG CGATGCGCGT CTCGCCATTA GGTTGTTTGC TTCCGGCGCG CGATATTGAC TCTTTTATTG ATGATGTGGC GCTGGCCTCC AGCCCGACAC ATAAATCCGA TCTGGCGGTT GCGGGGGCGG TAGTCATCGC ATGGGCGATT TCTCGTGCCA TTGACGGAGA AAGCTGGTCA GCGATTGTTG ATTCACTGCC TTCAATTGCG CGACATGCTC AACAAAAACG CATCACTACC TTCAGCGCCT CACTGGCAGC ACGTCTGGAG ATTGCGCTGA AAATTGTGCG CAATGCCGAT GGCACCGAAT CCGCCAGCGA ACAGCTTTAC CAGGTCGTTG GCGCAGGTAC CAGCACTATT GAGTCCGTTC CGTGCGCCAT TGCGCTGGTT GAACTGGCAC AAACCGACCC GAACCGTTGC GCCGTCCTGT GCGCTAACCT TGGCGGCGAC ACAGACACCA TCGGTGCTAT GGCGACGGCA ATTTGTGGCG CGTTGCATGG CGTTAACGCT ATCGATCCTG CGTTAAAGGC GGAACTGGAT GCAGTAAATC AGCTTGATTT CAACCGCTAT GCCACGGCGC TGGCGAAGTA TCGCCAACAA CGGGAGGCGA TATGA
|
Protein sequence | MKTERILGAL YGQALGDAMG MPSELWPRSR VKAHFGWIDR FLPGPKENNA ACYFNRAEFT DDTSMALCLA DALLEREGKI DPDLIGRNIL DWALRFDAFN KNVLGPTSKI ALNAIRDGKP VAELENNGVT NGAAMRVSPL GCLLPARDID SFIDDVALAS SPTHKSDLAV AGAVVIAWAI SRAIDGESWS AIVDSLPSIA RHAQQKRITT FSASLAARLE IALKIVRNAD GTESASEQLY QVVGAGTSTI ESVPCAIALV ELAQTDPNRC AVLCANLGGD TDTIGAMATA ICGALHGVNA IDPALKAELD AVNQLDFNRY ATALAKYRQQ REAI
|
| |