Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2235 |
Symbol | |
ID | 5592822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2222444 |
End bp | 2223448 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640921365 |
Product | ADP-ribosylglycohydrolase family protein |
Protein accession | YP_001458901 |
Protein GI | 157161583 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1397] ADP-ribosylglycohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAG AACGTATTCT CGGTGCTCTT TATGGGCAGG CGTTAGGGGA TGCGATGGGG ATGCCCTCCG AGCTTTGGCC ACGCAGCCGC GTTAAAGCAC ACTTTGGCTG GATTGACCGT TTTCTTCCTG GACCAAAGGA GAATAACGCG GCCTGTTATT TTAACCGCGC CGAATTCACC GACGATACCT CGATGGCGCT GTGTCTGGCG GATGCGTTAC TGGAACGTGA AGGCAAGATC GATCCGAATC TGATTGGGCG TAATATTCTC GACTGGGCGC TGCGTTTCGA CGCCTTTAAC AAAAACGTAC TTGGTCCGAC CTCGAAGATT GCGCTTAACG CCATTCGCGA CGGTAAACCC ATTGCTGAAC TGGAAAACAA CGGCGTGACC AACGGCGCAG CGATGCGCGT CTCGCCATTA GGTTGTTTGC TTCCGGCGCA CGATGTTGAT TCCTTTATTG ATGATGTGGC GCTGGCGTCC AGCCCGACCC ATAAATCCGA TCTGGCGGTT GCAGGCGCGG TAGTCATCGC ATGGGCGATT TCTCGTGCCA TTGACGGAGA AAGCTGGTCA GCGATTGTAG ATTCACTGCC TTCAATTGCG CGACATGCAC AGCAAAAACG CATCACTACC TTCAGCGCCT CACTGGCAGC ACGTCTGGAG ATTGCGCTGA AAATTGTGCG CAATGCCGAC GGCACCGAAT CCGCCAGCGA ACAGCTTTAC CAGGTCGTTG GCGCAGGTAC CAGCACTATT GAGTCCGTTC CGTGCGCCAT TGCGCTGGTT GAACTGGCAC AAACCGACCC GAATCGCTGC GCCGTCCTGT GCGCTAACCT TGGCGGCGAC ACAGACACCA TCGGCGCTAT GGCGACGGCG ATTTGCGGCG CGTTGCATGG CGTTAACGCT ATCGATCCTG CATTAAAGGC GGAACTGGAT GCGGTAAATC AGCTTGATTT CAACCGCTAT GCCACAGCGC TGGCGAAATA TCGTCAACAA CGGGAGGCGG TATGA
|
Protein sequence | MKTERILGAL YGQALGDAMG MPSELWPRSR VKAHFGWIDR FLPGPKENNA ACYFNRAEFT DDTSMALCLA DALLEREGKI DPNLIGRNIL DWALRFDAFN KNVLGPTSKI ALNAIRDGKP IAELENNGVT NGAAMRVSPL GCLLPAHDVD SFIDDVALAS SPTHKSDLAV AGAVVIAWAI SRAIDGESWS AIVDSLPSIA RHAQQKRITT FSASLAARLE IALKIVRNAD GTESASEQLY QVVGAGTSTI ESVPCAIALV ELAQTDPNRC AVLCANLGGD TDTIGAMATA ICGALHGVNA IDPALKAELD AVNQLDFNRY ATALAKYRQQ REAV
|
| |