Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1449 |
Symbol | cho |
ID | 6142688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1434697 |
End bp | 1435584 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616328 |
Product | nucleotide excision repair endonuclease |
Protein accession | YP_001743508 |
Protein GI | 170683911 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.432222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTACGGC GTTTAACTTC TCCGCGGCTC GAATTTGAAG CTGCGGCAAT TTATGAATAT CCCGAACATT TACGTTCATT CCTTAATGAC TTACCCACCC GACCAGGGGT GTATCTGTTT CATGGCGAAA GTGACACCAT GCCGCTCTAT ATCGGCAAAA GCATTAACAT CCGCAGTCGG GTGCTTTCTC ATTTACGCAC CCCTGATGAA GCCGCCATGC TACGGCAATC CCGACGCATC AGCTGGATAT GTACCGCCGG TGAAATCGGC GCTCTGCTCC TTGAAGCGCG ATTAATCAAA GAACAACAGC CACTGTTTAA TAAACGCTTG CGGCGAAATC GCCAGCTCTG TGCCTTGCAA TTAAATGAAA AGCGCGTCGA TGTGGTGTAT GCCAAAGAGG TGGATTTTTC ACGAGCCCCC AACCTGTTTG GCCTGTTTGC CAATAGGCGC GCAGCTTTGC AAGCATTGCA GACCATCGCT GATGAACAAA AACTTTGTTA TGGTCTGTTG GGACTGGAGC CGTTAAGTCG CGGTCGTGCA TGTTTTCGTT CAGCGCTAAA ACGTTGCGCC GGAGCATGCT GCGGTAAAGA GAGCCATGAC GATCATGCGC TACGTTTGCG CCAGTCTCTG GAGCGTTTGC GGGTGGTGTG TTGGCCTTGG CAAGGGGCGG TGGCGCTGAA AGAACAGCAC CCGGAAATGA CTCAATATCA TATTATTCAA AACTGGCTGT GGCTGGGGGC GGTTAATTCG CTGGAAGAAG CGACAATGTT AATTCGGACA CCCGCCGGGT TTGATCACGA CGGTTATAAA ATTCTTTGTA AGCCGCTGCT TTCCGGTAAC TATGAAATTA CTGAACTTGA TCCGGCGAAT GACCAGCGAG CCAGTTGA
|
Protein sequence | MVRRLTSPRL EFEAAAIYEY PEHLRSFLND LPTRPGVYLF HGESDTMPLY IGKSINIRSR VLSHLRTPDE AAMLRQSRRI SWICTAGEIG ALLLEARLIK EQQPLFNKRL RRNRQLCALQ LNEKRVDVVY AKEVDFSRAP NLFGLFANRR AALQALQTIA DEQKLCYGLL GLEPLSRGRA CFRSALKRCA GACCGKESHD DHALRLRQSL ERLRVVCWPW QGAVALKEQH PEMTQYHIIQ NWLWLGAVNS LEEATMLIRT PAGFDHDGYK ILCKPLLSGN YEITELDPAN DQRAS
|
| |