Gene Information |
Locus tag | EcE24377A_1963 |
Symbol | cho |
ID | 5590367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1949137 |
End bp | 1950024 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640925634 |
Product | nucleotide excision repair endonuclease |
Protein accession | YP_001463037 |
Protein GI | 157156765 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | |
|
|
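The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the "Start bp" and "End bp" rows of this record.
gene_len, protein_len = cds_lengths(1949137, 1950024)
print(gene_len, protein_len)
```

With the values in this record, the result agrees with the listed Gene Length (888 bp) and Protein Length (295 aa).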
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTACGGC GTTTAACTTC TCCGCGGCTC GAATTTGAAG CTGCGGCAAT TTATGAATAT CCCGAACATT TACGTTCATT CCTTAATGAC TTACCCACCC GACCAGGGGT GTATCTGTTT CATGGCGAAA GTGACACCAT GCCGCTCTAT ATCGGCAAAA GCGTTAATAT CCGCAGTCGG GTGCTTTCTC ATTTACGCAC CCCGGATGAA GCCGCCATGC TACGGCAATC CCGGCGAATT AGCTGGATCT GCACCGCAGG CGAAATCGGC GCTCTGCTCC TTGAAGCGCG ATTAATCAAA GAACAACAGC CGCTGTTTAA TAAACGGTTG CGCCGCAATC GCCAGCTCTG TGCCCTGCAA TTAAATGAAA AGCGCGTCGA TGTGGTGTAT GCCAAAGAGG TGGATTTTTC ACGAGCCCCC AACCTGTTTG GCCTGTTTGC CAATAGGCGC GCAGCTTTGC AAGCATTGCA GAGCATCGCT GATGAACAAA AACTTTGTTA TGGCCTGCTG GGACTGGAAC CGTTAAGTCG CGGTCGTGCA TGTTTTCGTT CAGCGCTAAA ACGTTGCGCC GGAGCATGCT GCGGTAAAGA GAGCCATGAG GAACATGCGC TACGCTTGCG CCAGTCTCTG GAGCGTTTGC GGGTGGTGTG TTGGCCTTGG CAAGGGGCGG TGGCGCTGAA AGAACAGCAC CCGGAAATGA CTCAATATCA TATTATTCAA AACTGGCTGT GGCTGGGGGC GGTTAATTCG CTGGAAGACG CGACAACGTT AATTCGGACA CCCGCCGGGT TTGATCACGA CGGTTATAAA ATTCTTTGTA AGCCGCTGCT TTCCGGTAAC TATGAAATTA CTGAACTTGA TCCGGCGAAT GACCAGCGAG CCAGTTGA
|
Protein sequence | MVRRLTSPRL EFEAAAIYEY PEHLRSFLND LPTRPGVYLF HGESDTMPLY IGKSVNIRSR VLSHLRTPDE AAMLRQSRRI SWICTAGEIG ALLLEARLIK EQQPLFNKRL RRNRQLCALQ LNEKRVDVVY AKEVDFSRAP NLFGLFANRR AALQALQSIA DEQKLCYGLL GLEPLSRGRA CFRSALKRCA GACCGKESHE EHALRLRQSL ERLRVVCWPW QGAVALKEQH PEMTQYHIIQ NWLWLGAVNS LEDATTLIRT PAGFDHDGYK ILCKPLLSGN YEITELDPAN DQRAS
|
| |
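The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine when it initiates translation. The following is a minimal sketch of that rule, not the database's own tooling: the names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical, and the start-codon set is the one commonly listed for table 11.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code
# and differs in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons commonly listed for table 11 (assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiating start codon as M.

    Spaces are ignored so the 10-base blocks of the record can be pasted
    directly. Translation stops at the first stop codon.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is decoded as methionine
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGGTACGGC GTTTAACTTC TCCGCGGCTC"))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the listed protein sequence; the final blocks (`GACCAGCGAG CCAGTTGA`) translate to the terminal residues followed by the TGA stop codon.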