Gene Information |
Locus tag | SbBS512_E1986 |
Symbol | cho |
ID | 6272806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1809858 |
End bp | 1810745 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641726039 |
Product | nucleotide excision repair endonuclease |
Protein accession | YP_001880533 |
Protein GI | 187732908 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | |
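The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1809858, 1810745)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 888 295
```

Under these assumptions the computed values agree with the Gene Length (888 bp) and Protein Length (295 aa) fields.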
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.282045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTACGGC GTTTAACTTC TCCGCGGCTC GAATTTGAAG CTGCGGCAAT TTATGAATAT CCCGAACATT TACGTTCATT CCTTAATGAC TTACCCACCC GACCAGGGGT GTATCTGTTT CATGGTGAAA GTGACACCAT GCCGCTCTAT ATCGGCAAAA GCGTTAACAT CCGCAGCCGC GTCCTTTCTC ATTTACGTAC CCCGGATGAA GCCGCCATGC TACGGCAATC CCGACGGATC AGCTGGATAT GTACCGCCGG TGAAATCGGC GCTCTGCTCC TTGAAGCGCG ATTAATCAAA GAACAACAGC CGCTGTTTAA TAAACGGTTG CGCCGCAATC GCCAGCTCTG TGCCCTGCAA TTAAATGAAA AGCGCGTCGA TGTGGTGTAT GCCAAAGAGG TGGATTTTTC ACGAGCCCCC AACCTGTTTG GCCTGTTTGC CAATAGGCGC GCAGCTTTGC AAGCATTGCA GACCATCGCT GATGAACAAA AACTTTGTTA TGGCCTGCTG GGACTGGAAC CGTTAAGTCG CGGTCGTGCA TGTTTTCGTT CAGCGCTAAA ACGTTGCGCC GGAGCATGCT GCGGTAAAGA GAGCCATGAG GAACATGCGC TACGCTTGCG CCAGTCTCTG GAGCGTTTGC GGGTGGTGTG TTGGCCTTGG CAAGGGGCGG TGGCGCTGAA AGAACAGCAC CCGGAAATGA CTCAATATCA TATTATTCAA AACTGGCTGT GGCTGGGGGC GGTTAATTCG CTGGAAGACG CGACAACGTT AATTCGGACA CCCGCCGGGT TTGATCACGA CGGTTATAAA ATTCTTTGTA AGCCGCTGCT TTCCGGTAAC TATGAAATTA CTGAACTTGA TCCGGCGAAT GACCAGCGAG CCAGTTGA
|
Protein sequence | MVRRLTSPRL EFEAAAIYEY PEHLRSFLND LPTRPGVYLF HGESDTMPLY IGKSVNIRSR VLSHLRTPDE AAMLRQSRRI SWICTAGEIG ALLLEARLIK EQQPLFNKRL RRNRQLCALQ LNEKRVDVVY AKEVDFSRAP NLFGLFANRR AALQALQTIA DEQKLCYGLL GLEPLSRGRA CFRSALKRCA GACCGKESHE EHALRLRQSL ERLRVVCWPW QGAVALKEQH PEMTQYHIIQ NWLWLGAVNS LEDATTLIRT PAGFDHDGYK ILCKPLLSGN YEITELDPAN DQRAS
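The relationship between the gene sequence, the protein sequence, and the Translation table and GC content fields can be sketched in plain Python. This is a minimal illustration, not the pipeline used to generate this record. It assumes translation table 11 shares the standard codon assignments and additionally reads an alternative start codon (such as the GTG that opens this gene) as methionine. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by table 11; each is read as M at the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
prefix = "GTGGTACGGC GTTTAACTTC TCCGCGGCTC"
print(translate(prefix))  # MVRRLTSPRL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared against the GC content field.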