Gene Information |
Locus tag | SbBS512_E0317 |
Symbol | sbcD |
ID | 6272296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 309243 |
End bp | 310445 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641724556 |
Product | exonuclease subunit SbcD |
Protein accession | YP_001879106 |
Protein GI | 187732912 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | [TIGR00619] exonuclease SbcD |
|
|
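The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the CDS ends in a single stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(309243, 310445)
print(length)                     # 1203
print(protein_length_aa(length))  # 400
```

The strand field does not affect these lengths; it only determines which strand of the replicon is read.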
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.639541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATCC TTCACACCTC AGACTGGCAT CTCGGCCAGA ACTTCTACAG TAAAAGCCGC GAAGCTGAAC ATCAGGCTTT TCTTGACTGG CTGCTGGAGA CAGCACAAAC CCATCAGGTG GATGCGATTA TTGTTGCTGG TGATGTTTTC GATACCGGCT CGCCGCCCAG TTACGCCCGC ACGTTATACA ACCGTTTTGT TGTCAATTTA CAGCAAACTG GCTGTCATCT GGTGGTACTG GCAGGAAACC ATGACTCGGT CGCCACGCTG AATGAATCGC GCGATATCAT GGCGTTCCTC AATACAACCG TGGTCGCCAG CGCCGGACAT GCGCCGCAAA TCTTGCCTCG TCGCGACGGG ACGCCAGGCG CAGTGCTGTG CTCCATTCCG TTTTTACGCC CGCGTGACAT TATTTCCAGC CAGGCGGGGC TTAACGGTAT TGAAAAACAG CAGCATTTAC TGGCAGCGAT TACCGATTAT TACCAACAAC ACTATGCCGA TGCCTGCAAA CTGCGCGGCG ATCAGCCTCT GCCCATCATC GCCACGGGAC ATTTAACGAC CGTGGGTGCC AGTAAAAGTG ACGCCGTGCG TGACATTTAT ATTGGCACGC TGGACGCGTT TCCGGCACAA AACTTTCCAC CAGCCGACTA CATAGCGCTC GGGCATATTC ACCGCGCACA GATTATTGGC GGCATGGAAC ATGTTCGCTA TTGCGGTTCC CCCATTCCAC TGAGTTTTGA TGAATGCGGT AAGAGTAAAT ATGTCCATCT GGTGACATTT TCAAACGGCA AATTAGAGAG CGTGGAAAAC CTGAACGTAC CGGTAACTCA ACCCATGGCG GTGCTGAAAG GCGATCTGGC GTCGATTACC GCACAGCTGG AACAGTGGCG CGATGTATCG CAGGAGCCAC CTGTCTGGCT GGATATCGAA ATCACTACTG ATGAGTATCT GCATGATATT CAGCGCAAAA TCCAGGCATT AACCGAATCA TTGCCTGTCG AAGTATTGCT GGTACGTCGG AGTCATGAAC AGCGCGAGCG TGTGTTAGCC AGCCAACAGC GTGAAACCCT CAGCGAACTC AGCGTCGAAG AGGTGTTCAA TCGCCGTCTG GCACTGGAAG AACTGGATGA ATCGCAGCAG CAACGTCTGC AGCATCTTTT CACCACGACG TTGCATACCC TCGCCGGAGA ACACGAAGCA TGA
|
Protein sequence | MRILHTSDWH LGQNFYSKSR EAEHQAFLDW LLETAQTHQV DAIIVAGDVF DTGSPPSYAR TLYNRFVVNL QQTGCHLVVL AGNHDSVATL NESRDIMAFL NTTVVASAGH APQILPRRDG TPGAVLCSIP FLRPRDIISS QAGLNGIEKQ QHLLAAITDY YQQHYADACK LRGDQPLPII ATGHLTTVGA SKSDAVRDIY IGTLDAFPAQ NFPPADYIAL GHIHRAQIIG GMEHVRYCGS PIPLSFDECG KSKYVHLVTF SNGKLESVEN LNVPVTQPMA VLKGDLASIT AQLEQWRDVS QEPPVWLDIE ITTDEYLHDI QRKIQALTES LPVEVLLVRR SHEQRERVLA SQQRETLSEL SVEEVFNRRL ALEELDESQQ QRLQHLFTTT LHTLAGEHEA
|
| |
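The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to run two simple checks (GC fraction and presence of a terminal stop codon). It is an illustration only: the helper names are hypothetical, the stop codon set is the standard one used by translation table 11 (TAA, TAG, TGA), and the example runs on the first two blocks of the gene sequence rather than the full record.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(display: str) -> str:
    """Remove the block spacing used for display and normalise to upper case."""
    return "".join(display.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS_TABLE_11


# First two display blocks of the gene sequence in this record.
fragment = clean_sequence("ATGCGCATCC TTCACACCTC")
print(len(fragment))                    # 20
print(round(gc_fraction(fragment), 2))  # 0.55
```

Applied to the full gene sequence, `gc_fraction` can be compared with the GC content field of the record, and `ends_with_stop` checks that the CDS is complete.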