Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3670 |
Symbol | dprA |
ID | 6272779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3411989 |
End bp | 3413113 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641727534 |
Product | DNA protecting protein DprA |
Protein accession | YP_001881969 |
Protein GI | 187731689 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.206561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGATA CAGATATTTG GCTGCGTTTA ATGAGCATCA GCAGCTTGTA CGGCGATGAT ATGGTCCGTA TAGCTCACTG GCTGGCAAAA CAGTCGCATA TTGATGCGGT TGTATTGCAG CAAACAGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCGCG AAAGAGTATC GAAAGCTCAC TTTGTTGGTT GGAGCAACCC AATCATCATT TAATCCCTGC GGACAGCGAA TTTTATCCTC CTCAACTTCT GGCGACGACA GATTACCCCG GCGCACTGTT TGTTGAAGGA GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG TATGGCGAGC GATGGGGACG ATTATTTTGC GAAACTCTGG CCACGCGTGG AGTGACAATT ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTGGCGCATA AAGCGGCCTT ACAGGTAAAT GGCGTCAGCA TTGCTGTATT GGGGAACGGA CTTAATACCA TTCATCCGCG CCGCCATGCC CGACTGGCTA CCAGTTTGCT TGAACATGGT GGGGCACTTG TCTCGGAATT TCCCCTCGAT GTTCCACCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA GGTGTACTGG TGGTGGAAGC GGCTTTGCGC AGTGGTTCGT TGGTGACAGC ACGTTGTGCG CTTGAGCAGG GGCGTGAAGT TTTTGCCTTG CCAGGACCAA TAGGGAATCC GGGAAGCGAA GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA CCAGATCAGC AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCAAC CTGTGCCAGA GGTAGTTACT CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
|
Protein sequence | MVDTDIWLRL MSISSLYGDD MVRIAHWLAK QSHIDAVVLQ QTGLTLRQAQ RFLSFPRKSI ESSLCWLEQP NHHLIPADSE FYPPQLLATT DYPGALFVEG ELHALHSFQL AVVGSRAHSW YGERWGRLFC ETLATRGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA RLATSLLEHG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA LEQGREVFAL PGPIGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS PDQQDVALPF PELLANVGDE VTPVDVVAER AGQPVPEVVT QLLELELAGW IAAVPGGYVR LRRACHVRRT NVFV
|
| |