Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4608 |
Symbol | dprA |
ID | 6968675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4270528 |
End bp | 4271652 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643388314 |
Product | DNA protecting protein DprA |
Protein accession | YP_002272742 |
Protein GI | 209395952 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0168836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGATA CAGAAATTTG GCTGCGTTTA ATGAGTATCA GCAGCTTGTA CGGCGATGAT ATGGTCCGTA TAGCTCACTG GCTGGCAAGA CAGTCGCATA TTGATGCGGT TGTATTGCAG CAAACAGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCGCG GAAGAGTATC GAAAGCTCAC TTTGTTGGTT GGAGCAACCC AACCATCATT TAATCCCTGC GGACAGCGAA TTTTATCCTC CTCAACTTCT GGCGACGACA GATTACCCCG GTGCACTGTT TGTTGAAGGA GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG TATGGCGAGC GATGGGGACG ATTATTTTGC GAAACTCTGG CGAAGCATGG AGTGACAATT ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTAGCGCATA AAGCAGCCTT ACAGGTAAAT GGCGTCAGCA TTGCTGTATT GGGGAATGGA CTTAATACCA TTCATCCCCG CCGTCATGCC CGACTGGCTG CCAGTCTGCT TGAACAGGGG GGCGCTCTCG TCTCGGAATT TCCCCTCGAT GTTCCACCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA GGTGTACTGG TGGTGGAAGC GGCTTTGCGT AGTGGTTCGC TGGTGACAGC ACGTTGTGCG CTTGAGCAGG GGCGAGAAGT TTTTGCCTTG CCAGGTCCAA TAGGGAATCC GGGAAGCGAA GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA CCAGATCAGG AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCGAC CTGTGCCAGA GGTAGTTACT CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
|
Protein sequence | MVDTEIWLRL MSISSLYGDD MVRIAHWLAR QSHIDAVVLQ QTGLTLRQAQ RFLSFPRKSI ESSLCWLEQP NHHLIPADSE FYPPQLLATT DYPGALFVEG ELHALHSFQL AVVGSRAHSW YGERWGRLFC ETLAKHGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA RLAASLLEQG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA LEQGREVFAL PGPIGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS PDQEDVALPF PELLANVGDE VTPVDVVAER AGRPVPEVVT QLLELELAGW IAAVPGGYVR LRRACHVRRT NVFV
|
| |