Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3581 |
Symbol | dprA |
ID | 6144004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3659671 |
End bp | 3660795 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618408 |
Product | DNA protecting protein DprA |
Protein accession | YP_001745548 |
Protein GI | 170681650 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0315496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.18869 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGATA CAGAAATTTG GCTGCGTTTA ATGAGCATCA GCAGCTTGTA CGGCGATGAT ATGGTCCGTA TAGCTCACTG GCTGGCAAAA CAGTCGCATA TTGATGCGGT TGTATTGCAG CAAACTGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCGCG AAAGAGTATC GAAAGCTCAC TTTCTTGGTT GGAGCAACCC AACCATCATT TAATTCCTGC GGACAGCGAA TTTTATCCTC CTCAACTTCT TGCGACGACA GATTACCCCG GCGCACTGTT TGTTGAAGGA GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG TATGGCGAAC GCTGGGGACG ATTATTTTGC GAAACTCTGG CGACGCGTGG AGTGACAATT ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTGGCGCATA AAGCGGCCTT ACAGGTAAAT GGCGTCAGCA TTGCTGTATT AGGGAATGGA CTTAATACCA TTCATCCGCG TCGCCATGCC CGACTGGCTA CCAGTTTGCT TGAACATGGT GGGGCTCTTG TCTCGGAATT TCCCCTCGAT GTTCCTCCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA GGTGTACTGG TGGTGGAAGC GGCTTTGCGC AGTGGTTCGC TGGTGACAGC ACGTTGTGCG CTTGAGCAGG GGCGTGAAGT TTTTGCCTTG CCGGGTCCAT TAGGGAATCC GGGAAGCGAA GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA CCAGATCAGG AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCAAC CTGTGCCAGA GGTAGTAACT CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
|
Protein sequence | MVDTEIWLRL MSISSLYGDD MVRIAHWLAK QSHIDAVVLQ QTGLTLRQAQ RFLSFPRKSI ESSLSWLEQP NHHLIPADSE FYPPQLLATT DYPGALFVEG ELHALHSFQL AVVGSRAHSW YGERWGRLFC ETLATRGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA RLATSLLEHG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA LEQGREVFAL PGPLGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS PDQEDVALPF PELLANVGDE VTPVDVVAER AGQPVPEVVT QLLELELAGW IAAVPGGYVR LRRACHVRRT NVFV
|
| |