Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4747 |
Symbol | |
ID | 6971876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4394589 |
End bp | 4395791 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643388448 |
Product | putative DNA protecting protein DprA |
Protein accession | YP_002272876 |
Protein GI | 209399908 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.853316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTTT CAGCCAATGC ACAAGCAACT CTCCTGCTAA CCAGCGATTT TTCTCGCGCG GCGGCGAGTA AGTATAAACC TCTTAGTAAT AGTGAATGGG GGAAGTTTGC ATTATGGCTG AAGCACCAAC GTATCAGTCC CGCCGAGCTT CTGGTGCCGC AACCGCAAGA GAAACTTACA GGCTGGAGCG ATCCGCGTAT TTCTCAGGAG CGTATTCTTG GCTTGCTGGC GCGTGGTCAT AGTCTGGCGT TGGCGGTAGA TAAGTGGCAA CGCGCCGGTT TATGGATCTT AACCCGCGGA GATGCTGATT ATCCCGTTCG CTTGAAAAAC CGATTGCGAA CGGATGCACC TCCCGTTTTA TTTGGCTGCG GGAATAAAGC ATTACTGCAA GCGGAAGGTA TGGCGATTGT TGGCTCGCGA GATGCTCCGA CTGACGATTT GCGCTATACC CAACAACTGG CCGCGAAACT GGCCCAACAG GGGATTTGCG TTATCTCTGG TGGTGCGCGA GGTATTGATG AATGTGCAAT GGCGTCGGCA CTGGAGGCCG GGGGAACTGC CGTTGGCGTA TTAGCTGATA GCTTGTTAAA AACGAGTACG TTAGTGAAAT GGCGTGAAGG GCTTATAGCA GGCAACCTGG TGTTGATTTC GCCGTTTTAC CCAGAGGTAC GTTTCACCGT CGGCAATGCG ATGGCGCGAA ATAAATATAT TTATTGCCTT GCTGAAAGCG CAATGGTTGT ACGTGCGGGA ATGACCGGTG GAACGATAAC CGGGGCGATG GAGGCATTAA AACATCAGTG GCTGCCTGTG CAGGTTAAAC CAAATCAGGA TATGCAATCA GCCAATTCAC GATTAGTAGA AAATGGGGCG TCATGGAGTG CTGAACAGGC TGAGAATGTG ACGATCAGAC TGCCAGACGT TCCTGGGCTG ATGTATGACA GAGCACTCCG TAACGCTCAA CCAGAACTGT TTTCGCTGCA TGAAGATGAC GCAAATTACG CAGTAATGCC CGCGTATACG CCTGTCGATT TTTATCAACT CTTTGTGGCG GAACTGGCGA TCCTTGCAAA GGAATCGATA AGTATTGAAA GGCTGGCGTC TTGTACTGGT TTAACCATCG AACAAATTAG TGTGTGGCTG AACCGCGCAG AAGAAGAGGG AAGGGTTATC CGATTGGGCG AAGGTCATTA TCAGTTCAGG TAA
|
Protein sequence | MNLSANAQAT LLLTSDFSRA AASKYKPLSN SEWGKFALWL KHQRISPAEL LVPQPQEKLT GWSDPRISQE RILGLLARGH SLALAVDKWQ RAGLWILTRG DADYPVRLKN RLRTDAPPVL FGCGNKALLQ AEGMAIVGSR DAPTDDLRYT QQLAAKLAQQ GICVISGGAR GIDECAMASA LEAGGTAVGV LADSLLKTST LVKWREGLIA GNLVLISPFY PEVRFTVGNA MARNKYIYCL AESAMVVRAG MTGGTITGAM EALKHQWLPV QVKPNQDMQS ANSRLVENGA SWSAEQAENV TIRLPDVPGL MYDRALRNAQ PELFSLHEDD ANYAVMPAYT PVDFYQLFVA ELAILAKESI SIERLASCTG LTIEQISVWL NRAEEEGRVI RLGEGHYQFR
|
| |