Gene Information |
Locus tag | EcSMS35_3717 |
Symbol | |
ID | 6143927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3786095 |
End bp | 3787297 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641618543 |
Product | putative DNA protecting protein DprA |
Protein accession | YP_001745683 |
Protein GI | 170683496 |
COG category | [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
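The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption about this record's convention, consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3786095, 3787297)
print(length_bp)                  # 1203
print(protein_length(length_bp))  # 400
```

Both values agree with the Gene Length and Protein Length fields of this record.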
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTTT CAGCCAATGC ACAAGCGACC CTCCTGCTAA CCAGCGATTT TTCTCGCGCG GCGGCGAGTG AGTATAAACC TCTTAGTAAT AGTGAATGGG GGAAGTTTGC ATTATGGCTG AAGCACCAAC GTATCAGTCC CGCCGATCTT CTGGTGCCGC AACCGCAAGA GAAACTTACA GGCTGGAGCG ATCCGCGTAT TTCTCAGGAG CGTATTCTTG GCTTGCTGGC GCGTGGTCAT AGTTTGGCGT TGGCGGTAGA TAAGTGGCAA CGCGCCGGTT TATGGATCTT AACCCGCGGA GATGCCGATT ATCCCGTTCG CTTGAAAAAC CGATTGCGAA CGGATGCACC TCCCGTTTTA TTTGGCTGCG GGAATAAAGC ATTACTGCAA GCGGAAGGTA TGGCGATTGT TGGCTCGCGA GATGCTCCGA CTGCCGATTT GCGCTATACC CAACAACTGG CAGCGAAACT GGCCCAACAG GGGATTTGCG TTATCTCTGG TGGTGCGCGA GGTATTGATG AATGTGCAAT GGCGTCGGCA CTGGAGGCCG GGGGAACTGC CGTTGGCGTA TTAGCTGATA GCTTGTTAAA AACGAGTACG TTAGTGAAAT GGCGTGAAGG GCTTATAGCA GGCAACCTGG TGTTGATTTC GCCGTTTTAC CCAGAGGTAC GTTTCACCGT CGGCAATGCG ATGGCGCGAA ATAAATATAT TTATTGCCTT GCTGAAAGCG CAATGGTTGT ACGTGCGGGA ATGACCGGTG GAACGATAAC CGGGGCGATG GAGGCATTAA AACATCAGTG GCTGCCTGTG CAGGTTAAAC CAAATCAGGA TATGCAATCA GCCAATTCAC GATTAGTAGA AAATGGGGCG TCATGGAGTA CTGAACAGGC TGAGAATGTG ACGTTCAGAC TGCCAGACGT TACTGGGCTG ATGTATGACA AAGCACTCCG TAACGCTCAA CCAGAATTGT TTTCGCTGCA CGAAGATGAC GCAAATTACG CAGTGATGCC CTCGCATACC CCTGTCGATT TTTATCAACT TTTTGTGGCG GAACTGGCGA TCCTTGCAAA GGAATCGATA AGTATTGAAA GGCTGGCGTC TTGTACTGGT TTAACCATCG AACAAATTAG TGTGTGGCTG AACCGCGCAG AAGAAGAGGG AAGGGTTATC CGATTGGGCG AAGGTCATTA TCAGTTCAGG TAA
|
Protein sequence | MNLSANAQAT LLLTSDFSRA AASEYKPLSN SEWGKFALWL KHQRISPADL LVPQPQEKLT GWSDPRISQE RILGLLARGH SLALAVDKWQ RAGLWILTRG DADYPVRLKN RLRTDAPPVL FGCGNKALLQ AEGMAIVGSR DAPTADLRYT QQLAAKLAQQ GICVISGGAR GIDECAMASA LEAGGTAVGV LADSLLKTST LVKWREGLIA GNLVLISPFY PEVRFTVGNA MARNKYIYCL AESAMVVRAG MTGGTITGAM EALKHQWLPV QVKPNQDMQS ANSRLVENGA SWSTEQAENV TFRLPDVTGL MYDKALRNAQ PELFSLHEDD ANYAVMPSHT PVDFYQLFVA ELAILAKESI SIERLASCTG LTIEQISVWL NRAEEEGRVI RLGEGHYQFR
|
| |
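The sequence fields are printed in space-separated blocks of ten characters, and the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand). The following is a minimal sketch of how such a field could be joined, translated, and checked for GC content. It is not the database's own tooling: the helper names are hypothetical, and the codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares with the standard code (table 11 differs only in its permitted start codons).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest: T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the Gene sequence field from this record.
gene_prefix = join_blocks("ATGAATCTTT CAGCCAATGC ACAAGCGACC")
print(translate(gene_prefix))  # MNLSANAQAT
```

The translated prefix matches the first block of the Protein sequence field. Passing the complete Gene sequence field through `join_blocks` and `translate` should, if the record is internally consistent, give the Protein sequence field with its spaces removed, and `gc_percent` can be compared against the listed GC content in the same way.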