Gene ECH74115_4747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4747 
Symbol 
ID6971876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4394589 
End bp4395791 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content50% 
IMG OID643388448 
Productputative DNA protecting protein DprA 
Protein accessionYP_002272876 
Protein GI209399908 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.853316 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTT CAGCCAATGC ACAAGCAACT CTCCTGCTAA CCAGCGATTT TTCTCGCGCG 
GCGGCGAGTA AGTATAAACC TCTTAGTAAT AGTGAATGGG GGAAGTTTGC ATTATGGCTG
AAGCACCAAC GTATCAGTCC CGCCGAGCTT CTGGTGCCGC AACCGCAAGA GAAACTTACA
GGCTGGAGCG ATCCGCGTAT TTCTCAGGAG CGTATTCTTG GCTTGCTGGC GCGTGGTCAT
AGTCTGGCGT TGGCGGTAGA TAAGTGGCAA CGCGCCGGTT TATGGATCTT AACCCGCGGA
GATGCTGATT ATCCCGTTCG CTTGAAAAAC CGATTGCGAA CGGATGCACC TCCCGTTTTA
TTTGGCTGCG GGAATAAAGC ATTACTGCAA GCGGAAGGTA TGGCGATTGT TGGCTCGCGA
GATGCTCCGA CTGACGATTT GCGCTATACC CAACAACTGG CCGCGAAACT GGCCCAACAG
GGGATTTGCG TTATCTCTGG TGGTGCGCGA GGTATTGATG AATGTGCAAT GGCGTCGGCA
CTGGAGGCCG GGGGAACTGC CGTTGGCGTA TTAGCTGATA GCTTGTTAAA AACGAGTACG
TTAGTGAAAT GGCGTGAAGG GCTTATAGCA GGCAACCTGG TGTTGATTTC GCCGTTTTAC
CCAGAGGTAC GTTTCACCGT CGGCAATGCG ATGGCGCGAA ATAAATATAT TTATTGCCTT
GCTGAAAGCG CAATGGTTGT ACGTGCGGGA ATGACCGGTG GAACGATAAC CGGGGCGATG
GAGGCATTAA AACATCAGTG GCTGCCTGTG CAGGTTAAAC CAAATCAGGA TATGCAATCA
GCCAATTCAC GATTAGTAGA AAATGGGGCG TCATGGAGTG CTGAACAGGC TGAGAATGTG
ACGATCAGAC TGCCAGACGT TCCTGGGCTG ATGTATGACA GAGCACTCCG TAACGCTCAA
CCAGAACTGT TTTCGCTGCA TGAAGATGAC GCAAATTACG CAGTAATGCC CGCGTATACG
CCTGTCGATT TTTATCAACT CTTTGTGGCG GAACTGGCGA TCCTTGCAAA GGAATCGATA
AGTATTGAAA GGCTGGCGTC TTGTACTGGT TTAACCATCG AACAAATTAG TGTGTGGCTG
AACCGCGCAG AAGAAGAGGG AAGGGTTATC CGATTGGGCG AAGGTCATTA TCAGTTCAGG
TAA
 
Protein sequence
MNLSANAQAT LLLTSDFSRA AASKYKPLSN SEWGKFALWL KHQRISPAEL LVPQPQEKLT 
GWSDPRISQE RILGLLARGH SLALAVDKWQ RAGLWILTRG DADYPVRLKN RLRTDAPPVL
FGCGNKALLQ AEGMAIVGSR DAPTDDLRYT QQLAAKLAQQ GICVISGGAR GIDECAMASA
LEAGGTAVGV LADSLLKTST LVKWREGLIA GNLVLISPFY PEVRFTVGNA MARNKYIYCL
AESAMVVRAG MTGGTITGAM EALKHQWLPV QVKPNQDMQS ANSRLVENGA SWSAEQAENV
TIRLPDVPGL MYDRALRNAQ PELFSLHEDD ANYAVMPAYT PVDFYQLFVA ELAILAKESI
SIERLASCTG LTIEQISVWL NRAEEEGRVI RLGEGHYQFR