Gene ECH74115_4608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4608 
SymboldprA 
ID6968675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4270528 
End bp4271652 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content51% 
IMG OID643388314 
ProductDNA protecting protein DprA 
Protein accessionYP_002272742 
Protein GI209395952 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0168836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGATA CAGAAATTTG GCTGCGTTTA ATGAGTATCA GCAGCTTGTA CGGCGATGAT 
ATGGTCCGTA TAGCTCACTG GCTGGCAAGA CAGTCGCATA TTGATGCGGT TGTATTGCAG
CAAACAGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCGCG GAAGAGTATC
GAAAGCTCAC TTTGTTGGTT GGAGCAACCC AACCATCATT TAATCCCTGC GGACAGCGAA
TTTTATCCTC CTCAACTTCT GGCGACGACA GATTACCCCG GTGCACTGTT TGTTGAAGGA
GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG
TATGGCGAGC GATGGGGACG ATTATTTTGC GAAACTCTGG CGAAGCATGG AGTGACAATT
ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTAGCGCATA AAGCAGCCTT ACAGGTAAAT
GGCGTCAGCA TTGCTGTATT GGGGAATGGA CTTAATACCA TTCATCCCCG CCGTCATGCC
CGACTGGCTG CCAGTCTGCT TGAACAGGGG GGCGCTCTCG TCTCGGAATT TCCCCTCGAT
GTTCCACCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA
GGTGTACTGG TGGTGGAAGC GGCTTTGCGT AGTGGTTCGC TGGTGACAGC ACGTTGTGCG
CTTGAGCAGG GGCGAGAAGT TTTTGCCTTG CCAGGTCCAA TAGGGAATCC GGGAAGCGAA
GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG
GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA
CCAGATCAGG AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG
GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCGAC CTGTGCCAGA GGTAGTTACT
CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA
TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
 
Protein sequence
MVDTEIWLRL MSISSLYGDD MVRIAHWLAR QSHIDAVVLQ QTGLTLRQAQ RFLSFPRKSI 
ESSLCWLEQP NHHLIPADSE FYPPQLLATT DYPGALFVEG ELHALHSFQL AVVGSRAHSW
YGERWGRLFC ETLAKHGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA
RLAASLLEQG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA
LEQGREVFAL PGPIGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS
PDQEDVALPF PELLANVGDE VTPVDVVAER AGRPVPEVVT QLLELELAGW IAAVPGGYVR
LRRACHVRRT NVFV