Gene ECH74115_3946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3946 
SymbolrecA 
ID6968887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3650871 
End bp3651932 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content54% 
IMG OID643387719 
Productrecombinase A 
Protein accessionYP_002272162 
Protein GI209397319 
COG category[L] Replication, recombination and repair 
COG ID[COG0468] RecA/RadA recombinase 
TIGRFAM ID[TIGR02012] protein RecA 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.44116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.835205 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCG ACGAAAACAA ACAGAAAGCG TTGGCGGCAG CACTGGGCCA GATTGAGAAA 
CAATTTGGTA AAGGCTCCAT CATGCGCCTG GGTGAAGACC GTTCCATGGA TGTGGAAACC
ATCTCTACCG GTTCGCTTTC ACTGGATATC GCGCTTGGGG CAGGTGGTCT GCCGATGGGC
CGTATCGTCG AAATCTACGG ACCGGAATCT TCCGGTAAAA CCACGCTGAC GTTGCAGGTG
ATTGCCGCAG CGCAGCGTGA AGGTAAAACC TGTGCGTTTA TCGATGCTGA ACACGCGCTG
GACCCAATCT ACGCACGTAA ACTGGGCGTC GATATCGACA ACCTGCTGTG CTCCCAGCCG
GACACCGGCG AGCAGGCACT GGAAATCTGT GACGCCCTGG CGCGTTCTGG CGCAGTAGAC
GTTATCGTCG TTGACTCCGT GGCGGCACTG ACGCCGAAAG CGGAAATCGA AGGCGAAATC
GGCGACTCTC ACATGGGCCT TGCGGCACGT ATGATGAGCC AGGCGATGCG TAAGCTGGCG
GGTAACCTGA AGCAGTCCAA CACGCTGCTG ATCTTCATCA ACCAGATCCG TATGAAAATT
GGTGTGATGT TCGGTAACCC GGAAACCACT ACCGGTGGTA ACGCGCTGAA ATTCTACGCC
TCTGTTCGTC TCGACATCCG TCGTATCGGC GCGGTGAAAG AGGGCGAAAA CGTGGTGGGT
AGCGAAACCC GCGTGAAAGT GGTGAAGAAC AAAATCGCTG CGCCGTTTAA ACAGGCTGAA
TTCCAGATCC TCTACGGCGA AGGTATCAAC TTCTACGGCG AACTGGTTGA CCTGGGCGTA
AAAGAGAAGC TGATCGAGAA AGCAGGCGCG TGGTACAGCT ACAAAGGTGA GAAGATCGGT
CAGGGTAAAG CGAATGCAAC TGCCTGGCTG AAAGATAACC CGGAAACCGC GAAAGAGATC
GAGAAGAAAG TACGTGAGTT GCTGCTGAGC AACCCGAACT CAACGCCGGA TTTCTCTGTA
GATGATAGCG AAGGCGTAGC AGAAACTAAC GAAGATTTTT AA
 
Protein sequence
MAIDENKQKA LAAALGQIEK QFGKGSIMRL GEDRSMDVET ISTGSLSLDI ALGAGGLPMG 
RIVEIYGPES SGKTTLTLQV IAAAQREGKT CAFIDAEHAL DPIYARKLGV DIDNLLCSQP
DTGEQALEIC DALARSGAVD VIVVDSVAAL TPKAEIEGEI GDSHMGLAAR MMSQAMRKLA
GNLKQSNTLL IFINQIRMKI GVMFGNPETT TGGNALKFYA SVRLDIRRIG AVKEGENVVG
SETRVKVVKN KIAAPFKQAE FQILYGEGIN FYGELVDLGV KEKLIEKAGA WYSYKGEKIG
QGKANATAWL KDNPETAKEI EKKVRELLLS NPNSTPDFSV DDSEGVAETN EDF