Gene EcHS_A2835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2835 
SymbolrecA 
ID5594734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2840125 
End bp2841186 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content54% 
IMG OID640921952 
Productrecombinase A 
Protein accessionYP_001459463 
Protein GI157162145 
COG category[L] Replication, recombination and repair 
COG ID[COG0468] RecA/RadA recombinase 
TIGRFAM ID[TIGR02012] protein RecA 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.0246819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATCG ACGAAAACAA ACAGAAAGCG TTGGCGGCAG CACTGGGCCA GATTGAGAAA 
CAATTTGGTA AAGGCTCCAT CATGCGCCTG GGTGAAGACC GTTCCATGGA TGTGGAAACC
ATCTCTACCG GTTCGCTTTC ACTGGATATC GCGCTTGGGG CAGGTGGTCT GCCGATGGGC
CGTATCGTCG AAATCTACGG ACCGGAATCT TCCGGTAAAA CCACGCTGAC GCTGCAGGTG
ATCGCCGCAG CGCAGCGTGA AGGTAAAACC TGTGCGTTTA TCGATGCTGA ACACGCGCTG
GACCCAATCT ACGCACGTAA ACTGGGCGTC GATATCGATA ACCTGCTGTG CTCCCAGCCG
GACACCGGCG AGCAGGCACT GGAAATCTGT GACGCCCTGG CGCGTTCTGG CGCAGTAGAC
GTTATCGTCG TTGACTCCGT GGCGGCACTG ACGCCGAAAG CGGAAATCGA AGGCGAAATC
GGCGACTCTC ACATGGGCCT TGCGGCACGT ATGATGAGCC AGGCGATGCG TAAGCTGGCG
GGTAACCTGA AGCAGTCCAA CACGCTGCTG ATCTTCATCA ACCAGATCCG TATGAAAATT
GGTGTGATGT TCGGTAACCC GGAAACCACT ACCGGTGGTA ACGCGCTGAA ATTCTACGCC
TCTGTTCGTC TCGACATCCG TCGTATCGGC GCGGTGAAAG AGGGCGAAAA CGTGGTGGGT
AGCGAAACCC GCGTGAAAGT GGTGAAGAAC AAAATCGCTG CGCCGTTTAA ACAGGCTGAA
TTCCAGATCC TCTACGGCGA AGGTATCAAC TTCTACGGCG AACTGGTTGA CCTGGGCGTA
AAAGAGAAGC TGATCGAGAA AGCAGGCGCG TGGTACAGCT ACAAAGGTGA GAAGATCGGT
CAGGGTAAAG CGAATGCGAC TGCCTGGCTG AAAGATAACC CGGAAACCGC GAAAGAGATC
GAGAAGAAAG TACGTGAGTT GCTGCTGAGC AACCCGAACT CAACGCCGGA TTTCTCTGTA
GATGATAGCG AAGGCGTAGC AGAAACTAAC GAAGATTTTT AA
 
Protein sequence
MAIDENKQKA LAAALGQIEK QFGKGSIMRL GEDRSMDVET ISTGSLSLDI ALGAGGLPMG 
RIVEIYGPES SGKTTLTLQV IAAAQREGKT CAFIDAEHAL DPIYARKLGV DIDNLLCSQP
DTGEQALEIC DALARSGAVD VIVVDSVAAL TPKAEIEGEI GDSHMGLAAR MMSQAMRKLA
GNLKQSNTLL IFINQIRMKI GVMFGNPETT TGGNALKFYA SVRLDIRRIG AVKEGENVVG
SETRVKVVKN KIAAPFKQAE FQILYGEGIN FYGELVDLGV KEKLIEKAGA WYSYKGEKIG
QGKANATAWL KDNPETAKEI EKKVRELLLS NPNSTPDFSV DDSEGVAETN EDF