Gene EcSMS35_2822 details

Gene Information

Locus tag: EcSMS35_2822
Symbol: recA
ID: 6146393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2897933
End bp: 2898994
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 54%
IMG OID: 641617691
Product: recombinase A
Protein accession: YP_001744846
Protein GI: 170683219
COG category: [L] Replication, recombination and repair
COG ID: [COG0468] RecA/RadA recombinase
TIGRFAM ID: [TIGR02012] protein RecA
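
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene span includes a single terminal stop codon; both assumptions are consistent with the reported values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2897933
END_BP = 2898994


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields.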


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0783343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0000036093
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCTATCG ACGAAAACAA ACAGAAAGCG TTGGCGGCAG CACTGGGCCA GATTGAGAAA 
CAATTTGGTA AAGGCTCCAT CATGCGCCTG GGTGAAGACC GTTCCATGGA TGTGGAAACC
ATCTCTACCG GTTCGCTTTC ACTGGATATC GCGCTTGGGG CAGGTGGTCT GCCGATGGGC
CGTATCGTCG AAATCTACGG ACCGGAATCT TCCGGTAAAA CCACGCTGAC GTTGCAGGTG
ATCGCCGCAG CCCAGCGCGA AGGTAAAACT TGTGCGTTTA TCGATGCTGA ACACGCGCTG
GACCCAATCT ACGCACGTAA ACTGGGCGTC GATATCGACA ACCTGCTGTG CTCCCAGCCG
GACACTGGCG AGCAGGCACT GGAAATCTGT GACGCCCTGG CACGTTCTGG CGCAGTAGAC
GTTATCGTCG TTGACTCCGT GGCGGCACTG ACGCCGAAAG CGGAAATCGA AGGCGAAATC
GGCGACTCTC ACATGGGCCT TGCGGCACGT ATGATGAGCC AGGCGATGCG TAAGCTGGCG
GGTAACCTGA AGCAGTCCAA CACGCTGCTG ATCTTCATCA ACCAGATCCG TATGAAAATT
GGTGTGATGT TCGGTAACCC GGAAACCACT ACCGGTGGTA ACGCGCTGAA ATTCTACGCC
TCTGTTCGTC TCGACATCCG TCGTATCGGC GCGGTGAAAG AGGGCGAAAA CGTGGTGGGT
AGCGAAACCC GTGTGAAAGT GGTGAAGAAC AAAATCGCTG CGCCGTTTAA ACAGGCTGAA
TTCCAGATCC TCTACGGCGA AGGTATCAAC TTCTATGGCG AACTGGTTGA TCTGGGCGTG
AAAGAGAAGC TGATCGAGAA AGCAGGCGCA TGGTACAGCT ACAAAGGTGA GAAGATCGGT
CAGGGTAAAG CGAATGCGAC TGCCTGGCTG AAAGATAACC CGGAAACCGC GAAAGAGATC
GAGAAGAAAG TACGTGAGTT GCTGCTGAGC AACCCGAACT CAACGCCGGA TTTCTCTGTA
GATGACAGCG AAGGCGTAGC AGAAACTAAC GAAGATTTTT AA
 
Protein sequence
MAIDENKQKA LAAALGQIEK QFGKGSIMRL GEDRSMDVET ISTGSLSLDI ALGAGGLPMG 
RIVEIYGPES SGKTTLTLQV IAAAQREGKT CAFIDAEHAL DPIYARKLGV DIDNLLCSQP
DTGEQALEIC DALARSGAVD VIVVDSVAAL TPKAEIEGEI GDSHMGLAAR MMSQAMRKLA
GNLKQSNTLL IFINQIRMKI GVMFGNPETT TGGNALKFYA SVRLDIRRIG AVKEGENVVG
SETRVKVVKN KIAAPFKQAE FQILYGEGIN FYGELVDLGV KEKLIEKAGA WYSYKGEKIG
QGKANATAWL KDNPETAKEI EKKVRELLLS NPNSTPDFSV DDSEGVAETN EDF
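
The two sequences are printed in space-separated blocks of ten characters, so they need to be joined before use. The sketch below shows one way to do that, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. It assumes an unambiguous A/C/G/T alphabet and uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons, not in the amino acid assigned to each codon). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database API.

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT letters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGGCTATCG ACGAAAACAA ACAGAAAGCG"))  # MAIDENKQKA
```

Passing the full gene sequence to `translate` and the result to a string comparison against the cleaned protein sequence gives a quick consistency check; `gc_fraction` on the full gene sequence can likewise be compared with the GC content field.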