Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2822 |
Symbol | recA |
ID | 6146393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2897933 |
End bp | 2898994 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617691 |
Product | recombinase A |
Protein accession | YP_001744846 |
Protein GI | 170683219 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0468] RecA/RadA recombinase |
TIGRFAM ID | [TIGR02012] protein RecA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0783343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0000036093 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCTATCG ACGAAAACAA ACAGAAAGCG TTGGCGGCAG CACTGGGCCA GATTGAGAAA CAATTTGGTA AAGGCTCCAT CATGCGCCTG GGTGAAGACC GTTCCATGGA TGTGGAAACC ATCTCTACCG GTTCGCTTTC ACTGGATATC GCGCTTGGGG CAGGTGGTCT GCCGATGGGC CGTATCGTCG AAATCTACGG ACCGGAATCT TCCGGTAAAA CCACGCTGAC GTTGCAGGTG ATCGCCGCAG CCCAGCGCGA AGGTAAAACT TGTGCGTTTA TCGATGCTGA ACACGCGCTG GACCCAATCT ACGCACGTAA ACTGGGCGTC GATATCGACA ACCTGCTGTG CTCCCAGCCG GACACTGGCG AGCAGGCACT GGAAATCTGT GACGCCCTGG CACGTTCTGG CGCAGTAGAC GTTATCGTCG TTGACTCCGT GGCGGCACTG ACGCCGAAAG CGGAAATCGA AGGCGAAATC GGCGACTCTC ACATGGGCCT TGCGGCACGT ATGATGAGCC AGGCGATGCG TAAGCTGGCG GGTAACCTGA AGCAGTCCAA CACGCTGCTG ATCTTCATCA ACCAGATCCG TATGAAAATT GGTGTGATGT TCGGTAACCC GGAAACCACT ACCGGTGGTA ACGCGCTGAA ATTCTACGCC TCTGTTCGTC TCGACATCCG TCGTATCGGC GCGGTGAAAG AGGGCGAAAA CGTGGTGGGT AGCGAAACCC GTGTGAAAGT GGTGAAGAAC AAAATCGCTG CGCCGTTTAA ACAGGCTGAA TTCCAGATCC TCTACGGCGA AGGTATCAAC TTCTATGGCG AACTGGTTGA TCTGGGCGTG AAAGAGAAGC TGATCGAGAA AGCAGGCGCA TGGTACAGCT ACAAAGGTGA GAAGATCGGT CAGGGTAAAG CGAATGCGAC TGCCTGGCTG AAAGATAACC CGGAAACCGC GAAAGAGATC GAGAAGAAAG TACGTGAGTT GCTGCTGAGC AACCCGAACT CAACGCCGGA TTTCTCTGTA GATGACAGCG AAGGCGTAGC AGAAACTAAC GAAGATTTTT AA
|
Protein sequence | MAIDENKQKA LAAALGQIEK QFGKGSIMRL GEDRSMDVET ISTGSLSLDI ALGAGGLPMG RIVEIYGPES SGKTTLTLQV IAAAQREGKT CAFIDAEHAL DPIYARKLGV DIDNLLCSQP DTGEQALEIC DALARSGAVD VIVVDSVAAL TPKAEIEGEI GDSHMGLAAR MMSQAMRKLA GNLKQSNTLL IFINQIRMKI GVMFGNPETT TGGNALKFYA SVRLDIRRIG AVKEGENVVG SETRVKVVKN KIAAPFKQAE FQILYGEGIN FYGELVDLGV KEKLIEKAGA WYSYKGEKIG QGKANATAWL KDNPETAKEI EKKVRELLLS NPNSTPDFSV DDSEGVAETN EDF
|
| |