Gene EcolC_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1013 
SymbolrecA 
ID6067411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1100294 
End bp1101355 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content54% 
IMG OID641600421 
Productrecombinase A 
Protein accessionYP_001724009 
Protein GI170019055 
COG category[L] Replication, recombination and repair 
COG ID[COG0468] RecA/RadA recombinase 
TIGRFAM ID[TIGR02012] protein RecA 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.580458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000301114 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCTATCG ACGAAAACAA ACAGAAAGCG TTGGCGGCAG CACTGGGCCA GATTGAGAAA 
CAATTTGGTA AAGGCTCCAT CATGCGCCTG GGTGAAGACC GTTCCATGGA TGTGGAAACC
ATCTCTACCG GTTCGCTTTC ACTGGATATC GCGCTTGGGG CAGGTGGTCT GCCGATGGGC
CGTATCGTCG AAATCTACGG ACCGGAATCT TCCGGTAAAA CCACGCTGAC GCTGCAGGTG
ATCGCCGCAG CGCAGCGTGA AGGTAAAACC TGTGCGTTTA TCGATGCTGA ACACGCGCTG
GACCCAATCT ACGCACGTAA ACTGGGCGTC GATATCGATA ACCTGCTGTG CTCCCAGCCG
GACACCGGCG AGCAGGCACT GGAAATCTGT GACGCCCTGG CGCGTTCTGG CGCAGTAGAC
GTTATCGTCG TTGACTCCGT GGCGGCACTG ACGCCGAAAG CGGAAATCGA AGGCGAAATC
GGCGACTCTC ACATGGGCCT TGCAGCACGT ATGATGAGCC AGGCGATGCG TAAGCTGGCG
GGTAACCTGA AGCAGTCCAA CACGCTGCTG ATCTTCATCA ACCAGATCCG TATGAAAATT
GGTGTGATGT TCGGTAACCC GGAAACCACT ACCGGTGGTA ACGCGCTGAA ATTCTACGCC
TCTGTTCGTC TCGACATCCG TCGTATCGGC GCGGTGAAAG AGGGCGAAAA CGTGGTGGGT
AGCGAAACCC GCGTGAAAGT GGTGAAGAAC AAAATCGCTG CGCCGTTTAA ACAGGCTGAA
TTCCAGATCC TCTACGGCGA AGGTATCAAC TTCTACGGCG AACTGGTTGA CCTGGGCGTA
AAAGAGAAGC TGATCGAGAA AGCAGGCGCG TGGTACAGCT ACAAAGGTGA GAAGATCGGT
CAGGGTAAAG CGAATGCGAC TGCCTGGCTG AAAGATAACC CGGAAACCGC GAAAGAGATC
GAGAAGAAAG TACGTGAGTT GCTGCTGAGC AACCCGAACT CAACGCCGGA TTTCTCTGTA
GATGATAGCG AAGGCGTAGC AGAAACTAAC GAAGATTTTT AA
 
Protein sequence
MAIDENKQKA LAAALGQIEK QFGKGSIMRL GEDRSMDVET ISTGSLSLDI ALGAGGLPMG 
RIVEIYGPES SGKTTLTLQV IAAAQREGKT CAFIDAEHAL DPIYARKLGV DIDNLLCSQP
DTGEQALEIC DALARSGAVD VIVVDSVAAL TPKAEIEGEI GDSHMGLAAR MMSQAMRKLA
GNLKQSNTLL IFINQIRMKI GVMFGNPETT TGGNALKFYA SVRLDIRRIG AVKEGENVVG
SETRVKVVKN KIAAPFKQAE FQILYGEGIN FYGELVDLGV KEKLIEKAGA WYSYKGEKIG
QGKANATAWL KDNPETAKEI EKKVRELLLS NPNSTPDFSV DDSEGVAETN EDF