Gene SNSL254_A3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3031 
SymbolrecA 
ID6484821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2951353 
End bp2952414 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content54% 
IMG OID642738346 
Productrecombinase A 
Protein accessionYP_002042070 
Protein GI194446772 
COG category[L] Replication, recombination and repair 
COG ID[COG0468] RecA/RadA recombinase 
TIGRFAM ID[TIGR02012] protein RecA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00473022 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.000967818 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTATCG ACGAAAACAA ACAGAAAGCG TTGGCGGCAG CACTGGGCCA AATTGAAAAG 
CAATTTGGTA AAGGCTCCAT CATGCGTCTG GGTGAAGACC GTTCTATGGA TGTGGAAACT
ATCTCCACCG GTTCGCTTTC ACTGGACATC GCACTCGGTG CGGGCGGTCT GCCGATGGGG
CGTATCGTCG AAATTTACGG GCCGGAATCT TCCGGTAAAA CGACCCTGAC GCTGCAGGTG
ATTGCCGCTG CGCAGCGTGA AGGTAAAACC TGTGCGTTTA TCGATGCGGA ACACGCGCTT
GACCCTGTTT ACGCACGCAA GCTGGGCGTC GATATCGATA ACCTGCTCTG TTCTCAGCCG
GATACCGGCG AGCAGGCGCT GGAAATCTGT GACGCGCTGG CGCGTTCAGG CGCGGTGGAC
GTCATTGTGG TCGACTCCGT AGCGGCGCTA ACGCCGAAAG CGGAAATCGA AGGCGAAATC
GGCGACTCTC ACATGGGCCT CGCGGCGCGT ATGATGAGCC AGGCGATGCG TAAGCTGGCG
GGGAACCTGA AACAGTCCAA TACGCTGCTG ATCTTCATCA ACCAGATCCG TATGAAAATT
GGCGTGATGT TTGGTAACCC GGAAACCACC ACCGGCGGTA ACGCGCTGAA ATTCTACGCC
TCCGTTCGTC TTGATATCCG TCGTATTGGC GCGGTGAAAG AAGGCGATAA TGTCGTGGGT
AGCGAAACGC GTGTGAAAGT GGTGAAAAAC AAAATCGCCG CGCCGTTTAA GCAGGCCGAG
TTCCAGATCC TCTACGGCGA AGGCATCAAC TTCTATGGCG AACTGGTTGA CCTGGGCGTG
AAAGAGAAGC TGATCGAGAA AGCGGGCGCA TGGTACAGCT ACAACGGCGA GAAGATTGGC
CAGGGTAAAG CGAACGCGAC TACCTGGCTG AAAGAGAACC CGGCAACAGC GAAAGAGATT
GAGAAAAGAG TGCGTGAATT ACTGTTGAGT AATCAGAATG CCACGCCCGA TTTCGCCGTT
GACGATAGCG AAGGCGTTGC AGAAACCAAC GAAGATTTTT AA
 
Protein sequence
MAIDENKQKA LAAALGQIEK QFGKGSIMRL GEDRSMDVET ISTGSLSLDI ALGAGGLPMG 
RIVEIYGPES SGKTTLTLQV IAAAQREGKT CAFIDAEHAL DPVYARKLGV DIDNLLCSQP
DTGEQALEIC DALARSGAVD VIVVDSVAAL TPKAEIEGEI GDSHMGLAAR MMSQAMRKLA
GNLKQSNTLL IFINQIRMKI GVMFGNPETT TGGNALKFYA SVRLDIRRIG AVKEGDNVVG
SETRVKVVKN KIAAPFKQAE FQILYGEGIN FYGELVDLGV KEKLIEKAGA WYSYNGEKIG
QGKANATTWL KENPATAKEI EKRVRELLLS NQNATPDFAV DDSEGVAETN EDF