Gene SeD_A3378 details


Gene Information

Locus tag: SeD_A3378
Symbol: recJ
ID: 6875145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3249037
End bp: 3250770
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 57%
IMG OID: 642786381
Product: ssDNA exonuclease RecJ
Protein accession: YP_002217019
Protein GI: 198243236
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
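
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3249037
end_bp = 3250770

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1734 577
```

Both results agree with the Gene Length (1734 bp) and Protein Length (577 aa) fields.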


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAACAAC AGAGACAACT TCGTCGGCGC GAGGCTGATG AGACGGCGGA ACTCCCCGCC 
GATCTTCCTC CATTACTACG ACGTTTATAT GCCAGCCGGG GCGTTCGTAG CGCCCGCGAA
CTGGAGCGCA GCGTGAAAGG AATGCTGCCC TGGCAACAGC TTAGCGGCAT AGATAACGCG
GTGGAGATCC TCTACAACGC CTTTCGCGAA GGTATCCGCA TTATTGTTGT TGGCGATTTT
GACGCCGACG GCGCGACCAG TACTGCGCTA AGCGTATTGG GAATGCGTGC GTTGGGATGT
GACAACATCA GTTATCTGGT GCCTAATCGC TTTGAGGACG GCTACGGTTT AAGCCCGGAA
GTGGTCGATC AGGCGAAGGC GCGCGGCGCG CAGCTTATCG TCACCGTAGA CAACGGCATT
TCATCCCACG CCGGCGTAGC GCATGCAAAG ACGTTGGGGA TTCCGGTGAT CGTGACCGAT
CACCACCTGC CAGGCGACAC GTTGCCGGAT GCCGAAGCGA TTATTAATCC CAATCTGCGC
GACTGCGAAT TCCCGTCTAA GTCGCTGGCG GGCGTCGGCG TGGCGTTTTA CCTGATGCTG
GCGTTGCGAA CATTTTTGCG CGACAAAGGA TGGTTCGACG AGCGCAACAT TGCGCCGCCG
AATTTGGCGG AGCTGCTGGA TTTAGTGGCG TTGGGAACGG TAGCAGACGT TGTGCCGCTG
GACGCTAACA ACCGTATTCT GACCTGGCAA GGGCTAAGTC GTATTCGTGC CGGGAAATGC
CGTCCGGGAA TTAAAGCGTT GCTGGAGATA TCGAATCGCG ATCCGCAGCA GCTTGCCGCC
AGTGATTTAG GCTTCGCGTT GGGGCCTCGC CTGAATGCCG CCGGCAGGCT GGATGATATG
TCCGTTGGTG TGGCGTTACT GTTGTGCGAT AACCTCGGCG AAGCGCGTGT TCTGGCCAGC
GAGCTGGATG CGCTTAACCA GACGCGCAAA GAAATAGAGC AGGGGATGCA GGCGGAAGCG
CTTATCCTGT GCGAAAAGCT TGAGCGCAGT AGTGAAACGC TTCCGGGCGG TCTGGCGATG
TATCATCCTG AATGGCACCA GGGCGTGGTC GGAATTCTGG CGTCGCGCAT TAAAGAGCGT
TTTCACCGCC CGGTGATCGC CTTTGCGCCT GCGGGCGACG GCACGCTAAA AGGCTCAGGC
CGATCGATTC AGGGGTTGCA TATGCGCGAT GCGCTGGAAC GGCTGGATAC GCTTTACCCT
GATCTGATGA TCAAGTTCGG CGGTCATGCG ATGGCGGCGG GATTGTCGCT GGAAGAGCAT
AAATTCGAGC AGTTTCAGCA ACGTTTTGGC GAGCTGGTGA CGGAATGGCT CGATCCTGCC
CTGTTGCAAG GCGAGGTGAT CTCCGATGGT CCATTAAGCG CGGCGGAGAT GTCTATGGAA
GTGGCGCAAC TGTTGCGGGA TGCCGGACCG TGGGGGCAAA TGTTCCCGGA ACCGTTATTC
GATGGTCGTT TCCGTCTGCT ACAGCAGCGT CTGGTGGGCG AGCGTCACCT CAAGGTGATG
GTGGAGCCTG TCGGCGGCGG CCCGCTGTTG GATGGCATCG CATTTAATAT TGATACGACC
TGTTGGCCGG ATAACGGCGT GCGGGAGGTA GAACTGGCTT ATAAACTGGA CATTAACGAG
TTTCGCGGCA ACCGTAGTTT ACAGATTATT ATCGATGATA TTTGGCCGCT ATGA
 
Protein sequence
MKQQRQLRRR EADETAELPA DLPPLLRRLY ASRGVRSARE LERSVKGMLP WQQLSGIDNA 
VEILYNAFRE GIRIIVVGDF DADGATSTAL SVLGMRALGC DNISYLVPNR FEDGYGLSPE
VVDQAKARGA QLIVTVDNGI SSHAGVAHAK TLGIPVIVTD HHLPGDTLPD AEAIINPNLR
DCEFPSKSLA GVGVAFYLML ALRTFLRDKG WFDERNIAPP NLAELLDLVA LGTVADVVPL
DANNRILTWQ GLSRIRAGKC RPGIKALLEI SNRDPQQLAA SDLGFALGPR LNAAGRLDDM
SVGVALLLCD NLGEARVLAS ELDALNQTRK EIEQGMQAEA LILCEKLERS SETLPGGLAM
YHPEWHQGVV GILASRIKER FHRPVIAFAP AGDGTLKGSG RSIQGLHMRD ALERLDTLYP
DLMIKFGGHA MAAGLSLEEH KFEQFQQRFG ELVTEWLDPA LLQGEVISDG PLSAAEMSME
VAQLLRDAGP WGQMFPEPLF DGRFRLLQQR LVGERHLKVM VEPVGGGPLL DGIAFNIDTT
CWPDNGVREV ELAYKLDINE FRGNRSLQII IDDIWPL
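
The gene and protein sequences above are printed in space-separated blocks of ten residues, and the gene begins with GTG although the protein begins with M. The following is a minimal sketch of how the two could be reconciled under translation table 11: it strips the block formatting, computes a GC fraction, and translates a CDS while reading an accepted start codon as methionine. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and `START_CODONS` is assumed to be only a subset of the initiators that table 11 allows.

```python
from itertools import product

# Standard amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the alternative start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a CDS, reading an accepted start codon as methionine."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as Met regardless of its usual meaning
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein above.
first_line = "GTGAAACAAC AGAGACAACT TCGTCGGCGC GAGGCTGATG AGACGGCGGA ACTCCCCGCC"
expected = clean("MKQQRQLRRR EADETAELPA")

print(translate(first_line) == expected)  # True
```

Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.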