Gene EcSMS35_3025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3025 
SymbolrecJ 
ID6146377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3115173 
End bp3116906 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content57% 
IMG OID641617894 
ProductssDNA exonuclease RecJ 
Protein accessionYP_001745045 
Protein GI170683389 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACAAC AGATACAACT TCGTCGCCGT GAAGTCGATG AAACGGCAGA CTTGCCCGCG 
GAATTGCCTC CCTTGCTGCG CCGTTTATAC GCCAGCCGGG GAGTACGCAG TGCGCATGAA
CTGGAACGCA GTGTTAAAGG TATGCTGCCC TGGCAGCAAC TGAGCGGCGT CGAAAAGGCC
GTTGAGATCC TTTACAACGC TTTTCGCGAA GGAACGCGGA TTATTGTGGT CGGTGATTTC
GACGCCGACG GCGCGACCAG CACGGCTCTA AGCGTGCTGG CGATGCGCTC GCTTGGTTGC
AGCAATATCG ACTACCTGGT ACCAAACCGT TTCGAAGACG GTTACGGCTT AAGCCCGGAA
GTAGTCGATC AGGCCCATGC CCGTGGCGCG CAGTTAATTG TCACGGTGGA TAACGGTATT
TCCTCCCATG CGGGGGTTGA ACACGCTCGT TCGTTGGGCA TTCCGGTTAT TGTTACCGAT
CACCATTTGC CGGGCGACAC ATTACCCGCA GCGGAAGCGA TCATTAACCC TAACTTGCGC
GACTGTAATT TCCCGTCGAA ATCACTGGCA GGCGTGGGTG TGGCGTTTTA TCTGATGCTG
GCTCTGCGCA CCTTTTTGCG CGATCAGGGC TGGTTTGATG AGCGCGGCAT CGCAATTCCT
AATCTGGCAG AACTGCTGGA TCTGGTGGCG TTGGGGACAG TGGCAGACGT GGTGCCGCTG
GACGCTAATA ACCGTATTCT GACCTGGCAG GGGATGAGCC GCATCCGCGC CGGGAAGTGC
CGTCCGGGGA TTAAAGCGCT GCTGGAAGTG GCCAATCGCG ATGCGCAAAA ACTCGCCGCC
AGCGATTTAG GTTTTGCGCT GGGGCCGCGT CTCAATGCCG CCGGGCGACT GGATGATATG
TCCGTTGGTG TGGCGCTGTT GCTGTGCGAC AACATCGGCG AAGCGCGCGT GCTGGCAAAT
GAACTCGATG CGCTAAACCA GACGCGAAAA GAGATCGAAC AGGGAATGCA GATTGAAGCC
CTGACCCTGT GCGAGAAACT GGAGCGCAGC CGCGACACGC TACCCGGTGG GCTGGCAATG
TATCACCCCG AATGGCATCA GGGCGTGGTC GGCATTCTGG CTTCGCGCAT CAAAGAGCGT
TTTCATCGTC CGGTTATCGC CTTTGCGCCA GCAGGTGATG GTACGCTGAA AGGTTCCGGT
CGCTCCATTC AGGGGCTGCA TATGCGTGAT GCGCTGGAGC GGTTAGACAT GCTCTACCCT
GGCATGATGC TCAAGTTTGG CGGTCATGCG ATGGCGGCTG GTTTGTCGCT GGAAGAGGAT
AAATTCGAAC TCTTTCAACA ACGGTTTGGC GAACTGGTTA CTGAGTGGCT GGATCCTTCA
CTATTGCAAG GCGAAGTGGT GTCAGACGGC CCGTTAAGCC CGGCCGAAAT GACCATGGAA
GTAGCGCAGC TGCTGCGCGA TGCTGGCCCG TGGGGGCAGA TGTTCCCGGA GCCGCTGTTT
GACGGTCATT TCCGCCTTCT GCAACAACGA CTGGTAGGCG AACGCCATTT GAAGGTCATG
GTTGAACCGG TCGGCGGCGG CCCGCTGCTG GACGGCATTG CCTTTAACGT TGATACCGCC
CTTTGGCCGG ATAACGGCGT GCGCGAAGTG CAACTGGCTT ATAAGCTTGA TATCAACGAG
TTTCGCGGCA ATCGCAGCCT GCAAATTATC ATCGACAATA TCTGGCCAAT TTAG
 
Protein sequence
MKQQIQLRRR EVDETADLPA ELPPLLRRLY ASRGVRSAHE LERSVKGMLP WQQLSGVEKA 
VEILYNAFRE GTRIIVVGDF DADGATSTAL SVLAMRSLGC SNIDYLVPNR FEDGYGLSPE
VVDQAHARGA QLIVTVDNGI SSHAGVEHAR SLGIPVIVTD HHLPGDTLPA AEAIINPNLR
DCNFPSKSLA GVGVAFYLML ALRTFLRDQG WFDERGIAIP NLAELLDLVA LGTVADVVPL
DANNRILTWQ GMSRIRAGKC RPGIKALLEV ANRDAQKLAA SDLGFALGPR LNAAGRLDDM
SVGVALLLCD NIGEARVLAN ELDALNQTRK EIEQGMQIEA LTLCEKLERS RDTLPGGLAM
YHPEWHQGVV GILASRIKER FHRPVIAFAP AGDGTLKGSG RSIQGLHMRD ALERLDMLYP
GMMLKFGGHA MAAGLSLEED KFELFQQRFG ELVTEWLDPS LLQGEVVSDG PLSPAEMTME
VAQLLRDAGP WGQMFPEPLF DGHFRLLQQR LVGERHLKVM VEPVGGGPLL DGIAFNVDTA
LWPDNGVREV QLAYKLDINE FRGNRSLQII IDNIWPI