Gene ECH74115_4183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4183 
SymbolrecJ 
ID6967880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3878919 
End bp3880652 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content57% 
IMG OID643387927 
ProductssDNA exonuclease RecJ 
Protein accessionYP_002272366 
Protein GI209400065 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACAAC AGATACAACT TCGTCGCCGT GAAGTCGATG AAACGGCAGA CTTGCCCGCG 
GAATTGCCTC CCTTGCTGCG CCGTTTATAC GCCAGCCGGG GAGTACGCAG TGCGCAAGAA
CTGGAACGCA GTGTTAAAGG TATGCTGCCC TGGCAGCAAC TGAGCGGCGT CGAAAAGGCC
GTTGAGATCC TTTACAACGC TTTTCGCGAA GGAACGCGGA TTATTGTGGT CGGTGATTTC
GACGCCGACG GTGCCACCAG CACGGCTCTA AGCGTGCTGG CGATGCGCTC GCTTGGTTGC
AGCAATATCG ACTACCTGGT ACCAAACCGT TTCGAAGACG GTTACGGCTT AAGCCCGGAA
GTGGTCGATC AGGCCCATGC CCGTGGCGCG CAGTTAATTG TCACGGTGGA TAACGGTATT
TCCTCCCATG CGGGAGTTGA GCACGCTCGC TCGTTGGGCA TCCCGGTTAT TGTTACCGAT
CACCATTTGC CGGGCGACAC ATTACCCGCA GCGGAAGCGA TCATTAACCC TAACTTGCGC
GACTGTAATT TCCCGTCGAA ATCACTGGCA GGCGTGGGTG TGGCGTTTTA TCTGATGCTG
GCGCTGCGCA CCTTTTTGCG CGATCAGGGC TGGTTTGATG AGCGCGGCAT CGCAATTCCT
AACCTGGCAG AACTGCTGGA TCTGGTCGCG CTGGGGACAG TGGCGGACGT CGTGCCGCTG
GACGCTAATA ATCGCATTCT GACCTGGCAG GGGATGAGTC GCATCCGTGC CGGAAAGTGC
CGTCCAGGGA TTAAAGCGCT GCTGGAAGTG GCAAACCGTG ATGCACAAAA ACTCGCCGCC
AGCGATTTAG GTTTTGCGCT GGGGCCACGT CTCAATGCTG CCGGGCGACT GGACGATATG
TCCGTCGGTG TGGCGCTGTT GCTGTGCGAC AACATCGGCG AAGCGCGCGT GCTGGCAAAT
GAACTCGATG CGCTAAATCA GACGCGAAAA GAGATCGAAC AAGGAATGCA GATTGAAGCC
TTGACCCTGT GCGAGAAACT GGAGCGAAGT CGCGACACGC TACCCGGCGG GCTGGCAATG
TATCACCCCG AATGGCATCA GGGCGTTGTC GGTATTCTGG CTTCGCGCAT CAAAGAGCGT
TTTCACCGTC CGGTTATCGC CTTTGCGCCA GCAGGTGATG GTACGCTGAA AGGTTCCGGT
CGCTCCATTC AGGGGCTGCA TATGCGTGAT GCGCTGGAGC GATTAGACAC ACTCTACCCC
GGTATGATGC TGAAGTTTGG CGGTCATGCG ATGGCGGCGG GTTTGTCGCT GGAAGAGGAT
AAATTCGAAC TCTTTCAACA ACGGTTTGGC GAGTTGGTTA CCGAGTGGCT GGACCCTTCG
CTATTGCAAG GCGAAGTGGT GTCAGACGGC CCGTTAAGCC CGGCCGAAAT GACCATGGAA
GTGGCGCAGC TGCTGCGCGA TGCTGGCCCG TGGGGGCAGA TGTTCCCGGA GCCGCTGTTT
GATGGTCATT TCCGTCTGCT GCAACAGCGG CTGGTGGGCG AACGTCATTT GAAAGTCATG
GTCGAACCGG TCGGCGGCGG TCCGCTGCTG GATGGTATTG CTTTTAATGT CGATACCGCC
CTCTGGCCGG ATAACGGCGT GCGCGAAGTG CAACTGGCTT ACAAGCTCGA TATCAACGAG
TTTCGCGGCA ACCGCAGCCT GCAAATTATC ATCGACAATA TCTGGCCAAT TTAG
 
Protein sequence
MKQQIQLRRR EVDETADLPA ELPPLLRRLY ASRGVRSAQE LERSVKGMLP WQQLSGVEKA 
VEILYNAFRE GTRIIVVGDF DADGATSTAL SVLAMRSLGC SNIDYLVPNR FEDGYGLSPE
VVDQAHARGA QLIVTVDNGI SSHAGVEHAR SLGIPVIVTD HHLPGDTLPA AEAIINPNLR
DCNFPSKSLA GVGVAFYLML ALRTFLRDQG WFDERGIAIP NLAELLDLVA LGTVADVVPL
DANNRILTWQ GMSRIRAGKC RPGIKALLEV ANRDAQKLAA SDLGFALGPR LNAAGRLDDM
SVGVALLLCD NIGEARVLAN ELDALNQTRK EIEQGMQIEA LTLCEKLERS RDTLPGGLAM
YHPEWHQGVV GILASRIKER FHRPVIAFAP AGDGTLKGSG RSIQGLHMRD ALERLDTLYP
GMMLKFGGHA MAAGLSLEED KFELFQQRFG ELVTEWLDPS LLQGEVVSDG PLSPAEMTME
VAQLLRDAGP WGQMFPEPLF DGHFRLLQQR LVGERHLKVM VEPVGGGPLL DGIAFNVDTA
LWPDNGVREV QLAYKLDINE FRGNRSLQII IDNIWPI