Gene ECH74115_4087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4087 
SymbolrecC 
ID6968678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3785857 
End bp3789225 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content54% 
IMG OID643387845 
Productexonuclease V subunit gamma 
Protein accessionYP_002272285 
Protein GI209396729 
COG category[L] Replication, recombination and repair 
COG ID[COG1330] Exonuclease V gamma subunit 
TIGRFAM ID[TIGR01450] exodeoxyribonuclease V, gamma subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.459792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAGGG TCTACCATTC CAATCGTCTG GACGTGCTGG AAGCGTTGAT GGAGTTTATT 
GTCGAACGCG AACGGCTGGA CGATCCTTTC GAACCAGAGA TGATTCTGGT GCAAAGTACC
GGTATGGCAC AGTGGCTGCA AATGACCCTG TCGCAAAAGT TTGGTATTGC GGCAAACATT
GATTTTCCGC TGCCTGCGAG CTTTATCTGG GATATGTTCG TCCGGGTGTT ACCGGAGATC
CCCAAAGAGA GCGCCTTTAA CAAACAGAGC ATGAGCTGGA AACTGATGAC TCTGCTGCCG
CAATTGCTGG AGCGCGAAGA CTTTACCCTG TTGCGGCATT ATCTGACTGA CGATAGCGAC
AAGCGAAAGC TGTTCCAGCT TTCCTCAAAA GCGGCGGACC TGTTTGACCA GTATCTGGTC
TATCGTCCGG ACTGGCTGGC ACAGTGGGAA ACAGGACATT TGGTTGAAGG GCTGGGAGAG
GCACAGGCCT GGCAAGCGCC GTTGTGGAAG GCGCTGGTGG AATATACCCA TGAACTCGGA
CAACCGCGCT GGCACCGCGC CAATCTCTAT CAGCGCTTTA TCGAAACGCT GGAGTCCGCG
ACGACCTGCC CGCCGGGGTT ACCTTCGCGC GTCTTTATAT GCGGTATTTC CGCGTTACCG
CCTGTTTATC TCCAGGCGCT CCAGGCGCTG GGTAAACATA TTGAAATCCA TCTCCTGTTT
ACCAACCCCT GCCGTTATTA CTGGGGCGAC ATTAAAGATC CTGCTTATCT GGCGAAACTA
CTGACTCGCC AGCGCCGACA CAGTTTTGAA GATCGCGAAT TACCGCTATT TCGCGACAGC
GAAAATGCCG GGCAGCTCTT TAACAGCGAT GGTGAACAGG ATGTCGGCAA CCCGCTGCTG
GCCTCATGGG GCAAGCTTGG GCGCGACTAC ATTTATCTCC TTTCTGACCT GGAGAGCAGC
CAGGAGCTGG ACGCCTTTGT CGATGTGACG CCAGATAACC TGCTGCATAA CATTCAGTCT
GACATTCTGG AACTGGAAAA CCGCGCCGTT GCTGGTGTGA ACATCGAAGA GTTTTCCCGT
AGCGATAACA AACGTCCGCT TGATCCACTG GATAGCAGTA TCACCTTCCA CGTTTGCCAT
AGCCCGCAGC GTGAAGTTGA AGTTTTACAC GATCGCCTGC TGGCGATGCT GGAAGAAGAC
CCGACACTTA CTCCGCGCGA CATCATCGTG ATGGTGGCTG ATATCGACAG CTACAGTCCG
TTTATTCAGG CTGTGTTTGG TAGCGCACCT GCGGATCGTT ACCTGCCTTA CGCCATTTCC
GACCGTCGGG CGCGGCAGTC GCATCCTGTA CTTGAAGCGT TTATCAGCCT GTTATCGTTG
CCAGACAGCC GCTTTGTGTC GGAAGACGTG CTGGCATTAC TGGATGTGCC GGTGCTGGCG
GCGCGGTTTG ACATCACCGA AGAAGGGCTG CGTTATTTAC GTCAGTGGGT CAACGAATCC
GGCATTCGTT GGGGGATAGA TGACGACAAC GTTCGCGAGC TGGAACTCCC CGCCACCGGA
CAACACACCT GGCGATTTGG CCTGACGCGT ATGTTGTTGG GCTACGCGAT GGAGAGCGCG
CAGGGCGAGT GGCAATCGGT TCTACCTTAT GATGAATCGA GCGGCTTAAT TGCAGAACTG
GTGGGGCATC TGGCTTCACT GCTAATGCAG CTAAACATCT GGCGTCGCGG GCTGGCACAG
GAGCGTCCGC TGGAAGAGTG GTTGCCGGTT TGTCGCGATA TGCTCAACGC CCTCTTCCTG
CCGGATGCGG AAACCGAAGC GGCGATGACG CTGATCGAAC AACAATGGCA GGCGATTATC
GCCGAAGGTT TAGGTGCGCA GTATGGCGAT GCGGTGCCGC TGTCACTATT GCGTGATGAA
CTGGCACAGC GCCTGGATCA AGAACGTATC AGCCAGCGTT TTCTCGCCGG GCCGGTTAAC
ATTTGTACTC TGATGCCAAT GCGTTCAATT CCGTTCAAAG TGGTTTGCCT GCTGGGAATG
AACGACGGCG TTTATCCACG TCAGCTTGCG CCATTGGGCT TTGACCTGAT GAGCCAGAAA
CCGAAGCGTG GCGACCGTAG CCGTCGCGAT GACGACCGCT ACCTGTTCCT GGAAGCGTTA
ATTTCCGCGC AGCAAAAACT CTATATCAGC TATATCGGGC GTTCCATTCA GGATAACAGT
GAACGTTTCC CGTCGGTACT GGTGCAGGAA CTGATCGACT ACATCGGGCA AAGTCATTAT
CTACCGGGCG ATGAAGCGCT CAACTGTGAT GAAAGCGAGG CAAGGGTAAA AGCGCATCTT
ACTTGCCTCC ATACCCGGAT GCCGTTTGAT CCGCAAAACT ACCAGCCAGG CGAACGACAA
AGCTATGCTC GTGAATGGCT ACCTGCGGCC AGCCAGGCTG GTAAAGCACA TTCTGAATTT
GTTCAGCCGC TGCCGTTTAC CTTACCGGAA ACCGTGCCGC TGGAAACGCT ACAACGATTC
TGGGCACATC CGGTGCGGGC GTTCTTCCAG ATGCGTTTGC AGGTGAACTT CCGTACTGAA
GACAGCGAAA TCCCCGACAC CGAGCCATTT ATTCTGGAAG GACTTAGCCG TTATCAAATC
AATCAGCAGT TATTGAATGC ACTGGTTGAG CAGGATGATG CCGAACGCTT GTTCCGCCGC
TTCCGGGCGG CAGGGGATTT ACCGTATGGC GCTTTTGGTG AAATTTTCTG GGAAACACAG
TGCCAGGAGA TGCAGCAGCT TGCCGACAGA GTCATTGCCT GTCGCCAGCC GGGGCAGAGT
ATGGAAATTG ATCTCGCTTG CAACGGTGTG CAGATAACTG GCTGGTTGCC GCAGGTGCAG
CCGGATGGCT TGTTGCGCTG GCGTCCCTCT TTATTAAGTG TGGCGCAGGG AATGCAACTT
TGGCTGGAAC ACCTTGTCTA CTGTGCCAGC GGTGGTAATG GTGAAAGTCG CCTTTTTCTA
CGCAAAGACG GCGAGTGGCG TTTTCCGCCG CTTGCAGCCG AACAGGCTTT GCATTACCTC
TCACAACTGA TTGAGGGGTA TCGTGAAGGG ATGTCCGCGC CATTGCTGGT GTTACCTGAA
AGTGGCGGCG CGTGGCTAAA AACCTGTTAT GACGCGCAAA ACGATGCCAT GCTGGATGAC
GATTCCACGT TGCAAAAAGC CCGTACGAAA TTCCTTCAGG CTTACGAAGG CAACATGATG
GTGAGTGGCG AAGGTGATGA TATCTGGTAT CAAAGGCTCT GGCGGCAATT AACACCAGAG
ACAATGGAGG CAATCGTTGA ACAGTCGCAA CGTTTCCTGT TACCGCTGTT TCGCTTTAAT
CAGTCATGA
 
Protein sequence
MLRVYHSNRL DVLEALMEFI VERERLDDPF EPEMILVQST GMAQWLQMTL SQKFGIAANI 
DFPLPASFIW DMFVRVLPEI PKESAFNKQS MSWKLMTLLP QLLEREDFTL LRHYLTDDSD
KRKLFQLSSK AADLFDQYLV YRPDWLAQWE TGHLVEGLGE AQAWQAPLWK ALVEYTHELG
QPRWHRANLY QRFIETLESA TTCPPGLPSR VFICGISALP PVYLQALQAL GKHIEIHLLF
TNPCRYYWGD IKDPAYLAKL LTRQRRHSFE DRELPLFRDS ENAGQLFNSD GEQDVGNPLL
ASWGKLGRDY IYLLSDLESS QELDAFVDVT PDNLLHNIQS DILELENRAV AGVNIEEFSR
SDNKRPLDPL DSSITFHVCH SPQREVEVLH DRLLAMLEED PTLTPRDIIV MVADIDSYSP
FIQAVFGSAP ADRYLPYAIS DRRARQSHPV LEAFISLLSL PDSRFVSEDV LALLDVPVLA
ARFDITEEGL RYLRQWVNES GIRWGIDDDN VRELELPATG QHTWRFGLTR MLLGYAMESA
QGEWQSVLPY DESSGLIAEL VGHLASLLMQ LNIWRRGLAQ ERPLEEWLPV CRDMLNALFL
PDAETEAAMT LIEQQWQAII AEGLGAQYGD AVPLSLLRDE LAQRLDQERI SQRFLAGPVN
ICTLMPMRSI PFKVVCLLGM NDGVYPRQLA PLGFDLMSQK PKRGDRSRRD DDRYLFLEAL
ISAQQKLYIS YIGRSIQDNS ERFPSVLVQE LIDYIGQSHY LPGDEALNCD ESEARVKAHL
TCLHTRMPFD PQNYQPGERQ SYAREWLPAA SQAGKAHSEF VQPLPFTLPE TVPLETLQRF
WAHPVRAFFQ MRLQVNFRTE DSEIPDTEPF ILEGLSRYQI NQQLLNALVE QDDAERLFRR
FRAAGDLPYG AFGEIFWETQ CQEMQQLADR VIACRQPGQS MEIDLACNGV QITGWLPQVQ
PDGLLRWRPS LLSVAQGMQL WLEHLVYCAS GGNGESRLFL RKDGEWRFPP LAAEQALHYL
SQLIEGYREG MSAPLLVLPE SGGAWLKTCY DAQNDAMLDD DSTLQKARTK FLQAYEGNMM
VSGEGDDIWY QRLWRQLTPE TMEAIVEQSQ RFLLPLFRFN QS