Gene EcSMS35_2969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2969 
SymbolrecC 
ID6146657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3047691 
End bp3051059 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content54% 
IMG OID641617838 
Productexonuclease V subunit gamma 
Protein accessionYP_001744990 
Protein GI170683034 
COG category[L] Replication, recombination and repair 
COG ID[COG1330] Exonuclease V gamma subunit 
TIGRFAM ID[TIGR01450] exodeoxyribonuclease V, gamma subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.481674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0228423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAGGG TCTACCATTC CAATCGTCTG GACGTGCTGG AAGCGTTGAT GGAGTTTATT 
GTCGAACGCG AACGGCTGGA CGATCCTTTC GAACCAGAGA TGATTCTGGT GCAAAGTACC
GGTATGGCGC AGTGGCTGCA AATGACCCTG TCGCAAAAGT TTGGTATTGC GGCAAACATT
GATTTTCCGC TGCCAGCGAG CTTTATCTGG GATATGTTCG TCCGGGTGTT ACCGGAAATC
CCCAAAGAGA GCGCCTTTAA CAAACAGAGC ATGAGCTGGA AACTGATGAC TCTGCTGCCG
CAACTGTTGG AGCGCGAAGA CTTTACCCTG TTGCGGCATT ATCTGACTGA CGATAGTGAC
AAGCGAAAAC TGTTCCAGCT TTCTTCAAAA GCGGCGGACC TGTTTGACCA GTATCTGGTC
TATCGTCTGG ACTGGCTGGC ACAGTGGGAA ACAGGACATC TGGTAGAAGG GTTGGGAGAA
GCACAGGCCT GGCAAGCGCC GTTGTGGAAG GCGTTGGTGG AATATACCGA CGAACTCGGG
CAACCGCGCT GGCACCGCGC CAATCTCTAT CAGCGCTTTA TCGAAACGCT GGAGTCCGCG
ACGACCTGCC CGCCGGGGCT ACCTTCGCGC GTCTTTATAT GCGGTATTTC CGCGTTACCG
CCTGTTTATC TCCAGGCGCT ACAGGCGCTG GGTAAACATA TTGAAATCCA TCTCCTGTTT
ACCAACCCCT GCCGTTATTA CTGGGGCGAC ATTAAAGATC CTGCTTATCT GGCGAAACTG
CTGACTCGCC AGCGCCGACA CAGTTTTGAA GATCGCGAAT TACCGTTATT TCGCGACAGC
GAAAATGCCG GACAGCTCTT TAACAGCGAT GGTGAACAGG ATGTCGGTAA CCCGCTGCTG
GCCTCATGGG GCAAGCTTGG GCGCGACTAC ATTTATCTTC TTTCTGACCT GGAGGGCAGC
CAGGAGCTGG ACGCTTTTGT CGATGTGACG CCAGACAACC TGCTGCATAA CATTCAGTCT
GACATTCTGG AACTGGAAAA CCGCGCCGTT GCGGGTGTGA ACATCGAAGA GTTTTCCCGT
AGCGATAACA AACGCCCGCT TGATCCACTG GATAGCAGTA TCACCTTCCA CGTTTGCCAT
AGCCCGCAGC GTGAAGTTGA AGTTTTACAC GATCGCTTGC TGGCGATGCT GGAGGAAGAC
CCGACACTTA CTCCGCGCGA CATCATCGTG ATGGTGGCCG ACATCGACAG CTACAGTCCG
TTTATTCAGG CAGTGTTTGG TAGCGCACCT GCGGATCGTT ACCTGCCTTA CGCCATTTCC
GACCGTCGTG CGCGGCAGTC ACATCCGGTA CTGGAAGCGT TTATCAGCCT GTTATCACTG
CCTGACAGTC GCTTTGTGTC AGAGGATGTG CTGGCGCTGC TGGATGTGCC GGTGCTGGCG
GCGCGGTTTG ACATCACCGA AGAAGGGCTG CGTTATTTAC GTCAGTGGGT CAACGAATCT
GGCATTCGTT GGGGGATAGA TGACGACAAC GTTCGCGAGC TGGAACTCCC CGCCACCGGA
CAACACACCT GGCGATTTGG CCTGACGCGT ATGTTACTAG GCTACGCGAT GGAGAGCGCG
CAGGGCGAGT GGCAATCGGT TCTACCTTAT GATGAATCGA GCGGCTTAAT TGCAGAACTG
GTGGGGCATC TGGCTTCACT GCTAATGCAG CTAAACATCT GGCGTCGCGG GCTGGCGCAG
GAGCGTCCGC TGGAAGAGTG GTTGCCGGTT TGTCGCGATA TGCTCAACGC CTTCTTCCTG
CCGGATGCGG AAACCGAAGC GGCGATGACG CTGATCGAAC AACAATGGCA GGCGATTATC
GCCGAAGGTT TAGGTGCGCA GTATGGCGAC GCGGTGCCGC TGTCACTATT GCGTGATGAA
CTGGCACAGC GCCTGGATCA GGAACGTATC AGCCAGCGTT TTCTCGCCGG ACCGGTTAAC
ATTTGTACTC TGATGCCAAT GCGTTCCATC CCATTCAAAG TGGTTTGCCT GCTGGGAATG
AACGACGGCG TTTATCCACG CCAGCTTGCG CCATTGGGCT TTGACCTGAT GAGCCAGAAA
CCGAAGCGTG GCGACCGTAG CCGTCGCGAT GACGACCGCT ATCTGTTCCT GGAAGCGTTA
ATTTCCGCGC AGCAAAAACT CTATATCAGC TATATCGGGC GTTCTATTCA GGATAACAGT
GAACGTTTCC CGTCGGTACT GGTGCAGGAA CTGATCGACT ACATCGGGCA AAGCCATTAT
CTACCGGGCG ATGAAGCGCT CAACTGTGAT GAAAGCGAGG CAAGGGTAAA AGCACATCTT
ACTTGCCTCC ATACCCGAAT GCCGTTTGAT CCACAAAACT ACCAGCCAGG CGAACGACAA
AGCTATGCTC GTGAATGGCT ACCCGCGGCC AGCCAGGCTG GTAAAGCACA TTCTGAATTT
GTTCAGCAGC TGCCGTTTAC CTTACCGGAA ACCGTGCCGC TGGAAACGCT ACAACGATTC
TGGGCACATC CGGTGCGGGC ATTTTTCCAG ATGCGTTTGC AGGTGAACTT CCGTACCGAA
GACAGCGAAA TCCCCGACAC CGAGCCATTT ATTCTGGAAG GACTTAGCCG TTATCAAATC
AATCAGCAGT TATTGAATGC ACTGGTTGAG CAGGATGATG CCGAACGCTT GTTCCGCCGC
TTCCGGGCGG CAGGGGATTT ACCGTATGGC GCTTTTGGTG AAATTTTCTG GGAAACACAG
TGCCAGGAGA TGCAGCAGCT TGCCGACAGA GTCATTGCCT GTCGCCAGCC GGGGCAGAGT
ATGGAAATTG ACCTCGCCTG CAACGGTGTG CAGATAACTG GCTGGTTGCC GCAGGTGCAG
TCGGATGGCC TGTTGCGCTG GCGTCCCTCT TTATTAAGTG TGGCGCAGGG AATGCAACTT
TGGCTGGAAC ACCTTGTCTA CTGTGCCAGC GGTGGTAATG GTGAAAGTCG CCTTTTTCTA
CGCAAAGACG GCGAGTGGCG TTTTCCGCCG CTTGCAGCCG AACAGGCTTT ACATTACCTC
TCACAGCTGA TTGAAGGGTA TCGTGAAGGA ATGTCCGCGC CATTGCTGGT GTTACCTGAA
AGTGGCGGCG CGTGGCTAAA AACCTGTTAT GACGCGCAAA ACGATGCCAT GCTGGATGAC
GATTCCACGT TGCAAAAAGC CCGAACGAAA TTCCTTCAGG CTTACGAAGG CAACATGATG
GTGCGTGGCG AAGGTGATGA TATCTGGTAT CAACGGCTCT GGCGGCAATT AACACCAGAG
ACAATGGAGG CCATCGTTGA ACAGTCGCAA CGTTTCCTGT TACCGCTGTT TCGCTTTAAT
CAGTCATGA
 
Protein sequence
MLRVYHSNRL DVLEALMEFI VERERLDDPF EPEMILVQST GMAQWLQMTL SQKFGIAANI 
DFPLPASFIW DMFVRVLPEI PKESAFNKQS MSWKLMTLLP QLLEREDFTL LRHYLTDDSD
KRKLFQLSSK AADLFDQYLV YRLDWLAQWE TGHLVEGLGE AQAWQAPLWK ALVEYTDELG
QPRWHRANLY QRFIETLESA TTCPPGLPSR VFICGISALP PVYLQALQAL GKHIEIHLLF
TNPCRYYWGD IKDPAYLAKL LTRQRRHSFE DRELPLFRDS ENAGQLFNSD GEQDVGNPLL
ASWGKLGRDY IYLLSDLEGS QELDAFVDVT PDNLLHNIQS DILELENRAV AGVNIEEFSR
SDNKRPLDPL DSSITFHVCH SPQREVEVLH DRLLAMLEED PTLTPRDIIV MVADIDSYSP
FIQAVFGSAP ADRYLPYAIS DRRARQSHPV LEAFISLLSL PDSRFVSEDV LALLDVPVLA
ARFDITEEGL RYLRQWVNES GIRWGIDDDN VRELELPATG QHTWRFGLTR MLLGYAMESA
QGEWQSVLPY DESSGLIAEL VGHLASLLMQ LNIWRRGLAQ ERPLEEWLPV CRDMLNAFFL
PDAETEAAMT LIEQQWQAII AEGLGAQYGD AVPLSLLRDE LAQRLDQERI SQRFLAGPVN
ICTLMPMRSI PFKVVCLLGM NDGVYPRQLA PLGFDLMSQK PKRGDRSRRD DDRYLFLEAL
ISAQQKLYIS YIGRSIQDNS ERFPSVLVQE LIDYIGQSHY LPGDEALNCD ESEARVKAHL
TCLHTRMPFD PQNYQPGERQ SYAREWLPAA SQAGKAHSEF VQQLPFTLPE TVPLETLQRF
WAHPVRAFFQ MRLQVNFRTE DSEIPDTEPF ILEGLSRYQI NQQLLNALVE QDDAERLFRR
FRAAGDLPYG AFGEIFWETQ CQEMQQLADR VIACRQPGQS MEIDLACNGV QITGWLPQVQ
SDGLLRWRPS LLSVAQGMQL WLEHLVYCAS GGNGESRLFL RKDGEWRFPP LAAEQALHYL
SQLIEGYREG MSAPLLVLPE SGGAWLKTCY DAQNDAMLDD DSTLQKARTK FLQAYEGNMM
VRGEGDDIWY QRLWRQLTPE TMEAIVEQSQ RFLLPLFRFN QS