Gene ECH74115_4084 details


Gene Information

Locus tag: ECH74115_4084
Symbol: recD
ID: 6970517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3777432
End bp: 3779258
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 55%
IMG OID: 643387842
Product: exonuclease V subunit alpha
Protein accession: YP_002272282
Protein GI: 209397729
COG category: [L] Replication, recombination and repair
COG ID: [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member
TIGRFAM ID: [TIGR01447] exodeoxyribonuclease V, alpha subunit
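
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3777432
end_bp = 3779258

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last one is assumed to be a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 1827 0 608
```

Under these assumptions the result agrees with the listed Gene Length (1827 bp) and Protein Length (608 aa).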


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTGC AAAAGCAATT ACTGGAAGCT GTGGAGCACA AACAGCTACG CCCGCTGGAC 
GTGCAGTTTG CCCTGACCGT GGCGGGAGAT GAACATCCTG CCGTCACCCT CGCGGCGGCA
CTGTTAAGTC ATGATGCCGG AGAGGGGCAC GTTTGTTTGC CGCTTTCACG ACTGGAAAAT
AATGAGGAGT CGCATCCGCT GTTGGCGACC TGTGTCAGTG AAATCGGTGA GCTACAAAAT
TGGGAAGAAT GCTTGCTGGC TTCACAAGCG GTCAGCCGGG GAGATGAACC CACGCCGATG
ATCCTCTGTG GCGATCGTCT TTATTTGAAT CGCATGTGGT GTAACGAGCG GACGGTAGCG
CGCTTCTTCA ATGAGGTGAA TCATGCCATC GAGGTGGATG AAGCCTTGTT GGCGCAAACC
CTGGATAAAC TCTTTCCGAC TGGTGATGAA ATTAACTGGC AAAAAGTGGC GGCAGCAGTG
GCGCTGACGC GGCGGATCTC GGTGATTTCC GGCGGCCCTG GCACCGGTAA AACGACCACC
GTAGCGAAGT TGCTGGCAGC GTTAATTCAA ATGGCCGACG GCGAACGCTG CCGTATCCGT
CTGGCTGCAC CAACGGGTAA AGCTGCCGCG CGCTTAACCG AATCTCTCGG CAAGGCTTTG
CGACAGTTAC CGCTGACCGA TGAACAAAAG AAACGTATTC CGGAAGATGC CAGTACTTTG
CACCGATTGC TGGGTGCGCA GCCGGGTAGC CAGCGTTTAC GTCATCATGC CGGTAACCCG
CTGCATCTTG ATGTGCTGGT GGTAGATGAA GCGTCAATGA TCGATCTGCC TATGATGTCG
AGACTGATCG ACGCCTTGCC CGATCATGCG CGAGTGATCT TTCTCGGCGA TCGTGATCAA
CTGGCCTCGG TTGAGGCTGG GGCTGTGCTG GGCGATATCT GCGCTTATGC CAATGCCGGC
TTTACCGCCG AGCGTGCCGG GCAGCTAAGC CGCCTGACGG GAACTCACGT TCCGGCAGGA
ACTGGCACAG AAGCGGCATC TTTGCGCGAC AGTCTCTGCC TGCTGCAAAA AAGCTATCGT
TTCGGCAGCG ATTCTGGCAT TGGTCAGTTA GCTGCGGCGA TCAACCGTGG TGATAAAACG
GCAGTGAAAA CCGTTTTTCA GCAGGATTTT ACTGATATTG AAAAACGGCT TTTACAGAGC
GGCGAAGATT ATATTGCGAT GCTTGAGGAA GCTCTTGCGG GTTATGGACG TTATCTGGAT
CTGCTGCAAG CGCGTGCCGA GCCGGATTTA ATCATTCAGG CGTTCAATGA GTACCAACTT
TTGTGCGCCC TGCGGGAAGG GCCATTTGGC GTGGCTGGAC TGAATGAGCG AATTGAGCAG
TTTATGCAAC AGAAGCGCAA AATTCATCGT AATCCGCACT CTCGTTGGTA CGAAGGCCGA
CCGGTGATGA TTGCCCGTAA TGACAGCGCG CTTGGGTTGT TTAATGGCGA TATCGGTATT
GCGCTGGATC GCGGGCAGGG GACGCGCGTC TGGTTTGCGA TGCCGGACGG CAATATTAAG
TCTGTGCAAC CGAGTCGCTT GCCAGAGCAC GAAACTACGT GGGCGATGAC GGTACATAAA
TCGCAGGGAT CGGAGTTCGA CCATGCGGCG TTGATTTTAC CGAGCCAACG CACGCCGGTA
GTAACGCGAG AGCTGGTTTA TACCGCGGTG ACCCGCGCGC GTCGCCGTCT GTCGCTGTAT
GCCGATGAGC GCATATTAAG TGCGGCAATC GCCACTCGTA CTGAGCGGCG CAGTGGTCTG
GCGGCGTTGT TTAGTTCACG GGAATAA
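
The gene sequence is listed in blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch of how a figure such as the GC content could be recomputed from the listing, the helpers below (`clean_sequence` and `gc_fraction`, both hypothetical names) strip the blocking and count G and C bases. Only the first line of the sequence is used here as an excerpt; the full listing would be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "ATGAAATTGC AAAAGCAATT ACTGGAAGCT GTGGAGCACA AACAGCTACG CCCGCTGGAC"
seq = clean_sequence(excerpt)

print(len(seq))                   # number of bases in the excerpt
print(round(gc_fraction(seq), 2)) # GC fraction of the excerpt only
```

The excerpt's value need not match the whole-gene figure; the listed GC content refers to the full 1827 bp sequence.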
 
Protein sequence
MKLQKQLLEA VEHKQLRPLD VQFALTVAGD EHPAVTLAAA LLSHDAGEGH VCLPLSRLEN 
NEESHPLLAT CVSEIGELQN WEECLLASQA VSRGDEPTPM ILCGDRLYLN RMWCNERTVA
RFFNEVNHAI EVDEALLAQT LDKLFPTGDE INWQKVAAAV ALTRRISVIS GGPGTGKTTT
VAKLLAALIQ MADGERCRIR LAAPTGKAAA RLTESLGKAL RQLPLTDEQK KRIPEDASTL
HRLLGAQPGS QRLRHHAGNP LHLDVLVVDE ASMIDLPMMS RLIDALPDHA RVIFLGDRDQ
LASVEAGAVL GDICAYANAG FTAERAGQLS RLTGTHVPAG TGTEAASLRD SLCLLQKSYR
FGSDSGIGQL AAAINRGDKT AVKTVFQQDF TDIEKRLLQS GEDYIAMLEE ALAGYGRYLD
LLQARAEPDL IIQAFNEYQL LCALREGPFG VAGLNERIEQ FMQQKRKIHR NPHSRWYEGR
PVMIARNDSA LGLFNGDIGI ALDRGQGTRV WFAMPDGNIK SVQPSRLPEH ETTWAMTVHK
SQGSEFDHAA LILPSQRTPV VTRELVYTAV TRARRRLSLY ADERILSAAI ATRTERRSGL
AALFSSRE
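
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes the amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative, and only the first line of the gene sequence is translated.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the matching amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA listing, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAAATTGC AAAAGCAATT ACTGGAAGCT GTGGAGCACA AACAGCTACG CCCGCTGGAC"
print(translate(first_line))  # prints: MKLQKQLLEAVEHKQLRPLD
```

The output matches the first twenty residues of the protein sequence listed above.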