Gene ECH74115_B0079 details

Gene Information

Locus tag: ECH74115_B0079
Symbol:
ID: 6966374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011350
Strand:
Start bp: 49983
End bp: 51281
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 62%
IMG OID: 643383977
Product: protein TraI
Protein accession: YP_002268456
Protein GI: 209395617
COG category: [L] Replication, recombination and repair
COG ID: [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member
TIGRFAM ID:
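
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. The function names `gene_length` and `protein_length` are hypothetical, and the sketch assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon that is not counted as a residue.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (not counted)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp 49983, End bp 51281.
length = gene_length(49983, 51281)
print(length, protein_length(length))  # 1299 432
```

Both results agree with the Gene Length and Protein Length fields of the record.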


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.342651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGAAG GCGTCACACT GTACCCCCCG GACACCATCA GGGTGGGGAC CGGTGACCGG 
ATGCGCTTCA CGAAGAGTGA CCGGGGGCGT GGTTATGTGG CCAACAGCGT CTGGACGGTG
ACAGCGGTTT CCGGTGACAG TGTCACGCTG TCGGGCGGAC AGCAGACCCG GGTGATTCGC
CCCGCCCAGG AGCGGGCAGA GCAACATATT GACCTGGCCT ATGCCATCAC CGCTCACGGT
GCGCAGGGGG CAAGTGAAAC CTTTGCCATC GCGCTTGAAG GTACGGAAGG CAGCCGGAAA
CTGATGGCCG GCTTTGAGTC AGCCTACGTG GCCCTGTCGC GTATGAAGCA GCATGTGCAG
GTGTACACCG ATAACCGTCA GGGCTGGACG GATGCCATTA ACAATGCCGT ACAGAAAGGA
ACAGCCCACG ATGTGTTTGA ACCGAAACCG GACCGGGAGG TCATGAATGC AGAGCGGCTG
TTCAGTACGG CGCGGGAGCT GCGGGACGTG GCGGCAGGGC GTGCCGTTCT TCGTCAGGCG
GGGCTTGCCG GGGGAGACAG TCCTGCACGG TTTATTGCTC CGGGACGTAA ATATCCACAG
CCGTATGTGG CACTGCCGGC GTTTGACCGT AACGGCAAGT CTGCCGGTAT CTGGCTGAAC
CCGCTGACCA CGGATGACGG AAACGGGCTG CGGGGATTCA GTGGTGAAGG ACGGGTGAAA
GGCAGCGGGG ATGCGCAGTT CGTGGCCCTG CAGGGCAGCC GTAACGGAGA GAGCCTGCTG
GCTGATAATA TGCAGGATGG TGTCCGGATT GCCCGTGATA ATCCTGACAG CGGTGTGGTG
GTGAGAATCG CCGGTGAAGG TCGTCCGTGG AATCCCGGTG CCATCACCGG TGGTCGCGTG
TGGGGGGATA TCCCGGACAA CAGCGTCCAG CCGGGAGCCG GAAATGGCGA GCCGGTCACG
GCAGAGGTAC TGGCACAGCG GCAGGCTGAA GAGGCCATCC GCCGTGAAAC GGAACGCCGC
GCAGATGAAA TTGTCCGTAA AATGGCAGAG AACAAACCTG ACCTGCCGGA CGGCAGAACA
GAGCAGGCTG TCAGGGAGAT TGCCGGGCAG GAGCGTGAAC GGGCTGTCAC TTCTGAACGG
GAAGCCGCGC TGCCGGAGAG TGTACTGCGT GAACCACAAC GGGAGCGGGA GGCGGTCCGT
GAGGTTGTCC GGGAAAACCT GCTGCAGGAG CGACTGCAGC AGATGGAGCG GGATATGGTT
CGTGACCTGC AGAAAGAGAA AACCCTGGGC GGAGACTGA
 
Protein sequence
MAEGVTLYPP DTIRVGTGDR MRFTKSDRGR GYVANSVWTV TAVSGDSVTL SGGQQTRVIR 
PAQERAEQHI DLAYAITAHG AQGASETFAI ALEGTEGSRK LMAGFESAYV ALSRMKQHVQ
VYTDNRQGWT DAINNAVQKG TAHDVFEPKP DREVMNAERL FSTARELRDV AAGRAVLRQA
GLAGGDSPAR FIAPGRKYPQ PYVALPAFDR NGKSAGIWLN PLTTDDGNGL RGFSGEGRVK
GSGDAQFVAL QGSRNGESLL ADNMQDGVRI ARDNPDSGVV VRIAGEGRPW NPGAITGGRV
WGDIPDNSVQ PGAGNGEPVT AEVLAQRQAE EAIRRETERR ADEIVRKMAE NKPDLPDGRT
EQAVREIAGQ ERERAVTSER EAALPESVLR EPQREREAVR EVVRENLLQE RLQQMERDMV
RDLQKEKTLG GD
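
The gene and protein sequences can be cross-checked against each other with a short sketch. It is not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the start-codon handling is a simplification that covers only a subset of the initiators allowed by translation table 11. The codon table is the standard one, which table 11 shares for elongation; the only special case needed here is that an initiating GTG is read as methionine.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the table 11 initiators, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiating start codon as M and stopping at '*'."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence from the record.
first_row = "GTGGCTGAAG GCGTCACACT GTACCCCCCG GACACCATCA GGGTGGGGAC CGGTGACCGG"
print(translate(first_row))  # MAEGVTLYPPDTIRVGTGDR
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.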