Gene ECH74115_2944 details


Gene Information

Locus tag: ECH74115_2944
Symbol: sbcB
ID: 6971313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2720101
End bp: 2721525
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 50%
IMG OID: 643386786
Product: exonuclease I
Protein accession: YP_002271254
Protein GI: 209398513
COG category: [L] Replication, recombination and repair
COG ID: [COG2925] Exonuclease I
TIGRFAM ID:
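
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2720101
END_BP = 2721525
GENE_LENGTH_BP = 1425
PROTEIN_LENGTH_AA = 474


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2721525 - 2720101 + 1 gives 1425 bp, and 1425 / 3 - 1 gives 474 aa, which agrees with the listed lengths.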


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000000000391872
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACAGACA CAGATAAGCA ACCTACCTTC CTCTTTCACG ATTACGAAAC CTTTGGCACG 
CACCCCGCGT TAGATCGCCC TGCACAGTTC GCAGCCATTC GCACCGATGA CGAATTCAAT
GTCATCGGCG AACCCGAAGT CTTTTACTGC AAGCCCGCGG ATGACTATTT ACCCCAGCCT
GGAGCAGTAT TAATTACCGG TATTACCCCG CAGGAAGCTC GGGCGAAAGG AGAAAACGAA
GCCGCATTTG CCGCCCGTAT TCACTCGCTT TTTACCGTAC CGAAGACCTG TATTCTGGGC
TACAACAATG TGCGTTTCGA CGACGAAGTC ACACGCAACG TTTTTTATCG TAATTTCTAC
GATCCTTACG CCTGGAGCTG GCAGCATGAT AACTCGCGCT GGGATTTACT GGATGTTATG
CGTGCCTGCT ATGCCCTGCG CCCGGAAGGA ATAAACTGGC CTGAAAATGA TGACGGTCTA
CCGAGCTTTC GCCTTGAGCA TTTAACCAAA GCGAATGGTA TTGAACATAG CAACGCCCAC
GATGCGATGG CTGATGTGTA CGCCACTATT GCGATGGCGA AACTGGTAAA AACGCGTCAA
CCACGCCTGT TTGATTATCT CTTTACCCAT CGTAATAAAC ACAAACTGAT GGCGTTGATT
GATGTTCCGC AGATGAAACC CCTGGTTCAC GTTTCCGGAA TGTTTGGGGC ATGGCGCGGC
AATACCAGCT GGGTGGCACC GCTGGCGTGG CATCCTGAAA ATCGCAATGC CGTAATTATG
GTGGATTTGG CAGGAGATAT TTCGCCATTA CTGGAACTGG ATAGCGACAC ATTGCGCGAG
CGTTTATATA CCGCAAAAAC CGATCTTGGC GATAACGCCG CCGTTCCGGT TAAACTGGTA
CATATCAATA AATGTCCGGT GCTGGCCCAG GCGAATACGC TACGCCCGGA AGATGCCGAC
CGACTGGGAA TTAATCGTCA GCATTGCCTC GATAACCTGA AAATTCTGCG TGAAAATCCG
CAAGTGCGTG AAAAAGTGGT GGCGATATTC GCGGAAGCCG AACCGTTTAC GCCTTCAGAT
AACGTGGATG CACAGCTTTA TAATGGCTTT TTCAGTGACG CCGATCGTGC AGCAATGAAA
ATTGTGCTTG AAACCGAGCC GCGTAATTTA CCGGCACTGG ATATCACTTT TGTTGATAAA
CGGATTGAAA AGCTGTTGTT CAATTATCGG GCACGCAACT TCCCGGGGAC GCTGGATTAT
GCCGAGCAGC AACGCTGGCT GGAGCACCGC CGCCAGGTCT TCACGCCAGA GTTTTTACAA
GGCTATGCAG AAGAGATTCA GATGCTGGCG CAGCAGTATG CCGACGATAA AGAAAAAGTG
GCGCTGTTAA AAGCGCTTTG GCAGTACGCG GAAGAGATTG TCTAA
 
Protein sequence
MTDTDKQPTF LFHDYETFGT HPALDRPAQF AAIRTDDEFN VIGEPEVFYC KPADDYLPQP 
GAVLITGITP QEARAKGENE AAFAARIHSL FTVPKTCILG YNNVRFDDEV TRNVFYRNFY
DPYAWSWQHD NSRWDLLDVM RACYALRPEG INWPENDDGL PSFRLEHLTK ANGIEHSNAH
DAMADVYATI AMAKLVKTRQ PRLFDYLFTH RNKHKLMALI DVPQMKPLVH VSGMFGAWRG
NTSWVAPLAW HPENRNAVIM VDLAGDISPL LELDSDTLRE RLYTAKTDLG DNAAVPVKLV
HINKCPVLAQ ANTLRPEDAD RLGINRQHCL DNLKILRENP QVREKVVAIF AEAEPFTPSD
NVDAQLYNGF FSDADRAAMK IVLETEPRNL PALDITFVDK RIEKLLFNYR ARNFPGTLDY
AEQQRWLEHR RQVFTPEFLQ GYAEEIQMLA QQYADDKEKV ALLKALWQYA EEIV
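
The gene and protein sequences above are printed in groups of ten characters, so they need whitespace removed before any processing. The sketch below shows one way to do that, to compute a GC fraction, and to translate a coding sequence. It assumes the standard codon assignments, which translation table 11 shares for elongation (the table differs only in its alternative start codons, which do not matter here because this gene begins with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Standard codon assignments, built from the usual TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three groups of the gene sequence above.
    print(translate("ATGACAGACA CAGATAAGCA ACCTACCTTC"))  # MTDTDKQPTF
    # Last two groups of the gene sequence above, ending in the TAA stop codon.
    print(translate("GAAGAGATTG TCTAA"))  # EEIV
```

The two printed fragments match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value that the record rounds to 50%.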