Gene ECH74115_3734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3734 
SymbolxseA 
ID6968275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3454491 
End bp3455861 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content54% 
IMG OID643387526 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_002271979 
Protein GI209400584 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000326312 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACCTT CTCAATCCCC TGCAATTTTT ACCGTTAGTC GCCTGAATCA AACGGTTCGT 
CTGCTGCTTG AGCATGAGAT GGGACAGGTT TGGATCAGCG GCGAAATCTC TAATTTCACA
CAACCGGCTT CCGGTCACTG GTACTTTACA CTCAAAGACG ACACCGCCCA GGTACGCTGC
GCGATGTTCC GCAACAGCAA CCGCCGGGTG ACCTTCCGCC CACAGCATGG GCAACAAGTT
TTAGTTCGCG CCAATATTAC GCTCTACGAG CCGCGCGGCG ACTACCAGAT AATCGTTGAG
AGTATGCAGC CGGCCGGTGA AGGGCTGCTG CAACAGAAGT ACGAACAGCT CAAAGCGAAG
TTGCAGGCTG AAGGATTGTT CGATCTGCAA TACAAAAAAC CACTTCCCTC CCCTGCGCAT
TGCGTTGGTG TGATCACCTC AAAAACCGGT GCTGCGCTAC ATGATATTTT GCATGTGTTA
AAACGTCGCG ATCCTTCTCT GCCGGTGATC ATCTACCCCA CCGCCGTTCA GGGCGATGAC
GCGCCGGGGC AAATTGTTCG CGCCATTGAA CTGGCGAATC AGTGCAATGA GTGCGACGTG
TTGATCGTTG GGCGCGGCGG CGGTTCGCTG GAAGATTTAT GGAGTTTTAA CGACGAACGC
GTAGCGCGGG CGATTTTTGC CAGCCGCATT CCGGTCGTCA GCGCCGTCGG GCATGAGACG
GATGTGACCA TTGCCGATTT TGTTGCCGAT CTGCGTGCAC CAACACCGTC GGCTGCCGCC
GAAGTGGTGA GCCGTAATCA GCAAGAGTTA CTGCGCCAGG TGCAATCGAC CCGTCAACGG
CTGGAGATGG CGATGGATTA TTATCTCGCC AACCGCACAC GTCGCTTTAC GCAAATTCAT
CACCGATTAC AGCAACAGCA TCCGCAGCTC CGGCTGGCAC GCCAGCAAAC CATGCTTGAA
CGCCTGCAAA AACGGATGAG CTTTGCGCTG GAAAATCAAC TTAAGCGTGC CGGGCAACAG
CAGCAGCGAT TAACACAGCG GCTGAATCAG CAAAATCCAC AGCCGAAGAT TCATCGCGCG
CAAACGCGCA TTCAGCAACT GGAATATCGT TTAGCAGAAA CCCTGCGCGC ACAGCTTAGC
GCCACGCGTG AACGTTTCGG TAATGCAGTA ACGCACCTCG AAGCCGTAAG CCCACTGTCA
ACGCTGGCGC GTGGATACAG CGTTACTACT GCTACTGACG GCAAGGTACT GAAAAAAGTG
AAGCAGGTTA AAGCGGGTGA AATGCTAACC ACACGTCTGG AAGACGGCTG GGTAGAAAGT
GAAGTAAAAA ACATCCAGCC GGTAAAAAAA TCGCGTAAAA AAGTGCATTA A
 
Protein sequence
MLPSQSPAIF TVSRLNQTVR LLLEHEMGQV WISGEISNFT QPASGHWYFT LKDDTAQVRC 
AMFRNSNRRV TFRPQHGQQV LVRANITLYE PRGDYQIIVE SMQPAGEGLL QQKYEQLKAK
LQAEGLFDLQ YKKPLPSPAH CVGVITSKTG AALHDILHVL KRRDPSLPVI IYPTAVQGDD
APGQIVRAIE LANQCNECDV LIVGRGGGSL EDLWSFNDER VARAIFASRI PVVSAVGHET
DVTIADFVAD LRAPTPSAAA EVVSRNQQEL LRQVQSTRQR LEMAMDYYLA NRTRRFTQIH
HRLQQQHPQL RLARQQTMLE RLQKRMSFAL ENQLKRAGQQ QQRLTQRLNQ QNPQPKIHRA
QTRIQQLEYR LAETLRAQLS ATRERFGNAV THLEAVSPLS TLARGYSVTT ATDGKVLKKV
KQVKAGEMLT TRLEDGWVES EVKNIQPVKK SRKKVH