Gene ECH74115_0877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0877 
Symbol 
ID6968905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp894557 
End bp895627 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content45% 
IMG OID643384902 
Productphage integrase family protein 
Protein accessionYP_002269402 
Protein GI209397327 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGAA GGCGAAGTCA TGAGCGCCGG GATTTACCCC CTAACCTTTA TATAAGAAAC 
AATGGATATT ACTGCTACAG GGACCCAAGG ACGGGTAAAG AGTTTGGTTT AGGCCGAGAC
AGGAGGATAG CAATCACTGA AGCAATACAG GCCAATATTG AGTTATTTTC AGGACACAAA
CACAAGCCTC TGACAGCGAG AATCAACAGT GATAATTCTG TTACGTTACA TTCATGGCTT
GATCGCTACG AAAAAATCCT CGCCAGCAGA GGAATCAAGC AGAAGACACT CATAAATTAC
ATGAGCAAAA TTAAAGCAAT AAGGAGGGGG CTGCCTGATG CTCCACTTGA AGACATCACC
ACAAAAGAAA TTGCGGCAAT GCTCAATGGA TACATAGACG AGGGCAAGGC GGCGTCAGCC
AAGTTAATCA GATCAACACT GAGCGATGCA TTCCGAGAGG CAATAGCTGA AGGCCATATA
ACAACAAACC CGGTCGCTGC CACTCGCGCA GCAAAATCAG AGGTAAGGAG ATCAAGACTT
ACGGCTGACG AATACCTGAA AATTTATCAA GCAGCAGAAT CATCACCATG TTGGCTCAGA
CTTGCAATGG AACTGGCTGT TGTTACCGGG CAGCGAGTTG GTGATTTATG CGAAATGAAG
TGGTCTGATA TCGTAGATGG ATATCTTTAT GTCGAACAAA GCAAAACAGG CGTAAAAATT
GCCATCCCTA CAACATTGCA TGTTGATGCT CTCGGGATAT CAATGAAGGA AACACTTGAT
AAATGCAAAG AGATTCTTGG CGGAGAAACC ATAATTGCAT CTACTCGTCG TGAACCGCTT
TCATCCGGCA CAGTATCAAG GTATTTTATG CGCGCACGAA AAGCATCAGG TCTTTCCTTC
GAAGGGGATC CGCCTACCTT TCACGAGTTG CGCAGTTTGT CTGCAAGACT CTATGAGAAG
CAGATAAGCG ATAAGTTTGC TCAACATCTT CTCGGGCATA AGTCGGACAC CATGGCATCA
CAGTATCGTG ATGACAGAGG CAGGGAGTGG GACAAAATTG AAATCAAATA A
 
Protein sequence
MGRRRSHERR DLPPNLYIRN NGYYCYRDPR TGKEFGLGRD RRIAITEAIQ ANIELFSGHK 
HKPLTARINS DNSVTLHSWL DRYEKILASR GIKQKTLINY MSKIKAIRRG LPDAPLEDIT
TKEIAAMLNG YIDEGKAASA KLIRSTLSDA FREAIAEGHI TTNPVAATRA AKSEVRRSRL
TADEYLKIYQ AAESSPCWLR LAMELAVVTG QRVGDLCEMK WSDIVDGYLY VEQSKTGVKI
AIPTTLHVDA LGISMKETLD KCKEILGGET IIASTRREPL SSGTVSRYFM RARKASGLSF
EGDPPTFHEL RSLSARLYEK QISDKFAQHL LGHKSDTMAS QYRDDRGREW DKIEIK