Gene ECH74115_3227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3227 
Symbol 
ID6971042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2968757 
End bp2970283 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content35% 
IMG OID643387044 
Productrecombinase family protein 
Protein accessionYP_002271508 
Protein GI209400302 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.171762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.00185611 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAG CCATAGCATA TATGCGATTT TCATCACCAG GTCAGATGTC TGGCGACTCA 
TTAAACCGAC AGAGAAGACT TATTGCTGAA TGGTTAAAGG TAAATAGTGA TTATTATCTT
GATACCATAA CATATGAAGA TTTAGGATTA AGTGCATTCA AAGGAAAGCA TGCACAATCA
GGAGCTTTTT CGGAATTTTT AGATGCTATA GAGCATGGTT ATATATTGCC AGGAACTACA
TTGTTAGTTG AAAGTCTGGA CAGACTTTCA AGAGAAAAAG TCGGTGAAGC GATTGAACGT
CTGAAATTGA TTTTGAATCA CGGTATTGAT GTTATAACTC TTTGCGACAA TACAGTCTAT
AATATTGACT CTTTGAATGA GCCATATTCA TTAATAAAAG CCATACTTAT AGCACAAAGG
GCAAATGAAG AAAGCGAGAT AAAGTCAAGT CGGGTTAAAT TATCATGGAA GAAAAAACGG
CAGGATGCAC TGGAATCAGG TACGATTATG ACGGCGTCTT GTCCGAGATG GCTCTCCTTA
GATGACAAAA GAACGGCTTT TGTTCCAGAC CCCGACAGGG TGAAAACTAT TGAGCTAATT
TTTAAACTCA GGATGGAAAG GCGCTCATTG AATGCAATAG CCAAGTATTT AAATGATCAT
GCTGTAAAGA ATTTCTCAGG AAAAGAAAGT GCATGGGGAC CTTCTGTAAT TGAAAAATTA
TTAGCGAATA AAGCTCTGAT AGGTATATGC GTACCTTCAT ATCGTGCAAG AGGGAAAGGG
ATAAGTGAAA TCGCTGGCTA TTATCCCAGA GTCATATCAG ATGATTTGTT TTACGCTGTA
CAGGAAATTC GGTTGGCACC TTTTGGTATT AGCAATAGTA GCAAGAATCC TATGCTAATA
AATCTACTTC GAACAGTTAT GAAGTGTGAG GCTTGTGGTA ATACCATGAT TGTTCATGCG
GTATCTGGAA GTTTGCATGG CTATTATGTT TGTCCGATGA GAAGATTACA TCGATGTGAC
AGGCCATCAA TAAAAAGAGA TTTGGTTGAT TATAATATCA TTAATGAATT GCTTTTTAAT
TGTAGCAAAA TTCAACCAGT TGAAAACAAG AAAGATGCTA ATGAAACTTT AGAGTTAAAA
ATTATTGAGC TTCAGATGAA AATTAATAAT TTAATCGTTG CATTGTCTGT CGCGCCTGAA
GTTACCGCTA TAGCAGAGAA AATAAGACTA TTAGATAAGG AATTACGAAG GGCTTCGGTA
TCATTGAAAA CTTTGAAGAG TAAAGGTGTA AATTCATTCA GTGATTTTTA TGCTATTGAC
TTAACCAGTA AAAATGGACG AGAGTTATGC CGTACACTTG CCTATAAAAC ATTCGAAAAA
ATCATAATTA ATACGGATAA TAAAACCTGT GATATCTATT TTATGAATGG CATTGTTTTT
AAACACTATC CTTTAATGAA AGTAATATCT GCCCAGCAGG CGATAAGTGC TCTCAAATAT
ATGGTTGATG GTGAGATTTA TTTCTAA
 
Protein sequence
MKKAIAYMRF SSPGQMSGDS LNRQRRLIAE WLKVNSDYYL DTITYEDLGL SAFKGKHAQS 
GAFSEFLDAI EHGYILPGTT LLVESLDRLS REKVGEAIER LKLILNHGID VITLCDNTVY
NIDSLNEPYS LIKAILIAQR ANEESEIKSS RVKLSWKKKR QDALESGTIM TASCPRWLSL
DDKRTAFVPD PDRVKTIELI FKLRMERRSL NAIAKYLNDH AVKNFSGKES AWGPSVIEKL
LANKALIGIC VPSYRARGKG ISEIAGYYPR VISDDLFYAV QEIRLAPFGI SNSSKNPMLI
NLLRTVMKCE ACGNTMIVHA VSGSLHGYYV CPMRRLHRCD RPSIKRDLVD YNIINELLFN
CSKIQPVENK KDANETLELK IIELQMKINN LIVALSVAPE VTAIAEKIRL LDKELRRASV
SLKTLKSKGV NSFSDFYAID LTSKNGRELC RTLAYKTFEK IIINTDNKTC DIYFMNGIVF
KHYPLMKVIS AQQAISALKY MVDGEIYF