Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3227 |
Symbol | |
ID | 6971042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2968757 |
End bp | 2970283 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643387044 |
Product | recombinase family protein |
Protein accession | YP_002271508 |
Protein GI | 209400302 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.171762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00185611 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAG CCATAGCATA TATGCGATTT TCATCACCAG GTCAGATGTC TGGCGACTCA TTAAACCGAC AGAGAAGACT TATTGCTGAA TGGTTAAAGG TAAATAGTGA TTATTATCTT GATACCATAA CATATGAAGA TTTAGGATTA AGTGCATTCA AAGGAAAGCA TGCACAATCA GGAGCTTTTT CGGAATTTTT AGATGCTATA GAGCATGGTT ATATATTGCC AGGAACTACA TTGTTAGTTG AAAGTCTGGA CAGACTTTCA AGAGAAAAAG TCGGTGAAGC GATTGAACGT CTGAAATTGA TTTTGAATCA CGGTATTGAT GTTATAACTC TTTGCGACAA TACAGTCTAT AATATTGACT CTTTGAATGA GCCATATTCA TTAATAAAAG CCATACTTAT AGCACAAAGG GCAAATGAAG AAAGCGAGAT AAAGTCAAGT CGGGTTAAAT TATCATGGAA GAAAAAACGG CAGGATGCAC TGGAATCAGG TACGATTATG ACGGCGTCTT GTCCGAGATG GCTCTCCTTA GATGACAAAA GAACGGCTTT TGTTCCAGAC CCCGACAGGG TGAAAACTAT TGAGCTAATT TTTAAACTCA GGATGGAAAG GCGCTCATTG AATGCAATAG CCAAGTATTT AAATGATCAT GCTGTAAAGA ATTTCTCAGG AAAAGAAAGT GCATGGGGAC CTTCTGTAAT TGAAAAATTA TTAGCGAATA AAGCTCTGAT AGGTATATGC GTACCTTCAT ATCGTGCAAG AGGGAAAGGG ATAAGTGAAA TCGCTGGCTA TTATCCCAGA GTCATATCAG ATGATTTGTT TTACGCTGTA CAGGAAATTC GGTTGGCACC TTTTGGTATT AGCAATAGTA GCAAGAATCC TATGCTAATA AATCTACTTC GAACAGTTAT GAAGTGTGAG GCTTGTGGTA ATACCATGAT TGTTCATGCG GTATCTGGAA GTTTGCATGG CTATTATGTT TGTCCGATGA GAAGATTACA TCGATGTGAC AGGCCATCAA TAAAAAGAGA TTTGGTTGAT TATAATATCA TTAATGAATT GCTTTTTAAT TGTAGCAAAA TTCAACCAGT TGAAAACAAG AAAGATGCTA ATGAAACTTT AGAGTTAAAA ATTATTGAGC TTCAGATGAA AATTAATAAT TTAATCGTTG CATTGTCTGT CGCGCCTGAA GTTACCGCTA TAGCAGAGAA AATAAGACTA TTAGATAAGG AATTACGAAG GGCTTCGGTA TCATTGAAAA CTTTGAAGAG TAAAGGTGTA AATTCATTCA GTGATTTTTA TGCTATTGAC TTAACCAGTA AAAATGGACG AGAGTTATGC CGTACACTTG CCTATAAAAC ATTCGAAAAA ATCATAATTA ATACGGATAA TAAAACCTGT GATATCTATT TTATGAATGG CATTGTTTTT AAACACTATC CTTTAATGAA AGTAATATCT GCCCAGCAGG CGATAAGTGC TCTCAAATAT ATGGTTGATG GTGAGATTTA TTTCTAA
|
Protein sequence | MKKAIAYMRF SSPGQMSGDS LNRQRRLIAE WLKVNSDYYL DTITYEDLGL SAFKGKHAQS GAFSEFLDAI EHGYILPGTT LLVESLDRLS REKVGEAIER LKLILNHGID VITLCDNTVY NIDSLNEPYS LIKAILIAQR ANEESEIKSS RVKLSWKKKR QDALESGTIM TASCPRWLSL DDKRTAFVPD PDRVKTIELI FKLRMERRSL NAIAKYLNDH AVKNFSGKES AWGPSVIEKL LANKALIGIC VPSYRARGKG ISEIAGYYPR VISDDLFYAV QEIRLAPFGI SNSSKNPMLI NLLRTVMKCE ACGNTMIVHA VSGSLHGYYV CPMRRLHRCD RPSIKRDLVD YNIINELLFN CSKIQPVENK KDANETLELK IIELQMKINN LIVALSVAPE VTAIAEKIRL LDKELRRASV SLKTLKSKGV NSFSDFYAID LTSKNGRELC RTLAYKTFEK IIINTDNKTC DIYFMNGIVF KHYPLMKVIS AQQAISALKY MVDGEIYF
|
| |