Gene Information |
Locus tag | EcSMS35_4307 |
Symbol | |
ID | 6147078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4408933 |
End bp | 4409913 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641619128 |
Product | phage integrase family site-specific recombinase |
Protein accession | YP_001746252 |
Protein GI | 170680409 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
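The coordinates and lengths listed above can be cross-checked against each other. The following Python sketch is one way to do that; it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4408933
END_BP = 4409913


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 981 326
```

Both values agree with the Gene Length (981 bp) and Protein Length (326 aa) fields.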
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0756066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00146275 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAATTA AGAAGCTCGA TGATGGACGC TATGAAGTGG ACATTAGACC TCGCGGTCGC GACGGAAAAC GCATCCGCAG GAAATTTGAA AGAAAAGCTG AGGCTGTAGC ATTTGAGCGA TACACAATCG CCTACGCCAG CCAGAAAGAA TGGGCAGGTC AGCGAGCAGA TCGCAGAACT TTGAGTGAGT TGCTGAACAT CTGGTGGAAA TATCACGGGC AAAACCACGA GCATGGAACA AAAGAGTTTA ATCATCTGCT CAAAACCATC AGCGGCATAG GTGATATACC AGTGAGCCGG ATGAGCAAAA GAGCTTTGAT GGATTATCGT TCCATGCGAC TACGTGATGG TATCAGTGCC GCAACGATAA ACCGTGACAT GTACCGATTA TCCGGCATGT TCACAAAATT AATTCAATTG GATGAATTTT CCGGGCAACA CCCAATTCAC GGACTGCCGC CACTGGCGGA GGCCAACCCT GAAATGACGT TCCTGGAAAA AGCAGAAATC GAAAAACTGT TAAATGTTTT GGATGGTGAT GACTTACTTG TCGCACTTTT ATGTCTGAGC ACTGGAGGAA GATGGACGGA AGTTGCCACG CTAAAACCAG CACAGATTAC AAATTGCAGG GTTACCTTCC TGAAAACCAA AAACGGTAAA AAGCGAACCG TGCCGATTTC TGAGGAACTG GAGAAAAAAG TTAAAGAGGA GGCCAGCGCT AAATTATTCA AAGTTGATTA TGAGAAGTTT TGCGGGATTT TACGCAGAGT GAAGCCAGAT ATACCACCCA ATCAGGCAAC CCACATCCTG CGGCATACAT TCGCAAGCCA TTTCATGATG AATGGGGGCA ATATAATCGC ACTGCAACAG ATTCTGGGAC ATGCGAGCAT TCAGCAGACG ATGGCCTATG CGCACCTTGC GCCTGACTAC CTGCAAAATG CCGTCGCGCT GAATCCTCTA AAAGGCGGAG TGACGTTATA A
|
Protein sequence | MSIKKLDDGR YEVDIRPRGR DGKRIRRKFE RKAEAVAFER YTIAYASQKE WAGQRADRRT LSELLNIWWK YHGQNHEHGT KEFNHLLKTI SGIGDIPVSR MSKRALMDYR SMRLRDGISA ATINRDMYRL SGMFTKLIQL DEFSGQHPIH GLPPLAEANP EMTFLEKAEI EKLLNVLDGD DLLVALLCLS TGGRWTEVAT LKPAQITNCR VTFLKTKNGK KRTVPISEEL EKKVKEEASA KLFKVDYEKF CGILRRVKPD IPPNQATHIL RHTFASHFMM NGGNIIALQQ ILGHASIQQT MAYAHLAPDY LQNAVALNPL KGGVTL
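The gene sequence above is written in coding orientation (it begins with ATG and ends with TAA), so it can be translated directly even though the gene lies on the minus strand. The Python sketch below shows one way to check the protein sequence and GC content against the gene sequence. It is a minimal illustration, not the database's own method: it builds the standard codon table, whose codon-to-amino-acid assignments are the same as those of translation table 11 (the two differ only in permitted start codons), and it strips the spaces that split the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGTCAATTA AGAAGCTCGA TGATGGACGC"
print(translate(first_blocks))  # MSIKKLDDGR
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.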