Gene Information |
Locus tag | EcSMS35_4002 |
Symbol | |
ID | 6143699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4081597 |
End bp | 4082760 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641618827 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001745965 |
Protein GI | 170682095 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
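The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4081597, 4082760)
print(length, protein_length(length))  # prints: 1164 387
```

Both results agree with the Gene Length (1164 bp) and Protein Length (387 aa) fields listed in the record.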
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTCTCA CCGATATACA GATCAAACGT GCAAAACCAC AAGACAAGCC ATACACATTG AACGATGGAC AAGGTCTGTC ATTGCTTATC AATCCCGATG GCTCGAAAGG CTGGCGTTTC CGTTTCCGGT TTGCAGGGAA AGCGCGGTTA ATGTCATTTG GCAGCTACGA TTTAGTAAGC CTCGCAGAAG CACGTGAGAA GCGTGATATC GCCCGTAAGC AGGTTGCTAA TGGCATTGAC CCGGTAGAGG AACGCAAAGC TTTAAGACTC GCCCAAAAGC TATCAACAGA AAATTCTTTC GAAGCAATAT GTCGAGAATG GCATACCAAC AAAGCTGACC GCTGGACTGT GGCCTATCGA GAAGAAATCA TTAAGACATT CGAGCAAGAT GTCTTCCCGT TCATTGGTAA ACGTCCTATC AGTGAAATCA AACCATTAGA ACTGCTTGAA GTATTACGAC GAATCGAAAA ACGTGGAGCA CTAGAGAAAA CACGCAAGGT GCGTCAAAGA TGCGGTGAGG TTTATCGCTA TGCAATCATA ACTGGCCGTG CTGAGTACAA TCCTGCACCT GATTTAGCTA TCGCTCTGGC CGTTCCCAAG CAAAAACACC ATCCATTTTT ATCCGCTGAA GAGTTGCCTC ATTTTATTCG AGATCTTGAA GCGTATACCG GTAGCATCAT CACCAAAAAT GCTACGAAGA TAGTCATGCT GACTGGTGTA AGAACGCAGG AGATGCGCTT TGCTACGTGG GAAGAAGTAG ACCTCGAAAA AGGTATATGG GAGATACCAG CGGAACGTAT GAAAATGCGT AGACCTCACA TTGTTCCTTT ATCTACTCAG GTAGTTGACC TTTTCAAACA GCTCAAACCT ATTACCGGCC ATTACCCTTA CATCTTTATT GGCAGGAACA ACCGCAGCAA GCCAATCTCA AAAGAAAGTG TTTCACAAGT GATTGAGTTA ATTGGCTACA AAGGCCGTGC TACAGGTCAC GGTTTTCGGC ATACCATGTC GACAATATTG CACGAACAAG GGTTTGATAG CGCATGGATT GAAATACAAT TGGCACATGT TGATAAAAAC AGAATCCGAG GGACTTACAA TCATGCTCAA TATCTTGAAC ATAGAAAAAA AATGATGCAA TGGTATTCAG ATAAATTATA TTGA
|
Protein sequence | MALTDIQIKR AKPQDKPYTL NDGQGLSLLI NPDGSKGWRF RFRFAGKARL MSFGSYDLVS LAEAREKRDI ARKQVANGID PVEERKALRL AQKLSTENSF EAICREWHTN KADRWTVAYR EEIIKTFEQD VFPFIGKRPI SEIKPLELLE VLRRIEKRGA LEKTRKVRQR CGEVYRYAII TGRAEYNPAP DLAIALAVPK QKHHPFLSAE ELPHFIRDLE AYTGSIITKN ATKIVMLTGV RTQEMRFATW EEVDLEKGIW EIPAERMKMR RPHIVPLSTQ VVDLFKQLKP ITGHYPYIFI GRNNRSKPIS KESVSQVIEL IGYKGRATGH GFRHTMSTIL HEQGFDSAWI EIQLAHVDKN RIRGTYNHAQ YLEHRKKMMQ WYSDKLY
|
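The Gene sequence and Protein sequence fields are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. A minimal sketch follows, assuming plain whitespace separates the blocks. The helper names `ungroup` and `gc_fraction` are hypothetical, and the sample input is only the first two blocks of the Gene sequence field.

```python
def ungroup(seq_field: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(seq_field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the Gene sequence field, used here only as sample input.
sample = ungroup("ATGGCTCTCA CCGATATACA")
print(len(sample), sample.startswith("ATG"))
```

Applying `ungroup` to the full Gene sequence field and passing the result to `gc_fraction` would give a value to compare against the GC content field, and `len(ungroup(...))` could be compared against the Gene Length and Protein Length fields.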