Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0299 |
Symbol | |
ID | 6143653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 306879 |
End bp | 308063 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641615196 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001742404 |
Protein GI | 170681995 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000127187 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCTTA ATGCTCGACA GGTAGATGCT GCTAAACCCA GAGAGAAAGC CTACAAGCTA GCAGATGGTG CAGGCTTGTA TCTTGAAGTT GTTCCTTCTG GTTCTCGATA CTGGCGGATG AAATATCGCT TCAATGGAAA AGAGAAACGT ATGGCTTTTG GTGTCTATCC GGCAGTGTCC CTTGCACAAG CGAGGGCACT GCGTGATGAA GCCAAGAAAA AGCTGGCCGA AGGTATCGAC CCATCGTTTG CCAAGAAAGA AGAAAAGTTG GTTCGCGATG TGCAGCTCAA TAATACGTTT CAGGCTGTGG CACTTGAATG GCACGGAACG AAGGTGAGCC GGTGGTCAGA AGGTTATGCC TCCGACATTA TCGAAGCCTT CAATAAAGAT ATTTTCCCTT ATATTGGCCA ACTGCCGGTG AATGACATCA AGCCTTTGGT TCTGCTGAAT GTGCTACGTC GAATGGAAAG CCGTGGCGCG ACAGAGAAGG CCAAGAAGGT TCGCCAGCGT TGCAGTGAAG TCTTTCGTTA CGCCATCGTT ACCGGTCGTG CGGAATACAA TCCTGCAGCG GATCTAACCA GCGCAATGTC AGGGCATGAA TCGAAGCATT ATCCCTTCCT TACTGTTGAG GAGTTACCAG ACTTCTTTAA AGCTCTCGCA GGCTACACAG GAAGCCCGTT AGTTGTTCTT GCCGCTCGTC TGCTGATCCT TACAGGAGTT CGTACTGGCG AGCTACGAGG TGCTTTCTGG AGTGAGTTTG ATCTTGAAAA AGCAGTGTGG GAAATACCTG CAGAGCGTAT GAAGATGAAA CGGCCTCACC TTGTCCCCCT ATCTACCCAA GCGCTGGAAA TCGTACAACA ACTCAAGGTG ATATCTGGGC AATATCCACT GGTATTCCCA GGGCGAAATG ATCCCCGCAA GACGATGAGT GAAGCGAGTA TGAATCAGGT ATTCAAACGG ATTGGGTATA CGGGGAAGGT AACGGGGCAT GGTTTCCGTC ACACGATGAG TACGATTTTG CACGAGGAAG GGTTCAATAC GGCATGGATT GAAACCCAGC TTGCGCATGT CGATAAGAAT GCGATTCGTG GGACGTACAA CCATGCTTTG TATCTGGAAG GGCGGAGGGA GATGATGCAG TGGTATGCTG ATTGCATTGG AAGAATTGGT AATGATGTCA ATTGA
|
Protein sequence | MKLNARQVDA AKPREKAYKL ADGAGLYLEV VPSGSRYWRM KYRFNGKEKR MAFGVYPAVS LAQARALRDE AKKKLAEGID PSFAKKEEKL VRDVQLNNTF QAVALEWHGT KVSRWSEGYA SDIIEAFNKD IFPYIGQLPV NDIKPLVLLN VLRRMESRGA TEKAKKVRQR CSEVFRYAIV TGRAEYNPAA DLTSAMSGHE SKHYPFLTVE ELPDFFKALA GYTGSPLVVL AARLLILTGV RTGELRGAFW SEFDLEKAVW EIPAERMKMK RPHLVPLSTQ ALEIVQQLKV ISGQYPLVFP GRNDPRKTMS EASMNQVFKR IGYTGKVTGH GFRHTMSTIL HEEGFNTAWI ETQLAHVDKN AIRGTYNHAL YLEGRREMMQ WYADCIGRIG NDVN
|
| |