Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2277 |
Symbol | |
ID | 6146630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2301700 |
End bp | 2302908 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617151 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001744324 |
Protein GI | 170680457 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0249375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0747113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTAT TGACGGATAC TAAAGCAAGG CATATCAAAC CTGATGACAA ACCATTGCCC CATGGGGGAA TTACAGGACT GACTCTTCAT CCTTCTTCAG TAAAGGGGAG GGGTAAATGG GTTTTTCGTT ATGTAAGTCC GGTGACACAA AAAAGGCGTA ATGCTGGATT GGGAACTTAT CCTGAGGTCA GTATTGCTGA AGCTGCACGT ACTGCCCGGA TAATGCGAGA GCAACTTGCT GCAGGTGATG ATCCTCTGGA GATTAAAAAG GCTGAAGCTG AAAAAGTTGT TATCCCAACA TTTGCCGATG CAGCCAGGCG TGTACATGCA GAACTGTCTC CTGGATGGGA AAATCCAAAG CATGTAAGGC AGTGGTTATC GACGCTTGAG AATTACGCGT TTCCTCAACT GGGAGCAAAA ACGCTGGATT CGATTACGGC TGCGGACGTG GCAGAAACAC TGCGTCCAGT CTGGTTAACC TTGTCAGAAA CGGCAAGCCG GGTTAAACAG CGCATTCATG TTGTTATGCA GTGGGGCTGG GCGCATGGTT TTTGTGTGGC GAATCCTGTT GATGTGGTTG ATCATTTGCT TCCACAGCAA TCAAGAGGAC GTGATGAACA CCAGCCGGCA ATGCCCTGGA GGCAGTTACC GCTTTTTGTG GCGACCAGTG TGTATACAGA TGAACCTTAT AATGTTACCC GGGCACTGTT ATTAATGGTG ATACTGACAG CAACCCGCTC GGGCGAAGCA AGGGGAATGC GCTGGGCTGA AATTGATTTT CATAAGCGGA TATGGACGAT ACCCGCAGAA AGAATGAAAG CCAGGATACA GCATCGTGTT CCTTTATCCC GACAGGCCAT TCACGTTCTG GAAAATATAC GTGGTCTGCA TGACGAACTG GTGTTTCCTT CTCCCAGAAA GCAGCAGATC CTTTCAGATA TGGTGTTGAC GAGTTTTCTG CGTAAAAAGA AGGCCATCAG TGATATACCC GGACGAGTGG CTACAGCACA TGGTTTTCGT TCAACATTCA GGGACTGGTG TAGCGAACAG GGATATTCGC GGGATTTGGC GGAAAGGGCG CTTGCCCATA CGCTGAAAAA TAAGGTTGAG GCGGCATATC ACCGGACTGA TCTGCTGGAT CAGCGTATAC CGATGATGCA GGCATGGGCG GATTATGTGA TGTCTCAGAT TATGGAAAAC CAGCGATGA
|
Protein sequence | MAVLTDTKAR HIKPDDKPLP HGGITGLTLH PSSVKGRGKW VFRYVSPVTQ KRRNAGLGTY PEVSIAEAAR TARIMREQLA AGDDPLEIKK AEAEKVVIPT FADAARRVHA ELSPGWENPK HVRQWLSTLE NYAFPQLGAK TLDSITAADV AETLRPVWLT LSETASRVKQ RIHVVMQWGW AHGFCVANPV DVVDHLLPQQ SRGRDEHQPA MPWRQLPLFV ATSVYTDEPY NVTRALLLMV ILTATRSGEA RGMRWAEIDF HKRIWTIPAE RMKARIQHRV PLSRQAIHVL ENIRGLHDEL VFPSPRKQQI LSDMVLTSFL RKKKAISDIP GRVATAHGFR STFRDWCSEQ GYSRDLAERA LAHTLKNKVE AAYHRTDLLD QRIPMMQAWA DYVMSQIMEN QR
|
| |