Gene Information |
Locus tag | EcSMS35_0642 |
Symbol | |
ID | 6143361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 656167 |
End bp | 657159 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615533 |
Product | IS5 transposase |
Protein accession | YP_001742739 |
Protein GI | 170681327 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3039] Transposase and inactivated derivatives, IS5 family |
TIGRFAM ID | |
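The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(656167, 657159)
print(length)                     # 993
print(protein_length_aa(length))  # 330
```

Both values agree with the Gene Length and Protein Length fields of this record.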
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0956986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCATC AACTCACCTT CGCCGATAGT GAATTCAGCA CTAAGCGCCG TCAGACCCGA AAAGAGATTT TCCTCTCCCG CATGGAGCAG ATTCTGCCAT GGCAGAATAT GACCGCTGTC ATCGAGCCGT TTTATCCCAA GGCGGGCAAT GGCCGACGGC CCTATCCGCT GGAGACCATG CTGCGTATTC ACTGCATGCA GCATTGGTAC AACCTGAGCG ACGGTGCCAT GGAAGATGCC CTGTACGAAA TCGCCTCCAT GCGCCTGTTT GCCCGATTAT CCCTGGATAG CGCCCTGCCG GATCGCACCA CCATCATGAA TTTCCGCCAC CTGCTCGAGC AGCATCAACT GGCCCGTCAA TTGTTCAAGA CCATCAATCG CTGGCTGGCC GAAGCAGGCG TCATGATGAC CCAAGGCACT TTGGTGGATG CCACCATCAT TGAGGCACCC AGCTCTACCA AGAACAAAGA GCAGCAACGC GATCCGGAGA TGCATCAGAC CAAGAAAGGC AATCAGTGGC ACTTTGGCAT GAAGGCCCAC ATTGGTGTCG ATGCCAAGAG TGGCCTGACC CACAGCCTAG TCACCACCGC GGCCAACGAG CATGACCTCA ATCAGCTGGG TAATCTGCTT CATGGTGAGG AGCAATTTGT CTCAGCCGAT GCCGGCTACC AAGGAGCGCC ACAGCGCGAG GAGCTGGCCG AGGTGGATGT GGACTGGCTG ATCGCCGAGC GTCCCGGCAA GGTAAAAACC TTGAAGCAGC ATCCGCGCAA GAACAAAACG GCCATCAACA TCGAATACAT GAAAGCCAGC ATCCGTGCCA GGGTGGAGCA CCCGTTTCGC ATCATCAAGC GGCAGTTCGG CTTCGTGAAA GCCAGATACA AGGGGCTGCT GAAAAACGAT AACCAACTGG CGATGTTATT CACCCTGGCC AACCTGTTTC GGGTGGACCA AATGATACGT CAGGGGAGAG ATCTCAGTAA AAACCGGAAA TAA
Protein sequence | MSHQLTFADS EFSTKRRQTR KEIFLSRMEQ ILPWQNMTAV IEPFYPKAGN GRRPYPLETM LRIHCMQHWY NLSDGAMEDA LYEIASMRLF ARLSLDSALP DRTTIMNFRH LLEQHQLARQ LFKTINRWLA EAGVMMTQGT LVDATIIEAP SSTKNKEQQR DPEMHQTKKG NQWHFGMKAH IGVDAKSGLT HSLVTTAANE HDLNQLGNLL HGEEQFVSAD AGYQGAPQRE ELAEVDVDWL IAERPGKVKT LKQHPRKNKT AINIEYMKAS IRARVEHPFR IIKRQFGFVK ARYKGLLKND NQLAMLFTLA NLFRVDQMIR QGRDLSKNRK
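The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch, not the pipeline that produced the record. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TAA, so no reverse complement is applied even though the gene lies on the minus strand). It also relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence above.
print(translate("ATGAGCCATC AACTCACCTT"))  # MSHQLT
# Last four codons of the gene sequence above (ends with the TAA stop).
print(translate("AACCGGAAATAA"))           # NRK
```

The two fragments reproduce the first six and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` allows the same comparison over the whole record.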