Gene Information |
Locus tag | EcSMS35_3131 |
Symbol | |
ID | 6142962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3218016 |
End bp | 3219047 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617995 |
Product | IS630 transposase |
Protein accession | YP_001745145 |
Protein GI | 170681851 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3335] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
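The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (which matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3218016
end_bp = 3219047

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1032
print(protein_length)  # 343
```

Both values agree with the Gene Length (1032 bp) and Protein Length (343 aa) fields. The minus strand does not change this arithmetic; it only determines which strand of the replicon is read.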
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATCA TAGCACCAAT TTCCCGTGAC GAACGACGCC TGATGCAGAA AGCCATCCAT AAAACACACG ATAAAAATTA TGCCCGCAGA CTGACTGCCA TGCTGATGCT GCACCGGGGC GACCGTGTCA GCGACGTTGC CAGAACGCTC TGCTGCGCCC GTTCCTCTGT TGGATGTTGG ATTAACTGGT TCACGCAGTC GGGTGTTGAG GGACTGAAAT CATTACCTGC CGGGCGAGCC CGTCTCTGGT CGTTTGAGCA TATCTGCACA CTGTTACGTG AGCTGGTAAA ACATTCTCCC GGCGACTTTG GCTACCAGCG TTCACGCTGG AGTACAGAAC TGCTGGCAAT AAAAATCAAT GAGATAACCG ATTGCCAGTT AAATGCCGGA ACCGTTCGCC GCTGGTTGCC GTCTGCGGGG ATTGTGTGGC GAAGGGCTGC GCCAACTCTG CGTATCCGTG ACCCGCATAA AGATGAAAAG ATGGCAGCAA TCCATAAAGC ACTGGACGAA TGCAGCACAG AGCATCCGGT CTTTTATGAA GATGAAGTGG ATATCCATCT TAATCCCAAA ATCGGTGCGG ACTGGCAACT GCGCGGACAG CAAAAACGGG TGGTCACGCC GGGACAGAAT GAAAAATATG ATCTGGCCGG AGCGCTGCAC AGCGGGACAG GTAAAGTCAG CTATGTGGGC GGCAACAGCA AAAGTTCGGC GCTGTTCATC AGCCTGCTGA AGCGGCTTAA AGCGACATAC CTTCGGGTGA AAACCATCAC ACTGATCGTG GACAACTACA TTATCCACAA AAGCCGGGAA ACACAGCGCT GGTTGAAGGA GAACCCGAAG TTCAGGGTCA TTTATCAGCC GGTTTACTTG CCATGGGTGA ATCATGTTGA ACGGCTATGG CAGGCACTTC ACGACACAAT AACGCGTAAT CATCAGTGCC GCTCAATGTG GCAACTGTTG AAAAAAATTC GCCATTTTAT GGAAACCATC AGCCCGTTCC CCGGAGGCAA ACATGGGCTG GCAAAAGTGT AG
|
Protein sequence | MPIIAPISRD ERRLMQKAIH KTHDKNYARR LTAMLMLHRG DRVSDVARTL CCARSSVGCW INWFTQSGVE GLKSLPAGRA RLWSFEHICT LLRELVKHSP GDFGYQRSRW STELLAIKIN EITDCQLNAG TVRRWLPSAG IVWRRAAPTL RIRDPHKDEK MAAIHKALDE CSTEHPVFYE DEVDIHLNPK IGADWQLRGQ QKRVVTPGQN EKYDLAGALH SGTGKVSYVG GNSKSSALFI SLLKRLKATY LRVKTITLIV DNYIIHKSRE TQRWLKENPK FRVIYQPVYL PWVNHVERLW QALHDTITRN HQCRSMWQLL KKIRHFMETI SPFPGGKHGL AKV
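The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that split the record's sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases, comparable to the GC content field."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCCGATCA TAGCACCAAT TTCCCGTGAC"
print(translate(first_blocks))  # MPIIAPISRD
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the complete protein sequence and the GC content field.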
|
| |