Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3236 |
Symbol | |
ID | 6145201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3308800 |
End bp | 3309999 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641618066 |
Product | IS200 transposase orfB |
Protein accession | YP_001745216 |
Protein GI | 170682393 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.48969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAATCC TGAAAGCCTA CAAATTCAGA CTGGAACCAA CGCATGAGCA GTCGCAGCGT TTGCGGCAGT TATGTGGTTG TGCCCGTTTT GTCTGGAATT TAGGTCTTGC GGAGACAAAG CGCATACTTG GCTCAGGCGA AAAGTTACCT TCGGCTTTCG AGTTGAATCG GATGATTACA GTGTGGAAAA AAATGCCGGA ATACATCTTC TTACAGGATG CTTATACCGA CAATCTGCAA CAAAAGCTGA AAGACCTGCA TACCGCATGG AAACGTTGTT TTGATAAAAA GCTCGCAGCT AAGGCTCCGG TATGGAAACG AAAAAATGAG GGCAGAGACT CAATCCGTTT TGTGAACTTT GAGAAATATT GCTGCCTTGA AAATCGCAGA GTGAAGCTAC CGTCAGGTCT TGGGTGGGTA AAATTCCGGC AATCTCAACG TGTGAACGGT AAAATCAAAA ATGCGACAAT CAGTCAGTTA GCGGGACAGT GGTATATCTC GTTTCAGGTT GAAATTGAAA CGGCAGAACC AAATCACACA AGCACAACGA TAGTCGGACT GGATGCAGGC GTGGCTAAAC TTGCCACGCT GTCAGATGGC ACAGTCTTTG AGCCTGTAAA CAGTTTTCAG AAAAATCAGA AGAAGCTGGC GAGACTCCAG CGACAGTTAA GCCGCAAGGT CAAATTCAGC AACAACTGGC AGAAGCAGAA ACGCAAAATA CAGCGACTGC ATTCCTGTAT CGCAAATATC CGCAGGGACT ACCTTCACAA AGTCACAACG ACCGTCAGCA AAAACCACGC AATGATAGTC ATTGAGGATT TGAAGGTCAG CAACATGTCA AAGTCGGCAG CGGGTACGGT CAGCCAGCCG GGGCGCAATG TCCGGGCAAA ATCAGGTTTA AACCGTTCGA TACTGGATCA GGGCTGGTAT GAAATGCGCC GCCAGCTTGA GTACAAACAG CTCTGGCGTG GTGGTCATGT AGAGGCGGTA AATCCGGCAT ACACAAGCCA GCGTTGTTCG TGTTGCGGTC ATACGGAAAA AGCAAATCGT CGCACACAAA GTAAGTTTGA GTGCAAAGCA TGTGGGTATG CTGAAAATGC GGACGTAAAC GCAGCACGAA ACATTTTAGC GACGTGGCAC GCTCAAATGG CTACAAGTAC CGCGGGACAC GCGGAAACCG GGAGTCTGTC TCTGGGATAG
|
Protein sequence | MLILKAYKFR LEPTHEQSQR LRQLCGCARF VWNLGLAETK RILGSGEKLP SAFELNRMIT VWKKMPEYIF LQDAYTDNLQ QKLKDLHTAW KRCFDKKLAA KAPVWKRKNE GRDSIRFVNF EKYCCLENRR VKLPSGLGWV KFRQSQRVNG KIKNATISQL AGQWYISFQV EIETAEPNHT STTIVGLDAG VAKLATLSDG TVFEPVNSFQ KNQKKLARLQ RQLSRKVKFS NNWQKQKRKI QRLHSCIANI RRDYLHKVTT TVSKNHAMIV IEDLKVSNMS KSAAGTVSQP GRNVRAKSGL NRSILDQGWY EMRRQLEYKQ LWRGGHVEAV NPAYTSQRCS CCGHTEKANR RTQSKFECKA CGYAENADVN AARNILATWH AQMATSTAGH AETGSLSLG
|
| |