Gene EcSMS35_3236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3236 
Symbol 
ID6145201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3308800 
End bp3309999 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content48% 
IMG OID641618066 
ProductIS200 transposase orfB 
Protein accessionYP_001745216 
Protein GI170682393 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.48969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAATCC TGAAAGCCTA CAAATTCAGA CTGGAACCAA CGCATGAGCA GTCGCAGCGT 
TTGCGGCAGT TATGTGGTTG TGCCCGTTTT GTCTGGAATT TAGGTCTTGC GGAGACAAAG
CGCATACTTG GCTCAGGCGA AAAGTTACCT TCGGCTTTCG AGTTGAATCG GATGATTACA
GTGTGGAAAA AAATGCCGGA ATACATCTTC TTACAGGATG CTTATACCGA CAATCTGCAA
CAAAAGCTGA AAGACCTGCA TACCGCATGG AAACGTTGTT TTGATAAAAA GCTCGCAGCT
AAGGCTCCGG TATGGAAACG AAAAAATGAG GGCAGAGACT CAATCCGTTT TGTGAACTTT
GAGAAATATT GCTGCCTTGA AAATCGCAGA GTGAAGCTAC CGTCAGGTCT TGGGTGGGTA
AAATTCCGGC AATCTCAACG TGTGAACGGT AAAATCAAAA ATGCGACAAT CAGTCAGTTA
GCGGGACAGT GGTATATCTC GTTTCAGGTT GAAATTGAAA CGGCAGAACC AAATCACACA
AGCACAACGA TAGTCGGACT GGATGCAGGC GTGGCTAAAC TTGCCACGCT GTCAGATGGC
ACAGTCTTTG AGCCTGTAAA CAGTTTTCAG AAAAATCAGA AGAAGCTGGC GAGACTCCAG
CGACAGTTAA GCCGCAAGGT CAAATTCAGC AACAACTGGC AGAAGCAGAA ACGCAAAATA
CAGCGACTGC ATTCCTGTAT CGCAAATATC CGCAGGGACT ACCTTCACAA AGTCACAACG
ACCGTCAGCA AAAACCACGC AATGATAGTC ATTGAGGATT TGAAGGTCAG CAACATGTCA
AAGTCGGCAG CGGGTACGGT CAGCCAGCCG GGGCGCAATG TCCGGGCAAA ATCAGGTTTA
AACCGTTCGA TACTGGATCA GGGCTGGTAT GAAATGCGCC GCCAGCTTGA GTACAAACAG
CTCTGGCGTG GTGGTCATGT AGAGGCGGTA AATCCGGCAT ACACAAGCCA GCGTTGTTCG
TGTTGCGGTC ATACGGAAAA AGCAAATCGT CGCACACAAA GTAAGTTTGA GTGCAAAGCA
TGTGGGTATG CTGAAAATGC GGACGTAAAC GCAGCACGAA ACATTTTAGC GACGTGGCAC
GCTCAAATGG CTACAAGTAC CGCGGGACAC GCGGAAACCG GGAGTCTGTC TCTGGGATAG
 
Protein sequence
MLILKAYKFR LEPTHEQSQR LRQLCGCARF VWNLGLAETK RILGSGEKLP SAFELNRMIT 
VWKKMPEYIF LQDAYTDNLQ QKLKDLHTAW KRCFDKKLAA KAPVWKRKNE GRDSIRFVNF
EKYCCLENRR VKLPSGLGWV KFRQSQRVNG KIKNATISQL AGQWYISFQV EIETAEPNHT
STTIVGLDAG VAKLATLSDG TVFEPVNSFQ KNQKKLARLQ RQLSRKVKFS NNWQKQKRKI
QRLHSCIANI RRDYLHKVTT TVSKNHAMIV IEDLKVSNMS KSAAGTVSQP GRNVRAKSGL
NRSILDQGWY EMRRQLEYKQ LWRGGHVEAV NPAYTSQRCS CCGHTEKANR RTQSKFECKA
CGYAENADVN AARNILATWH AQMATSTAGH AETGSLSLG