Gene EcSMS35_4777 details


Gene Information

Locus tag: EcSMS35_4777
Symbol:
ID: 6146669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4872207
End bp: 4873415
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 45%
IMG OID: 641619587
Product: IS10 transposase
Protein accession: YP_001746694
Protein GI: 170682497
COG category:
COG ID:
TIGRFAM ID:
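The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4872207, 4873415)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```

Both values agree with the Gene Length and Protein Length fields of this record.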


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.271469
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.676926
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCGAAC TCGATATTTT ACACGACTCT CTTTACCAAT TCTGCCCCGA ATTACACTTA 
AAACGACTCA ACAGCTTAAC GTTGGCTTGC CACGCATTAC TTGACTGTAA AACTCTCACT
CTTACCGAAC TTGGCCGTAA CCTGCCAACC AAAGCGAGAA CAAAACATAA CATCAAACGA
ATCGACCGAT TGTTAGGTAA TCGTCACCTC CACAAAGAGC GACTCGCTGT ATACCGTTGG
CATGCTAGCT TTATCTGTTC GGGCAATACG ATGCCCATTG TACTTGTTGA CTGGTCTGAT
ATTCGTGAGC AAAAACGACT TATGGTATTG CGAGCTTCAG TCGCACTACA CGGTCGTTCT
GTTACTCTTT ATGAGAAAGC GTTCCCGCTT TCAGAGCAAT GTTCAAAGAA AGCTCATGAC
CAATTTCTAG CCGACCTTGC GAGCATTCTA CCGAGTAACA CCACACCGCT CATTGTCAGT
GATGCTGGCT TTAAAGTGCC ATGGTATAAA TCCGTTGAGA AGCTGGGTTG GTACTGGTTA
AGTCGAGTAA GAGGAAAAGT ACAATATGCA GACCTAGGAG CGGAAAACTG GAAACCTATC
AGCAACTTAC ATGATATGTC ATCTAGTCAC TCAAAGACTT TAGGCTATAA GAGGCTGACT
AAAAGCAATC CAATCTCATG CCAAATTCTA TTGTATAAAT CTCGCTCTAA AGGCCGAAAA
AATCAGCGCT CGACACGGAC TCATTGTCAC CACCCGTCAC CTAAAATCTA CTCAGCGTCG
GCAAAGGAGC CATGGGTTCT AGCAACTAAC TTACCTGTTG AAATTCGAAC ACCCAAACAA
CTTGTTAATA TCTATTCGAA GCGAATGCAG ATTGAAGAAA CCTTCCGAGA CTTGAAAAGT
CCTGCCTACG GACTAGGCCT ACGCCATAGC CGAACGAGCA GCTCAGAGCG TTTTGATATC
ATGCTGCTAA TCGCCCTGAT GCTTCAACTA ACATGTTGGC TTGCGGGCGT TCATGCTCAG
AAACAAGGTT GGGACAAGCA CTTCCAGGCT AACACAGTCA GAAATCGAAA CGTACTCTCA
ACAGTTCGCT TAGGCATGGA AGTTTTGCGG CATTCTGGCT ACACAATAAC AAGGGAAGAC
TTACTCGTGG CTGCAACCCT ACTAGCTCAA AATTTATTCA CACATGGTTA CGCTTTGGGG
AAATTATGA
 
Protein sequence
MCELDILHDS LYQFCPELHL KRLNSLTLAC HALLDCKTLT LTELGRNLPT KARTKHNIKR 
IDRLLGNRHL HKERLAVYRW HASFICSGNT MPIVLVDWSD IREQKRLMVL RASVALHGRS
VTLYEKAFPL SEQCSKKAHD QFLADLASIL PSNTTPLIVS DAGFKVPWYK SVEKLGWYWL
SRVRGKVQYA DLGAENWKPI SNLHDMSSSH SKTLGYKRLT KSNPISCQIL LYKSRSKGRK
NQRSTRTHCH HPSPKIYSAS AKEPWVLATN LPVEIRTPKQ LVNIYSKRMQ IEETFRDLKS
PAYGLGLRHS RTSSSERFDI MLLIALMLQL TCWLAGVHAQ KQGWDKHFQA NTVRNRNVLS
TVRLGMEVLR HSGYTITRED LLVAATLLAQ NLFTHGYALG KL
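
The gene and protein sequences above are printed in space-separated blocks of 10. The following is a minimal sketch of how such text could be cleaned, translated, and scored for GC content. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in the start codons they permit), and it does not handle alternative start codons or ambiguous bases. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; the amino-acid assignments are the
# same for NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display formatting."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence, then its final nine bases.
print(translate("ATGTGCGAAC TCGATATTTT"))  # MCELDI
print(translate("AAATTATGA"))              # KL
print(gc_percent("ATGC"))                  # 50.0
```

The two translated fragments match the start (MCELDI...) and end (...KL) of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would be the natural next check against the Protein Length and GC content fields.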