Gene EcSMS35_4630 details


Gene Information

Locus tag: EcSMS35_4630
Symbol:
ID: 6143377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4730709
End bp: 4734032
Gene Length: 3324 bp
Protein Length: 1107 aa
Translation table: 11
GC content: 56%
IMG OID: 641619446
Product: hypothetical protein
Protein accession: YP_001746557
Protein GI: 170682564
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3264] Small-conductance mechanosensitive channel
TIGRFAM ID:
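
The coordinate and length fields above can be checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
START_BP = 4730709
END_BP = 4734032


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 3324 1107
```

Both results agree with the Gene Length and Protein Length fields of the record.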


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.250026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0442574
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCCTGA TTATCACTTT TCTGATGGCC TGGTGCCTCA GTTGGGGGGC GTACGCCGCG 
ACGGCCCCCG ATAGCAAACA AATCTCACAA GAACTGGAGC AGGCAAAAGC GGCGAAACCC
GCGCAACCGG AAGTCGTAGA GGCGCTCCAG TCTGCCTTAA ATGCGCTTGA GGAACGGAAA
GGTTCCCTTG AGCGCATCAA GCAATATCAA GAAGTCATTG ATAATTATCC GAAACTCTCC
GCTACTCTGC GCGCGCAATT AAACAACATG CGTGACGAGC CGCGCAGCGT GTCACCGGGA
ATGTCTACCG ACGCGCTGAA TCAGGAAATT CTCCAGGTCA GCAGCCAGTT GCTGGATAAA
AGCCGTCAGG CCCAGCAAGA GCAGGAACGC GCCCGCGAGA TTGCCGATTC GCTGAATCAA
CTGCCGCAAC AGCAAACCGA TGCCCGCCGT CAGTTAAATG AAATCGAGCG CCGCCTGGGG
ACGCTTACCG GCAATTCCCC GCTCAATCAG GCACAAAATT TCGCGTTGCA GTCTGACTCC
GCACGCCTGA AAGCGCTCGT CGATGAACTG GAACTGGCGC AGCTGTCCGC CAATAACCGC
CAGGAATTAG CGCGCTTACG CTCAGAGCTG GCGGAAAAAG AGAGCCAGCA ACTGGATGCA
TATTTGCAGG CCTTACGTAA TCAGTTGAAC AGCCAACGTC AGCTTGAGGC GGAGCGGGCG
CTGGAAAGTA CCGAATTACT GGCAGAAAAC AGTGCCGATT TGCCGAAAGA TATCGTCGCG
CAATTCAAAA TTAACCGCGA ACTATCGGCA GCGCTGAATC AACAGGCGCA GCGGATGGAT
CTCGTTGCCT CGCAACAGCG TCAGGCTGCC AGCCAGACGT TACAGGTCCG GCAGGCGTTG
AATACGCTGC GTGAACAGTC GCAATGGCTG GGCTCGTCCA ACCTGCTCGG CGAAGCGTTG
CGGGCGCAGG TGGCACGGCT GCCGGAAATG CCAAAACCAC AACAGCTTGA TACCGAAATG
GCGCAGTTGC GTGTGCAACG GTTACGTTAT GAGGATCTGC TTAATAAACA GCCGCAGCTA
AGGCAAATTC ATCAGGCCGA CGGTCAGCCG CTGACCGCCG AGCAAAACCG TATTCTGGAA
GCACAGCTAC GCACTCAGCG TGAGTTGCTG AACTCATTGT TGCAGGGTGG CGACACGCTA
CTACTGGAAC TGACCAAGCT GAAAGTCTCC AACGGGCAAC TGGAGGATGC GCTGAAAGAG
GTGAATGAAG CGACGCACCG CTATCTGTTC TGGACCTCTG ACGTGCGCCC GATGACCATC
GCCTGGCCGC TGGAAATCGC CCAGGATCTG CGTCGTCTCA TTTCGCTGGA CACCTTCAGT
CAGTTGGGCA AAGCCAGTGT GATGATGCTG ACCAGTAAAG AGACAATTTT ACCGCTGTTT
GGCGCGTTGA TTCTGGTCGG TTGCAGTATT TACTCACGCC GCTATTTCAC CCGTTTTCTT
GAACGTTCGG CGGCGAAAGT CGGCAAAGTG ACTCAGGATC ACTTCTGGCT GACGTTACGC
ACTCTTTTCT GGTCGATTCT CGTCGCGTCA CCGCTACCGG TGCTGTGGAT GACGCTGGGT
TACGGCTTGC GCGAGGCGTG GCCTTATCCG CTGGCGGTCG CGATTGGCGA TGGTGTAACG
GCCACCGTGC CGCTGCTGTG GGTAGTGATG ATTTGCGCTA CCTTTGCCCG CCCGAACGGC
TTGTTTATCG CTCATTTTGG CTGGCCGCGC GAACGCGTTT CCCGTGGGAT GCGCTACTAC
CTGATGAGCA TCGGGCTTAT TGTGCCACTG ATTATGGCGC TGATGATGTT CGATAACCTC
GACGACCGCG AATTTTCCGG TTCGCTGGGA CGGCTTTGCT TTATCCTCAT TTGCGGTGCG
CTGGCGGTGG TCACCCTCAG CCTGAAAAAG GCCGGGATCC CGCTGTATCT CAACAAAGAA
GGCAGCGGCG ACAACATTAC CAACCATATG CTGTGGAACA TGATGATTAG CGCACCGCTG
GTCGCTATTC TGGCGTCGGC GGTGGGTTAT CTGGCAACGG CGCAGGCACT GTTAGCGAGG
CTTGAAACCT CGGTTGCCAT CTGGTTCCTG CTACTGGTGG TTTATCACGT TATCCGCCGC
TGGATGCTGA TCCAGCGCCG CAGGCTGGCG TTTGACCGGG CGAAGCATCG CCGGGCAGAG
ATGTTAGCGC AACGTGCGCG TGGCGAAGAG GAAGCGCATC ATCACAGTAG CCCGGAAGGA
GCAATTGAAG TCGATGAAAG CGAAGTCGAT CTCGATGCCA TCAGTGCGCA ATCCTTGCGG
CTGGTGCGCT CAATTTTGAT GTTGATCGCC CTGCTTTCGG TCATTGTGCT GTGGTCAGAA
ATCCATTCCG CTTTCGGCTT CCTCGAAAAT ATTTCGCTGT GGGATGTCAC TTCCACGGTA
CAGGGCGTAG AAAGCCTGGA GCCGATTACC CTCGGTGCGG TACTGATTGC CATTCTGGTG
TTTATCATCA CCACGCAGCT GGTGCGCAAC CTGCCCGCGC TGCTGGAACT GGCGATTCTG
CAGCACCTGG ATTTAACGCC GGGTACGGGC TACGCCATCA CCACCATCAC CAAATATCTG
CTGATGCTGA TTGGCGGGCT GGTCGGCTTC TCGATGATTG GTATTGAGTG GTCGAAATTG
CAGTGGCTGG TCGCCGCGCT CGGTGTTGGT CTCGGTTTTG GTTTGCAGGA AATTTTCGCC
AACTTTATCT CTGGCCTGAT TATCCTGTTC GAAAAACCGA TTCGCATTGG CGATACGGTA
ACGATCCGCG ATCTTACCGG TAGTGTGACG AAAATTAACA CCCGCGCCAC CACCATCAGC
GACTGGGACC GCAAAGAAAT TATCGTGCCG AACAAGGCGT TTATTACCGA GCAGTTTATC
AACTGGTCGC TCTCTGACTC GGTCACGCGC GTGGTGTTGA CGATTCCGGC CCCTGCCGAT
GCCAACAGTG AAGAAGTGAC GGAAATCCTG CTCACCGCAG CGCGTCGCTG CTCGCTGGTG
ATCGACAACC CGGCACCGGA AGTCTTCCTG GTGGATCTGC AACAGGGGAT TCAGATTTTC
GAGCTGCGTA TTTACGCCGC AGAAATGGGT CACCGCATGC CGCTACGCCA TGAGATTCAC
CAGCTAATTC TGGCAGGCTT CCATGCCCAC GGTATCGATA TGCCATTCCC GCCCTTCCAG
ATGCGTCTGG AAAGCCTGAA TGGTAAACAA ACGGGGAGAA CATTAACTTC TGCGGGCAAA
GGTCGTCAGG CGGGAAGTTT GTAA
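
A GC content figure such as the one reported for this gene is the percentage of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from the space-separated 10-base blocks used in this record; `gc_content` is a hypothetical helper, and the example runs only on the first block of the gene sequence, not on the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First 10-base block of the gene sequence: 7 of its 10 bases are G or C.
print(gc_content("GTGCGCCTGA"))  # 70.0
```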
 
Protein sequence
MRLIITFLMA WCLSWGAYAA TAPDSKQISQ ELEQAKAAKP AQPEVVEALQ SALNALEERK 
GSLERIKQYQ EVIDNYPKLS ATLRAQLNNM RDEPRSVSPG MSTDALNQEI LQVSSQLLDK
SRQAQQEQER AREIADSLNQ LPQQQTDARR QLNEIERRLG TLTGNSPLNQ AQNFALQSDS
ARLKALVDEL ELAQLSANNR QELARLRSEL AEKESQQLDA YLQALRNQLN SQRQLEAERA
LESTELLAEN SADLPKDIVA QFKINRELSA ALNQQAQRMD LVASQQRQAA SQTLQVRQAL
NTLREQSQWL GSSNLLGEAL RAQVARLPEM PKPQQLDTEM AQLRVQRLRY EDLLNKQPQL
RQIHQADGQP LTAEQNRILE AQLRTQRELL NSLLQGGDTL LLELTKLKVS NGQLEDALKE
VNEATHRYLF WTSDVRPMTI AWPLEIAQDL RRLISLDTFS QLGKASVMML TSKETILPLF
GALILVGCSI YSRRYFTRFL ERSAAKVGKV TQDHFWLTLR TLFWSILVAS PLPVLWMTLG
YGLREAWPYP LAVAIGDGVT ATVPLLWVVM ICATFARPNG LFIAHFGWPR ERVSRGMRYY
LMSIGLIVPL IMALMMFDNL DDREFSGSLG RLCFILICGA LAVVTLSLKK AGIPLYLNKE
GSGDNITNHM LWNMMISAPL VAILASAVGY LATAQALLAR LETSVAIWFL LLVVYHVIRR
WMLIQRRRLA FDRAKHRRAE MLAQRARGEE EAHHHSSPEG AIEVDESEVD LDAISAQSLR
LVRSILMLIA LLSVIVLWSE IHSAFGFLEN ISLWDVTSTV QGVESLEPIT LGAVLIAILV
FIITTQLVRN LPALLELAIL QHLDLTPGTG YAITTITKYL LMLIGGLVGF SMIGIEWSKL
QWLVAALGVG LGFGLQEIFA NFISGLIILF EKPIRIGDTV TIRDLTGSVT KINTRATTIS
DWDRKEIIVP NKAFITEQFI NWSLSDSVTR VVLTIPAPAD ANSEEVTEIL LTAARRCSLV
IDNPAPEVFL VDLQQGIQIF ELRIYAAEMG HRMPLRHEIH QLILAGFHAH GIDMPFPPFQ
MRLESLNGKQ TGRTLTSAGK GRQAGSL
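
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch of that translation, not the database's own pipeline. Table 11 assigns amino acids as the standard code does but accepts alternative initiation codons such as GTG, which is why the first codon here is read as M rather than V. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are illustrative.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of table 11; as a start codon each is read as methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # alternative start codons initiate with Met
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("GTGCGCCTGA TTATCACTTT TCTGATGGCC"))  # MRLIITFLMA
```

Because whitespace is stripped before translation, the gene sequence can be passed in the same 10-base block layout used in this record.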