Gene EcSMS35_0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0831 
Symbol 
ID6143031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp835077 
End bp837302 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content54% 
IMG OID641615720 
Producthypothetical protein 
Protein accessionYP_001742912 
Protein GI170680728 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0668] Small-conductance mechanosensitive channel 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0229209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTGGA TCCTGTTCAT CCTCTTCTGC CTGCTGGGCG CACCTGCCCA CGCGGTATCC 
ATACCCGGCG TTACAACCAC AACGACAACG GACTCAACGA CTGAACCGGC CCCGGAACCG
GATATCGAAC AAAAAAAAGC GGCCTATGGC GCACTGGCGG ATGTGCTGGA TAATGACACC
TCGCGTAAAG AGTTGATCGA CCAGTTGCGC ACCGTCGCCG CGACGCCCCC GGCAGAACCC
GTACCGAAGA TCGTGCCGCC GACGCTGGTT GAAGAGCAAA CCGTGCTGCA AAAGGTCACC
GAAGTCAGCC GCCATTATGG TGAAGCCCTT TCCGCCCGCT TCGGGCAACT TTATCGCAAT
ATCACCGGCT CCCCGCATAA GCCGTTTAAT CCACAAACCT TCAGCAATGC GTTGACCCAT
TTTTCAATGT TAGCGGTATT AGTGTTTGTT TTTTACTGGC TGATTCGCCT GTGCGCACTT
CCGCTGTATC GCAAAATGGG CCAGTGGGCG CGGCAAAAAA ATCGTGAGCG CAGTAACTGG
TTGCAGCTTC CGGCGATGAT TATCGGGGCG TTTATTATCG ACCTGCTGTT ACTGGCACTG
ACATTGTTTG TCGGCCAGGT ATTAAGCGAC AACCTGAATG CGGGCAGTCG CACCATCGCT
TTCCAACAAA GTTTGTTTCT CAACGCCTTT GCCCTGATTG AATTTTTCAA AGCCGTACTA
CGCCTGATTT TTTGCCCAAA CGTGGCTGAA CTGCGCCCGT TCACGATTCA TGACGAGACC
GCCCGTTACT GGAGCCGTCG CCTGAGCTGG TTAAGCAGCC TGATTGGCTA TGGCCTGATT
GTGGCCGTGC CGATTATCTC TAATCAGGTG AATGTACAGA TAGGTGCGCT GGCGAACGTC
ATCATTATGC TGTGCATGAC GGTCTGGGCG CTGTACCTGA TCTTTCGTAA TAAAAAAGAG
ATTACCCAGC ATTTGCTCAA CTTCGCGGAG CATTCGCTGG CCTTTTTCAG CCTGTTTATC
CGCGCCTTTG CGCTGGTGTG GCACTGGCTG GCAAGCGCCT ATTTTATCGT GCTGTTTTTC
TTTTCGTTGT TCGATCCGGG CAACAGCCTG AAATTTATGA TGGGTGCAAC GGTGCGCAGC
CTGGCGATTA TTGGTATCGC AGCATTTGTT TCCGGTATGT TTTCCCGCTG GCTGGCGAAA
ACCATCACCC TCTCGCCACA TACTCAGCGT AACTATCCGG AGCTGCAAAA ACGGTTGAAT
GGCTGGCTGT CGGCGGCGCT GAAAACGGCG CGTATTCTGA CAGTCTGCGT GGCGGTAATG
CTGCTGTTGA GCGCATGGGG ATTGTTCGAT TTCTGGAACT GGCTGCAAAA CGGCGCGGGG
CAGAAAACCG TAGATATCCT GATCCGTATC GCACTCATTC TTTTCTTCTC GGCGGTTGGC
TGGACAGTGC TCGCCAGTTT GATCGAAAAC CGGCTGGCTT CGGATATTCA TGGCCGCCCG
CTACCCAGCG CCCGTACGCG TACCCTGCTG ACGCTGTTTC GTAACGCGCT GGCGGTGATT
ATCAGTACCA TCACCATCAT GATTGTGTTG TCGGAAATCG GCGTTAATAT CGCGCCATTG
CTGGCAGGTG CCGGGGCATT AGGTCTGGCT ATCTCGTTTG GTTCGCAAAC GCTGGTGAAA
GATATTATCA CCGGGGTATT TATTCAGTTT GAAAACGGCA TGAACACTGG AGATTTGGTG
ACTATCGGGC CGTTGACCGG CACTGTGGAA CGGATGTCGA TTCGCTCCGT GGGCGTGCGT
CAGGATACCG GGGCGTATCA CATCATTCCG TGGTCTTCGA TAACTACCTT TGCTAACTTC
GTCCGCGGCA TTGGTTCGGT GGTGGCAAAT TACGATGTTG ATCGCCATGA AGATGCTGAT
AAAGCCAATC AGGCACTGAA AGATGCGGTA GCGGAATTAA TGGAAAACGA AGAAATTCGC
GGGCTGATTA TTGGTGAACC GAATTTTGCC GGGATTGTCG GCTTAAGCAA TACCGCGTTT
ACACTGCGTG TTTCGTTCAC CACGCTGCCA CTCAAACAGT GGACGGTCCG CTTTGCCCTC
GACAGCCAGG TGAAAAAACA TTTCGACCTG GCGGGCGTTC GCGCGCCAGT GCAGACTTAT
CAGGTGCTGC CTGCTCCGGG CGCGACCCCG GCTGAACCGT TGCCGCCGGG GGAACCAACG
CTTTAA
 
Protein sequence
MRWILFILFC LLGAPAHAVS IPGVTTTTTT DSTTEPAPEP DIEQKKAAYG ALADVLDNDT 
SRKELIDQLR TVAATPPAEP VPKIVPPTLV EEQTVLQKVT EVSRHYGEAL SARFGQLYRN
ITGSPHKPFN PQTFSNALTH FSMLAVLVFV FYWLIRLCAL PLYRKMGQWA RQKNRERSNW
LQLPAMIIGA FIIDLLLLAL TLFVGQVLSD NLNAGSRTIA FQQSLFLNAF ALIEFFKAVL
RLIFCPNVAE LRPFTIHDET ARYWSRRLSW LSSLIGYGLI VAVPIISNQV NVQIGALANV
IIMLCMTVWA LYLIFRNKKE ITQHLLNFAE HSLAFFSLFI RAFALVWHWL ASAYFIVLFF
FSLFDPGNSL KFMMGATVRS LAIIGIAAFV SGMFSRWLAK TITLSPHTQR NYPELQKRLN
GWLSAALKTA RILTVCVAVM LLLSAWGLFD FWNWLQNGAG QKTVDILIRI ALILFFSAVG
WTVLASLIEN RLASDIHGRP LPSARTRTLL TLFRNALAVI ISTITIMIVL SEIGVNIAPL
LAGAGALGLA ISFGSQTLVK DIITGVFIQF ENGMNTGDLV TIGPLTGTVE RMSIRSVGVR
QDTGAYHIIP WSSITTFANF VRGIGSVVAN YDVDRHEDAD KANQALKDAV AELMENEEIR
GLIIGEPNFA GIVGLSNTAF TLRVSFTTLP LKQWTVRFAL DSQVKKHFDL AGVRAPVQTY
QVLPAPGATP AEPLPPGEPT L