Gene EcSMS35_0993 details

Gene Information

Locus tag: EcSMS35_0993
Symbol:
ID: 6146501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1008367
End bp: 1011684
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 52%
IMG OID: 641615880
Product: putative sensor protein
Protein accession: YP_001743072
Protein GI: 170682376
COG category: [T] Signal transduction mechanisms
COG ID: [COG3447] Predicted integral membrane sensor domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR00254] diguanylate cyclase (GGDEF) domain
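
The start, end, gene length and protein length fields can be cross-checked against one another. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length = gene_length(1008367, 1011684)
print(length)                  # 3318
print(protein_length(length))  # 1105
```

Both values agree with the Gene Length and Protein Length fields of this record.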


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.7374
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAC AATCACAGCA TGTATTAATT GCCCTGCCCC ACCCGCTGCT TCACCTGGTC 
AGTTTAGGTT TAGTCTCGTT TATCTTTACC CTTTTCTCGC TTGAGCTTTC GCAGTTTGGC
ACCCAACTCG CCCCACTGTG GTTCCCGACG TCCATCATGA TGGTGGCGTT TTATCGCCAT
GCCGGGCGCA TGTGGCCGGG GATTGCGCTG AGCTGCTCGC TGGGAAATAT CGCCGCATCC
ATCCTGCTTT TTTCCACCAG CTCGCTGAAC ATGACCTGGA CGACCATCAA TATTGTTGAA
GCCGTGGTCG GGGCAGTGCT ACTGCGTAAA TTGCTGCCGT GGTATAACCC ATTGCAAAAT
CTGGCTGACT GGCTGCGTCT GGCACTCGGC AGCGCCATTG TTCCACCTCT GTTAGGGGGT
GTTCTGGTTA TCCTGCTGAC GCCCGGAGAC GATCCTCTCA GGGCATTTTT GATATGGGTA
CTGTCAGAAT CCATCGGCGC ACTGGCACTG GTGCCGCTGG GATTGTTATT TAAACCACAC
TATCTGCTGC GCCATCGCAA CCCACGGTTG CTTTTTGAGT CGCTGCTCAC GTTAGCCATC
ACACTGACGT TAAGCTGGCT TTCGATGCTG TATCTGCCGT GGCCTTTTAC TTTCATTATT
GTGCTGTTGA TGTGGAGCGC CGTGCGCCTG CCACGAATGG AAGCCTTTTT GATCTTCCTT
ACCACGGTGA TGATGGTGTC ACTGATGATG GCCGCTGATC CCTCCCTGCT TGCTACGCCG
CGTACATACC TGATGAGCCA TATGCCGTGG CTACCGTTTT TGCTGATCCT GCTGCCCGCC
AACATCATGA CCATGGTGAT GTATGCCTTT CGTGCGGAAC GCAAACACAT TTCCGAAAGC
GAAACCCGTT TTCGTAACGC CATGGAATAT TCCGCCATCG GCATGGCATT AGTGGGCACC
GAGGGGCAAT GGCTGCAATC CAACAAAGCA CTCTGCCAGT TTCTCGGTTA CAGTCAGGAA
GAGCTGCGCG GACTCACCTT TCAGCAACTG ACCTGGCCGG AGGATCTCAA TAAAGATCTC
CAACAGGTTG AAAAGCTGAT CAGCGGTGAA ATAAACACCT ATTCAATGGA AAAACGCTAC
TACAACCGCA ATGGCGATGT TGTCTGGGCG TTGCTTGCCG TCTCACTGGT GCGCCACACG
GATGGCACGC CGCTCTATTT TATCGCTCAG ATTGAAGACA TTAACGAGCT AAAACGCACC
GAACAGGTGA ATCAGCAACT GATGGAGCGC ATCACGCTGG CCAACGAAGC GGGCGGGATT
GGCATCTGGG AGTGGGAGCT GAAGCCGAAT ATTTTTAGCT GGGATAAGCG GATGTTCGAG
CTGTATGAAA TTCCTCCGCA TATCAAACCG AACTGGCAGG TGTGGTACGA GTGCGTGCTG
CCGGAAGATC GCCAGCACGC CGAAAAAGTG ATTCGTGATT CGTTGCAATC ACGCTCGCCC
TTTAAACTGG AATTTCGCAT TACCGTGAAA GACGGCATTC GCCATATCCG CGCCCTCGCT
AACCGGGTAC TGAATAAAGA AGGCGAAGTC GAACGCCTGC TCGGCATTAA TATGGATATG
ACCGAAGTGA AACAGCTTAA CGAGGCATTG TTTCAGGAAA AAGAGCGCCT GCACATAACG
CTGGATTCCA TCGGCGAAGC CGTGGTCTGT ATTGATATGG CAATGAAAAT TACCTTTATG
AATCCGGTGG CGGAGAAGAT GAGCGGCTGG ACGCAGGAAG AAGCGTTAGG TGTTCCGCTC
CTGACGGTGT TGCATATTAC TTTTGGCGAC AACGGACCAT TAATGGAGAA CATTTACAGT
GCCGACACCT CACGTTCCGC GATTGAACAA GATGTGGTGT TGCACTGCCG AAGCGGCGGC
AGCTACGACG TGCATTACAG TATTACGCCG TTAAGTACTC TGGACGGCAG CAATATTGGT
TCGGTTCTGG TGATTCAGGA CGTCACCGAA TCGCGCAAAA TGCTGCGCCA GCTGAGCTAC
AGCGCCTCCC ATGATGCACT GACGCATCTC GCCAATCGCG CCAGTTTTGA GAAGCAACTA
CGCATCCTGC TGCAAACGGT AAACAGTACG CATCAGCGAC ATGCCCTGGT GTTTATCGAT
CTTGATCGCT TTAAAGCGGT GAATGACAGC GCCGGGCACG CCGCTGGCGA CGCTTTACTG
CGCGAACTGG CGTCATTGAT GCTGAGTATG CTGCGCTCCA GCGACGTGCT GGCGCGACTC
GGTGGCGATG AATTTGGTCT GCTGCTGCCA GATTGTAATG TCGAAAGTGC GCGTTTTATC
GCTACACGCA TTATCAGCGC CGTGAATGAC TATCACTTTA TCTGGGAAGG ACGAGTACAT
CGGGTAGGTG CCAGTGCCGG GATTACCTTG ATTGATGACA ACAATCATCA GGCGGCTGAA
GTGATGTCGC AGGCTGATAT CGCCTGTTAT GCCTCCAAAA ATGGTGGACG GGGCCGGGTG
ACGGTTTACG AACCGCAGCA AGCTGCCACA AATAGCGAAC GGGCGGTGAT GTCGCTTGAT
GAACAGTGGC GGATGATTAA AGAGAATCAG TTGATGATGA TCGCCCACGG TGTCGCTTCG
CCGCGGATCC CGCAAGCGCG TAATTTGTGG CTGATTTCAC TTAAGCTCTG GAGTTGCGAA
GGCGAGATTA TTGATGAACA AACATTTCGT CGTAGCTTCA GCGATCCGGC ACTTAGCCAT
GCTCTTGACC GACGGGTATT CCACGATTTT TTCCAGCAGA CCGCAAAAGC GATTGCCAGT
AAAGGCTTAA GCATCGCCCT CCCCCTTTCC GTTGCCGGTT TGAGTAGCGC CACGCTGGTG
AATGAACTAA TTGAGCAGCT GGAAAATAGC CCTCTACCAC CACGGTTATT ACATCTGATT
ATTCCGGCAG ACGCGATTTT AGATCACGCA GAAAGCGTGC AAAAACTGCG GCTGGCGGGA
TGTCGGATCG TATTCAGTCA GGTGGGCCGC GATCTGCAAA TCTTCAACTC GTTGAAAGCA
AATATGGCAG ATTACCTGCT ACTTGATGGT GAGTTATGCG CCAACGTGCA GGGAAATTTG
ATGGATGAGA TGCTGATTAC GATCATTCAG GGGCACGCTC AGCGACTCGG GATGAAAACC
ATCGCCGGGC CAGTCGTTTT ACCCTTAGTG ATGGATACGC TTTCTGGCAT CGGCGTCGAT
CTGATTTATG GCGATGTGAT TGCCGATGCC CAACCGCTGG ATTTGCTGGT GAATAGCAGT
TATTTCGCGA TTAACTGA
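
The gene sequence is printed in space-separated blocks of ten bases, so any calculation on it first has to strip the whitespace. The sketch below does that and computes a GC percentage, the quantity reported in the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input, so its result will not match the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a block-formatted DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGAGCAAAC AATCACAGCA TGTATTAATT GCCCTGCCCC ACCCGCTGCT TCACCTGGTC"
print(len(clean_sequence(first_row)))
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence to `gc_percent` gives the whole-gene value to compare with the GC content field.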
 
Protein sequence
MSKQSQHVLI ALPHPLLHLV SLGLVSFIFT LFSLELSQFG TQLAPLWFPT SIMMVAFYRH 
AGRMWPGIAL SCSLGNIAAS ILLFSTSSLN MTWTTINIVE AVVGAVLLRK LLPWYNPLQN
LADWLRLALG SAIVPPLLGG VLVILLTPGD DPLRAFLIWV LSESIGALAL VPLGLLFKPH
YLLRHRNPRL LFESLLTLAI TLTLSWLSML YLPWPFTFII VLLMWSAVRL PRMEAFLIFL
TTVMMVSLMM AADPSLLATP RTYLMSHMPW LPFLLILLPA NIMTMVMYAF RAERKHISES
ETRFRNAMEY SAIGMALVGT EGQWLQSNKA LCQFLGYSQE ELRGLTFQQL TWPEDLNKDL
QQVEKLISGE INTYSMEKRY YNRNGDVVWA LLAVSLVRHT DGTPLYFIAQ IEDINELKRT
EQVNQQLMER ITLANEAGGI GIWEWELKPN IFSWDKRMFE LYEIPPHIKP NWQVWYECVL
PEDRQHAEKV IRDSLQSRSP FKLEFRITVK DGIRHIRALA NRVLNKEGEV ERLLGINMDM
TEVKQLNEAL FQEKERLHIT LDSIGEAVVC IDMAMKITFM NPVAEKMSGW TQEEALGVPL
LTVLHITFGD NGPLMENIYS ADTSRSAIEQ DVVLHCRSGG SYDVHYSITP LSTLDGSNIG
SVLVIQDVTE SRKMLRQLSY SASHDALTHL ANRASFEKQL RILLQTVNST HQRHALVFID
LDRFKAVNDS AGHAAGDALL RELASLMLSM LRSSDVLARL GGDEFGLLLP DCNVESARFI
ATRIISAVND YHFIWEGRVH RVGASAGITL IDDNNHQAAE VMSQADIACY ASKNGGRGRV
TVYEPQQAAT NSERAVMSLD EQWRMIKENQ LMMIAHGVAS PRIPQARNLW LISLKLWSCE
GEIIDEQTFR RSFSDPALSH ALDRRVFHDF FQQTAKAIAS KGLSIALPLS VAGLSSATLV
NELIEQLENS PLPPRLLHLI IPADAILDHA ESVQKLRLAG CRIVFSQVGR DLQIFNSLKA
NMADYLLLDG ELCANVQGNL MDEMLITIIQ GHAQRLGMKT IAGPVVLPLV MDTLSGIGVD
LIYGDVIADA QPLDLLVNSS YFAIN
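
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that translation in plain Python follows. It assumes an ATG start codon, which holds for this gene, and so uses the standard codon assignments; table 11 shares these with the standard code and differs only in which codons may act as start codons. `translate` is an illustrative helper, not a database or library function.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGAGCAAAC AATCACAGCA TGTATTAATT"))  # MSKQSQHVLI
print(translate("TATTTCGCGA TTAACTGA"))               # YFAIN
```

The two outputs match the first ten and last five residues of the protein sequence listed above.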