Gene Information |
Locus tag | EcSMS35_0993 |
Symbol | |
ID | 6146501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1008367 |
End bp | 1011684 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615880 |
Product | putative sensor protein |
Protein accession | YP_001743072 |
Protein GI | 170682376 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3447] Predicted integral membrane sensor domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
| |
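The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are inclusive 1-based coordinates and that Protein Length excludes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above (Start bp, End bp).
length_bp = gene_length(1008367, 1011684)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 3318 1105
```

Under these assumptions the computed values agree with the Gene Length (3318 bp) and Protein Length (1105 aa) rows.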
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.7374 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAC AATCACAGCA TGTATTAATT GCCCTGCCCC ACCCGCTGCT TCACCTGGTC AGTTTAGGTT TAGTCTCGTT TATCTTTACC CTTTTCTCGC TTGAGCTTTC GCAGTTTGGC ACCCAACTCG CCCCACTGTG GTTCCCGACG TCCATCATGA TGGTGGCGTT TTATCGCCAT GCCGGGCGCA TGTGGCCGGG GATTGCGCTG AGCTGCTCGC TGGGAAATAT CGCCGCATCC ATCCTGCTTT TTTCCACCAG CTCGCTGAAC ATGACCTGGA CGACCATCAA TATTGTTGAA GCCGTGGTCG GGGCAGTGCT ACTGCGTAAA TTGCTGCCGT GGTATAACCC ATTGCAAAAT CTGGCTGACT GGCTGCGTCT GGCACTCGGC AGCGCCATTG TTCCACCTCT GTTAGGGGGT GTTCTGGTTA TCCTGCTGAC GCCCGGAGAC GATCCTCTCA GGGCATTTTT GATATGGGTA CTGTCAGAAT CCATCGGCGC ACTGGCACTG GTGCCGCTGG GATTGTTATT TAAACCACAC TATCTGCTGC GCCATCGCAA CCCACGGTTG CTTTTTGAGT CGCTGCTCAC GTTAGCCATC ACACTGACGT TAAGCTGGCT TTCGATGCTG TATCTGCCGT GGCCTTTTAC TTTCATTATT GTGCTGTTGA TGTGGAGCGC CGTGCGCCTG CCACGAATGG AAGCCTTTTT GATCTTCCTT ACCACGGTGA TGATGGTGTC ACTGATGATG GCCGCTGATC CCTCCCTGCT TGCTACGCCG CGTACATACC TGATGAGCCA TATGCCGTGG CTACCGTTTT TGCTGATCCT GCTGCCCGCC AACATCATGA CCATGGTGAT GTATGCCTTT CGTGCGGAAC GCAAACACAT TTCCGAAAGC GAAACCCGTT TTCGTAACGC CATGGAATAT TCCGCCATCG GCATGGCATT AGTGGGCACC GAGGGGCAAT GGCTGCAATC CAACAAAGCA CTCTGCCAGT TTCTCGGTTA CAGTCAGGAA GAGCTGCGCG GACTCACCTT TCAGCAACTG ACCTGGCCGG AGGATCTCAA TAAAGATCTC CAACAGGTTG AAAAGCTGAT CAGCGGTGAA ATAAACACCT ATTCAATGGA AAAACGCTAC TACAACCGCA ATGGCGATGT TGTCTGGGCG TTGCTTGCCG TCTCACTGGT GCGCCACACG GATGGCACGC CGCTCTATTT TATCGCTCAG ATTGAAGACA TTAACGAGCT AAAACGCACC GAACAGGTGA ATCAGCAACT GATGGAGCGC ATCACGCTGG CCAACGAAGC GGGCGGGATT GGCATCTGGG AGTGGGAGCT GAAGCCGAAT ATTTTTAGCT GGGATAAGCG GATGTTCGAG CTGTATGAAA TTCCTCCGCA TATCAAACCG AACTGGCAGG TGTGGTACGA GTGCGTGCTG CCGGAAGATC GCCAGCACGC CGAAAAAGTG ATTCGTGATT CGTTGCAATC ACGCTCGCCC TTTAAACTGG AATTTCGCAT TACCGTGAAA GACGGCATTC GCCATATCCG CGCCCTCGCT AACCGGGTAC TGAATAAAGA AGGCGAAGTC GAACGCCTGC TCGGCATTAA TATGGATATG ACCGAAGTGA AACAGCTTAA CGAGGCATTG TTTCAGGAAA AAGAGCGCCT GCACATAACG CTGGATTCCA TCGGCGAAGC CGTGGTCTGT ATTGATATGG CAATGAAAAT TACCTTTATG AATCCGGTGG CGGAGAAGAT GAGCGGCTGG ACGCAGGAAG AAGCGTTAGG TGTTCCGCTC 
CTGACGGTGT TGCATATTAC TTTTGGCGAC AACGGACCAT TAATGGAGAA CATTTACAGT GCCGACACCT CACGTTCCGC GATTGAACAA GATGTGGTGT TGCACTGCCG AAGCGGCGGC AGCTACGACG TGCATTACAG TATTACGCCG TTAAGTACTC TGGACGGCAG CAATATTGGT TCGGTTCTGG TGATTCAGGA CGTCACCGAA TCGCGCAAAA TGCTGCGCCA GCTGAGCTAC AGCGCCTCCC ATGATGCACT GACGCATCTC GCCAATCGCG CCAGTTTTGA GAAGCAACTA CGCATCCTGC TGCAAACGGT AAACAGTACG CATCAGCGAC ATGCCCTGGT GTTTATCGAT CTTGATCGCT TTAAAGCGGT GAATGACAGC GCCGGGCACG CCGCTGGCGA CGCTTTACTG CGCGAACTGG CGTCATTGAT GCTGAGTATG CTGCGCTCCA GCGACGTGCT GGCGCGACTC GGTGGCGATG AATTTGGTCT GCTGCTGCCA GATTGTAATG TCGAAAGTGC GCGTTTTATC GCTACACGCA TTATCAGCGC CGTGAATGAC TATCACTTTA TCTGGGAAGG ACGAGTACAT CGGGTAGGTG CCAGTGCCGG GATTACCTTG ATTGATGACA ACAATCATCA GGCGGCTGAA GTGATGTCGC AGGCTGATAT CGCCTGTTAT GCCTCCAAAA ATGGTGGACG GGGCCGGGTG ACGGTTTACG AACCGCAGCA AGCTGCCACA AATAGCGAAC GGGCGGTGAT GTCGCTTGAT GAACAGTGGC GGATGATTAA AGAGAATCAG TTGATGATGA TCGCCCACGG TGTCGCTTCG CCGCGGATCC CGCAAGCGCG TAATTTGTGG CTGATTTCAC TTAAGCTCTG GAGTTGCGAA GGCGAGATTA TTGATGAACA AACATTTCGT CGTAGCTTCA GCGATCCGGC ACTTAGCCAT GCTCTTGACC GACGGGTATT CCACGATTTT TTCCAGCAGA CCGCAAAAGC GATTGCCAGT AAAGGCTTAA GCATCGCCCT CCCCCTTTCC GTTGCCGGTT TGAGTAGCGC CACGCTGGTG AATGAACTAA TTGAGCAGCT GGAAAATAGC CCTCTACCAC CACGGTTATT ACATCTGATT ATTCCGGCAG ACGCGATTTT AGATCACGCA GAAAGCGTGC AAAAACTGCG GCTGGCGGGA TGTCGGATCG TATTCAGTCA GGTGGGCCGC GATCTGCAAA TCTTCAACTC GTTGAAAGCA AATATGGCAG ATTACCTGCT ACTTGATGGT GAGTTATGCG CCAACGTGCA GGGAAATTTG ATGGATGAGA TGCTGATTAC GATCATTCAG GGGCACGCTC AGCGACTCGG GATGAAAACC ATCGCCGGGC CAGTCGTTTT ACCCTTAGTG ATGGATACGC TTTCTGGCAT CGGCGTCGAT CTGATTTATG GCGATGTGAT TGCCGATGCC CAACCGCTGG ATTTGCTGGT GAATAGCAGT TATTTCGCGA TTAACTGA
|
Protein sequence | MSKQSQHVLI ALPHPLLHLV SLGLVSFIFT LFSLELSQFG TQLAPLWFPT SIMMVAFYRH AGRMWPGIAL SCSLGNIAAS ILLFSTSSLN MTWTTINIVE AVVGAVLLRK LLPWYNPLQN LADWLRLALG SAIVPPLLGG VLVILLTPGD DPLRAFLIWV LSESIGALAL VPLGLLFKPH YLLRHRNPRL LFESLLTLAI TLTLSWLSML YLPWPFTFII VLLMWSAVRL PRMEAFLIFL TTVMMVSLMM AADPSLLATP RTYLMSHMPW LPFLLILLPA NIMTMVMYAF RAERKHISES ETRFRNAMEY SAIGMALVGT EGQWLQSNKA LCQFLGYSQE ELRGLTFQQL TWPEDLNKDL QQVEKLISGE INTYSMEKRY YNRNGDVVWA LLAVSLVRHT DGTPLYFIAQ IEDINELKRT EQVNQQLMER ITLANEAGGI GIWEWELKPN IFSWDKRMFE LYEIPPHIKP NWQVWYECVL PEDRQHAEKV IRDSLQSRSP FKLEFRITVK DGIRHIRALA NRVLNKEGEV ERLLGINMDM TEVKQLNEAL FQEKERLHIT LDSIGEAVVC IDMAMKITFM NPVAEKMSGW TQEEALGVPL LTVLHITFGD NGPLMENIYS ADTSRSAIEQ DVVLHCRSGG SYDVHYSITP LSTLDGSNIG SVLVIQDVTE SRKMLRQLSY SASHDALTHL ANRASFEKQL RILLQTVNST HQRHALVFID LDRFKAVNDS AGHAAGDALL RELASLMLSM LRSSDVLARL GGDEFGLLLP DCNVESARFI ATRIISAVND YHFIWEGRVH RVGASAGITL IDDNNHQAAE VMSQADIACY ASKNGGRGRV TVYEPQQAAT NSERAVMSLD EQWRMIKENQ LMMIAHGVAS PRIPQARNLW LISLKLWSCE GEIIDEQTFR RSFSDPALSH ALDRRVFHDF FQQTAKAIAS KGLSIALPLS VAGLSSATLV NELIEQLENS PLPPRLLHLI IPADAILDHA ESVQKLRLAG CRIVFSQVGR DLQIFNSLKA NMADYLLLDG ELCANVQGNL MDEMLITIIQ GHAQRLGMKT IAGPVVLPLV MDTLSGIGVD LIYGDVIADA QPLDLLVNSS YFAIN
|
| |
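The sequences above are displayed in space-separated blocks of 10 characters, and the gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a minimal sketch (not the database's own tooling), the Python below strips the block spacing, computes a GC fraction, and translates codons. The helper names are hypothetical. The codon table is the standard one, on the assumption that translation table 11 shares its codon-to-amino-acid assignments and differs mainly in the permitted start codons.

```python
# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence shown above.
first_blocks = "ATGAGCAAAC AATCACAGCA TGTATTAATT"
last_blocks = "TATTTCGCGA TTAACTGA"
print(translate(first_blocks))  # prints: MSKQSQHVLI
print(translate(last_blocks))   # prints: YFAIN
```

The two translated fragments match the first block and the final residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content row.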