Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1217 |
Symbol | |
ID | 6144957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1221247 |
End bp | 1222605 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641616095 |
Product | heavy metal sensor histidine kinase |
Protein accession | YP_001743278 |
Protein GI | 170681228 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR01386] heavy metal sensor kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00057319 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.58155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAT TATCTATAAC CGTCCGTTTA ACCTTGCTTT TTATATTGCT GCTGTCTGTT GCTGGCGCTG GAATTGTATG GACTCTCTAT AATGGCCTGG CAAGTGAGTT GAAATGGCGC GATGATACAA CACTCATTAA CCGGACAGCG CAGATCAAGC AGTTGTTAAT TGATGGGGTA AATCCAGATA CGTTACCTGT GTACTTTAAC CGGATGATGG ATGTTAGTCA GGATATCTTG ATCATTCATG GTGATGGCAT CAATAAAATT GTTAACCGGA CGAATGTCAG TGATGACATG TTAAATAACA TACCTGCTAG TGAGACAATC AGCGCAGCTG GCATTTACAG AAGCATTATT AATGATACAG AGATAGATGC TTTACGAATT AATATTGATG AAGTTTCGCC ATCATTAACG GTTACTGTGG CTAAATTGGC TTCAGCCAGA CATAACATGC TTGAACAGTA TAAAATTAAT AGCATTATAA TTTGCATTGT CGCCATTGTA CTTTGCTCAG TATTAAGTCC GCTGTTAATC AGAACGGGAT TACGAGAGAT CAAAAAGTTG AGTGGTGTAA CGGAAGCGCT GAATTATAAC GATAGCCGGG AGCCTGTTGA GGTTAGCGCA TTACCGAGAG AACTAAAACC TCTTGGGCAG GCGTTGAATA AAATGCATCA AGCCTTAGTC AAAGATTTTG AACGCCTAAG TCAATTTGCT GACGATCTCG CTCATGAACT TAGAACGCCC ATTAATGCAT TACTGGGTCA GAATCAGGTT ACGCTCAGTC AAACCAGAAG TATCGCTGAA TATCAAAAAA CAATTGCCGG TAACATTGAA GAGCTGGAAA ATATTTCGCG GTTAACAGAG AACATACTGT TTCTTGCCCG GGCAGATAAA AACAATGTTT TGGTGAAACT GGACTCGCTT TCTCTCAATA AGGAAGTCGA AAATTTGTTG GATTATCTTG AATACCTTTC AGACGAGAAA GAGATTTGCT TTAAGGTCGA GTGCAATCAG CAAATCTTTG CGGATAAAAT TTTACTGCAA CGAATGTTAT CGAATCTTAT TGTTAATGCC ATTAGATATT CACCAGAAAA ATCGCATATT CATATAACCA GTTTTCTTGA TACCAACGGC TATCTTAATA TTGATGTCGC CAGTCCTGGA ACGAAAATTC ATGAGCCTGA AAAACTCTTC CGTAGATTTT GGCGGGGAGA TAATTCGCGT CATTCCGTAG GTCAGGGACT AGGCCTTTCT TTAGTCAAAG CGATTGCCGA ATTACATGGG GGAAGTGCTA CGTATCACTA TCTCAATAAG CATAATGTGT TCCGGATTAC GTTACCGCAA AGAAATTAA
|
Protein sequence | MKRLSITVRL TLLFILLLSV AGAGIVWTLY NGLASELKWR DDTTLINRTA QIKQLLIDGV NPDTLPVYFN RMMDVSQDIL IIHGDGINKI VNRTNVSDDM LNNIPASETI SAAGIYRSII NDTEIDALRI NIDEVSPSLT VTVAKLASAR HNMLEQYKIN SIIICIVAIV LCSVLSPLLI RTGLREIKKL SGVTEALNYN DSREPVEVSA LPRELKPLGQ ALNKMHQALV KDFERLSQFA DDLAHELRTP INALLGQNQV TLSQTRSIAE YQKTIAGNIE ELENISRLTE NILFLARADK NNVLVKLDSL SLNKEVENLL DYLEYLSDEK EICFKVECNQ QIFADKILLQ RMLSNLIVNA IRYSPEKSHI HITSFLDTNG YLNIDVASPG TKIHEPEKLF RRFWRGDNSR HSVGQGLGLS LVKAIAELHG GSATYHYLNK HNVFRITLPQ RN
|
| |