Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1996 |
Symbol | phoQ |
ID | 6145416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2016734 |
End bp | 2018194 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616872 |
Product | sensor protein PhoQ |
Protein accession | YP_001744048 |
Protein GI | 170681883 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.687103 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT TACTGCGTCT TTTTTTCCCG CTCTCGCTGC GGGTACGTTT TCTGTTAGCA ACGGCAGCAG TAGTATTGGT GCTTTCGCTT GCCTACGGAA TGGTTGCGCT GATCGGTTAT AGCGTCAGTT TCGATAAAAC CACGTTTCGG CTGTTACGTG GCGAGAGCAA TCTGTTCTAT ACCCTTGCGA AGTGGGAAAA CAATAAGTTG CATGTCGAGT TACCCGAAAA TATCGACAAG CAAAGCCCCA CCATGACGCT AATTTATGAT GAGAACGGGC AGCTTTTATG GGCGCAACGT GACGTGCCCT GGCTGATGAA GATGATCCAG CCTGACTGGC TGAAATCGAA TGGTTTTCAT GAAATTGAAG CGGATGTTAA CGATACCAGC CTCTTGCTGA GTGGAGATCA TTCGATACAG CAACAGTTGC AGGAAGTGCG GGAAGATGAT GACGACGCGG AGATGACCCA CTCGGTAGCG GTAAACGTCT ACCCGGCAAC ATCGCGGATG CCAAAGTTAA CCATTGTGGT GGTGGATACC ATTCCGGTGG AGCTAAAAAG TTCCTATATG GTCTGGAGCT GGTTTATCTA TGTGCTCTCA GCCAATCTGC TGTTAGTGAT CCCGCTGCTG TGGGTCGCCG CCTGGTGGAG TTTACGCCCC ATCGAAGCCC TGGCAAAAGA AGTCCGCGAA CTGGAAGAAC ATAACCGCGA ATTGCTCAAT CCAGCCACAA CGCGAGAACT GACCAGTCTG GTACGAAACC TGAACCGATT GTTAAAAAGT GAACGCGAAC GTTACGACAA ATATCGTACA ACGCTCACCG ACCTGACCCA TAGTCTGAAA ACGCCACTGG CGGTGCTGCA AAGTACGCTG CGTTCTCTGC GTAGTGAGAA GATGAGCGTC AGTGATGCTG AGCCGGTAAT GCTGGAGCAA ATCAGCCGCA TTTCACAGCA AATTGGCTAC TACCTGCATC GTGCCAGTAT GCGCGGCGGG ACATTGCTTA GCCGCGAGCT GCATCCGGTC GCCCCACTGT TGGACAAGCT CACCTCGGCG CTGAACAAAG TGTATCAACG CAAAGGGGTC AATATCTCTC TCGATATTTC GCCAGAGATC AGCTTTGTTG GTGAGCAGAA CGATTTTGTC GAGGTGATGG GCAATGTGCT GGATAATGCC TGTAAATATT GCCTCGAGTT TGTCGAAATT TCTGCAAGGC AAACCGACGA GCATCTCTAT ATTGTGGTCG AGGATGATGG CCCCGGTATT CCATTAAGCA AGCGAGAGGT CATTTTCGAC CGTGGTCAAC GGGTTGATAC TTTACGCCCT GGGCAAGGTG TAGGGCTGGC GGTAGCCCGC GAAATCACCG AGCAATATGA GGGTAAAATC GTCGCCGGAG AGAGCATGCT GGGCGGTGCG CGGATGGAGG TGATTTTTGG TCGCCAGCAT TCTGCGCCGA AAGATGAATA A
|
Protein sequence | MKKLLRLFFP LSLRVRFLLA TAAVVLVLSL AYGMVALIGY SVSFDKTTFR LLRGESNLFY TLAKWENNKL HVELPENIDK QSPTMTLIYD ENGQLLWAQR DVPWLMKMIQ PDWLKSNGFH EIEADVNDTS LLLSGDHSIQ QQLQEVREDD DDAEMTHSVA VNVYPATSRM PKLTIVVVDT IPVELKSSYM VWSWFIYVLS ANLLLVIPLL WVAAWWSLRP IEALAKEVRE LEEHNRELLN PATTRELTSL VRNLNRLLKS ERERYDKYRT TLTDLTHSLK TPLAVLQSTL RSLRSEKMSV SDAEPVMLEQ ISRISQQIGY YLHRASMRGG TLLSRELHPV APLLDKLTSA LNKVYQRKGV NISLDISPEI SFVGEQNDFV EVMGNVLDNA CKYCLEFVEI SARQTDEHLY IVVEDDGPGI PLSKREVIFD RGQRVDTLRP GQGVGLAVAR EITEQYEGKI VAGESMLGGA RMEVIFGRQH SAPKDE
|
| |