Gene Information |
Locus tag | EcSMS35_2368 |
Symbol | atoS |
ID | 6144061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2403640 |
End bp | 2405466 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617241 |
Product | sensory histidine kinase AtoS |
Protein accession | YP_001744413 |
Protein GI | 170680522 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3852] Signal transduction histidine kinase, nitrogen specific |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
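The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2403640
end_bp = 2405466

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it is excluded
# from the amino-acid count.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1827
print(protein_length)  # 608
```

Both results agree with the Gene Length (1827 bp) and Protein Length (608 aa) fields in the table.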
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0221711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTATA TGAAGTGGAT TTATCCACGC CGCTTACGCA ATCAAATGAT CCTGATGGCA ATCCTGATGG TCATTGTCCC AACGCTTACT ATTGGTTATA TCGTAGAAAC GGAAGGACGT TCAGCAGTCT TATCTGAAAA AGAGAAAAAA CTTTCTGCCG TGGTCAACCT GCTTAATCAG GCACTAGGCG ATCGCTATGA TCTCTACATC GACTTACCAC GTGAGGAGCG TATCCGCGCA TTAAATGCAG AACTTGCCCC CATTACCGAA AATATCACTC ACGCCTTCCC TGGCATCGGC GCTGGTTATT ACAACAAAAC GCTGGATGCG ATAATCACCT ACGCGCCTTC AGCGCTATAT CAGAATAATG TCGGCGTTAC CATTGCCGCA GATCACCCTG GTCGCGAAGT CATGCGTACA AATACCCCTT TGGTTTATTC AGGCAGGCAG GTGCGCGGCG ATATTTTGAA TTCAATGATT CCCATTGAGC GTAATGGTGA AATCCTCGGC TATATCTGGG CCAATGAATT AACCGAAGAT ATTCGCCGCC AGGCCTGGAA AATGGATGTG AGGATTATCA TTGTGCTCAC CGCCGGTTTG CTGATAAGCC TGCTGTTGAT TGTCCTTTTC TCCCGTCGCC TGAGCGCCAA TATAGATATC ATCACCGATG GCCTCTCGAC TCTGGCACAA AATATTCCCA CTCGATTACC ACAATTGCCC GGTGAAATGG GGCAAATCAG TCAGAGTGTT AACAACCTCG CCCAGGCACT GCGTGAAACG CGGACACTTA ACGATCTGAT TATTGAAAAC GCTGCCGATG GCGTCATTGC CATTGACCGC CAGGGTGATG TAACCACCAT GAACCCGGCA GCAGAAGTCA TCACTGGCTA TCAACGTCAT GAACTGGTAG GGCAGCCTTA CTCCATGTTG TTCGACAATA CTCAGTTCTA CAGTCCAGTA CTGGATACGC TGGAACATGG CACCGAACAT GTGGCGCTGG AGATCAGTTT TCCAGGCCGT GACCGCACCA TTGAACTCAG TGTCACGACC AGTCGTATTC ATAACACGCA CGGTGAAATG ATAGGTGCTT TGGTGATTTT CTCTGATTTA ACTGCCCGCA AAGAAACCCA GCGCCGCATG GCGCAAGCAG AACGCCTCGC CACACTGGGT GAGCTGATGG CTGGCGTGGC GCATGAAGTA CGTAATCCGC TAACGGCTAT TCGTGGTTAT GTACAGATCT TGCGCCAACA AACCAGCGAC CTAATACATC AGGAATATCT GTCCGTAGTA CTCAAAGAAA TTGATTCAAT TAACAAAGTT ATTCAGCAAT TGCTCGAATT TTCACGTCCA CGCCACAGTC AATGGCAACA AGTCAGCCTC AATGCATTGG TTGAAGAAAC TCTGGTACTG GTACAAACCG CCGGCGTACA AGCGCGGGTC GACTTCATAA GCGAACTGGA TAATGAATTA AGCCCGATTA ACGCCGATCG TGAACTGCTC AAACAGGTAC TACTGAATAT CCTGATCAAT GCCGTCCAGG CTATCAGTGC ACGAGGGAAA ATTCGCATTC GAACCTGGCA ATACAGCGAC TCACAACAGG CCATTTCGAT AGAGGACAAC GGCTGTGGCA TTGATCTCTC GCTGCAAAAA AAGATCTTCG ATCCCTTTTT CACCACCAAA GCCTCAGGAA CCGGGCTTGG TCTGGCGTTA AGTCAACGCA TCATTAATGC CCATCAGGGT GATATTCGCG TCGCCAGTTT GCCGGGCTAC GGCGCAACCT TCACGCTTAT TTTACCGATC 
AACCCGCAGG GAAATCAGAC TGTATGA
|
Protein sequence | MHYMKWIYPR RLRNQMILMA ILMVIVPTLT IGYIVETEGR SAVLSEKEKK LSAVVNLLNQ ALGDRYDLYI DLPREERIRA LNAELAPITE NITHAFPGIG AGYYNKTLDA IITYAPSALY QNNVGVTIAA DHPGREVMRT NTPLVYSGRQ VRGDILNSMI PIERNGEILG YIWANELTED IRRQAWKMDV RIIIVLTAGL LISLLLIVLF SRRLSANIDI ITDGLSTLAQ NIPTRLPQLP GEMGQISQSV NNLAQALRET RTLNDLIIEN AADGVIAIDR QGDVTTMNPA AEVITGYQRH ELVGQPYSML FDNTQFYSPV LDTLEHGTEH VALEISFPGR DRTIELSVTT SRIHNTHGEM IGALVIFSDL TARKETQRRM AQAERLATLG ELMAGVAHEV RNPLTAIRGY VQILRQQTSD LIHQEYLSVV LKEIDSINKV IQQLLEFSRP RHSQWQQVSL NALVEETLVL VQTAGVQARV DFISELDNEL SPINADRELL KQVLLNILIN AVQAISARGK IRIRTWQYSD SQQAISIEDN GCGIDLSLQK KIFDPFFTTK ASGTGLGLAL SQRIINAHQG DIRVASLPGY GATFTLILPI NPQGNQTV
|
| |
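The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. This is a minimal sketch, assuming the amino-acid assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons, not in codon meanings); the helper name `translate` and the two short fragments are chosen for illustration only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First 21 bases and last 27 bases of the gene sequence in the record.
head = translate("ATGCATTATA TGAAGTGGAT T")
tail = translate("AACCCGCAGG GAAATCAGAC TGTATGA")

print(head)  # MHYMKWI
print(tail)  # NPQGNQTV
```

The two fragments reproduce the beginning (MHYMKWI...) and end (...NPQGNQTV) of the protein sequence listed above, and the final TGA codon is the stop.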