Gene EcSMS35_2368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2368 
SymbolatoS 
ID6144061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2403640 
End bp2405466 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content49% 
IMG OID641617241 
Productsensory histidine kinase AtoS 
Protein accessionYP_001744413 
Protein GI170680522 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0221711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTATA TGAAGTGGAT TTATCCACGC CGCTTACGCA ATCAAATGAT CCTGATGGCA 
ATCCTGATGG TCATTGTCCC AACGCTTACT ATTGGTTATA TCGTAGAAAC GGAAGGACGT
TCAGCAGTCT TATCTGAAAA AGAGAAAAAA CTTTCTGCCG TGGTCAACCT GCTTAATCAG
GCACTAGGCG ATCGCTATGA TCTCTACATC GACTTACCAC GTGAGGAGCG TATCCGCGCA
TTAAATGCAG AACTTGCCCC CATTACCGAA AATATCACTC ACGCCTTCCC TGGCATCGGC
GCTGGTTATT ACAACAAAAC GCTGGATGCG ATAATCACCT ACGCGCCTTC AGCGCTATAT
CAGAATAATG TCGGCGTTAC CATTGCCGCA GATCACCCTG GTCGCGAAGT CATGCGTACA
AATACCCCTT TGGTTTATTC AGGCAGGCAG GTGCGCGGCG ATATTTTGAA TTCAATGATT
CCCATTGAGC GTAATGGTGA AATCCTCGGC TATATCTGGG CCAATGAATT AACCGAAGAT
ATTCGCCGCC AGGCCTGGAA AATGGATGTG AGGATTATCA TTGTGCTCAC CGCCGGTTTG
CTGATAAGCC TGCTGTTGAT TGTCCTTTTC TCCCGTCGCC TGAGCGCCAA TATAGATATC
ATCACCGATG GCCTCTCGAC TCTGGCACAA AATATTCCCA CTCGATTACC ACAATTGCCC
GGTGAAATGG GGCAAATCAG TCAGAGTGTT AACAACCTCG CCCAGGCACT GCGTGAAACG
CGGACACTTA ACGATCTGAT TATTGAAAAC GCTGCCGATG GCGTCATTGC CATTGACCGC
CAGGGTGATG TAACCACCAT GAACCCGGCA GCAGAAGTCA TCACTGGCTA TCAACGTCAT
GAACTGGTAG GGCAGCCTTA CTCCATGTTG TTCGACAATA CTCAGTTCTA CAGTCCAGTA
CTGGATACGC TGGAACATGG CACCGAACAT GTGGCGCTGG AGATCAGTTT TCCAGGCCGT
GACCGCACCA TTGAACTCAG TGTCACGACC AGTCGTATTC ATAACACGCA CGGTGAAATG
ATAGGTGCTT TGGTGATTTT CTCTGATTTA ACTGCCCGCA AAGAAACCCA GCGCCGCATG
GCGCAAGCAG AACGCCTCGC CACACTGGGT GAGCTGATGG CTGGCGTGGC GCATGAAGTA
CGTAATCCGC TAACGGCTAT TCGTGGTTAT GTACAGATCT TGCGCCAACA AACCAGCGAC
CTAATACATC AGGAATATCT GTCCGTAGTA CTCAAAGAAA TTGATTCAAT TAACAAAGTT
ATTCAGCAAT TGCTCGAATT TTCACGTCCA CGCCACAGTC AATGGCAACA AGTCAGCCTC
AATGCATTGG TTGAAGAAAC TCTGGTACTG GTACAAACCG CCGGCGTACA AGCGCGGGTC
GACTTCATAA GCGAACTGGA TAATGAATTA AGCCCGATTA ACGCCGATCG TGAACTGCTC
AAACAGGTAC TACTGAATAT CCTGATCAAT GCCGTCCAGG CTATCAGTGC ACGAGGGAAA
ATTCGCATTC GAACCTGGCA ATACAGCGAC TCACAACAGG CCATTTCGAT AGAGGACAAC
GGCTGTGGCA TTGATCTCTC GCTGCAAAAA AAGATCTTCG ATCCCTTTTT CACCACCAAA
GCCTCAGGAA CCGGGCTTGG TCTGGCGTTA AGTCAACGCA TCATTAATGC CCATCAGGGT
GATATTCGCG TCGCCAGTTT GCCGGGCTAC GGCGCAACCT TCACGCTTAT TTTACCGATC
AACCCGCAGG GAAATCAGAC TGTATGA
 
Protein sequence
MHYMKWIYPR RLRNQMILMA ILMVIVPTLT IGYIVETEGR SAVLSEKEKK LSAVVNLLNQ 
ALGDRYDLYI DLPREERIRA LNAELAPITE NITHAFPGIG AGYYNKTLDA IITYAPSALY
QNNVGVTIAA DHPGREVMRT NTPLVYSGRQ VRGDILNSMI PIERNGEILG YIWANELTED
IRRQAWKMDV RIIIVLTAGL LISLLLIVLF SRRLSANIDI ITDGLSTLAQ NIPTRLPQLP
GEMGQISQSV NNLAQALRET RTLNDLIIEN AADGVIAIDR QGDVTTMNPA AEVITGYQRH
ELVGQPYSML FDNTQFYSPV LDTLEHGTEH VALEISFPGR DRTIELSVTT SRIHNTHGEM
IGALVIFSDL TARKETQRRM AQAERLATLG ELMAGVAHEV RNPLTAIRGY VQILRQQTSD
LIHQEYLSVV LKEIDSINKV IQQLLEFSRP RHSQWQQVSL NALVEETLVL VQTAGVQARV
DFISELDNEL SPINADRELL KQVLLNILIN AVQAISARGK IRIRTWQYSD SQQAISIEDN
GCGIDLSLQK KIFDPFFTTK ASGTGLGLAL SQRIINAHQG DIRVASLPGY GATFTLILPI
NPQGNQTV