Gene EcSMS35_2520 details


Gene Information

Locus tag: EcSMS35_2520
Symbol: evgS
ID: 6143045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2575404
End bp: 2578787
Gene Length: 3384 bp
Protein Length: 1127 aa
Translation table: 11
GC content: 41%
IMG OID: 641617392
Product: hybrid sensory histidine kinase in two-component regulatory system with EvgA
Protein accession: YP_001744563
Protein GI: 170683263
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
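
The coordinates and lengths listed above agree with each other, and the check is simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive and that the gene length counts the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2575404, 2578787)
print(length, protein_length(length))
```

With the values from this record, the two functions return 3384 bp and 1127 aa, matching the Gene Length and Protein Length fields.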


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCATACCG ATTCGCAGCA ACGGATTCGT GGTATTAATG CTGATTATTT AAATCTTTTA 
AAAAGAGCGT TAAATATCAA ATTAACACTC CGGGAATACG CAGATCATCA AAAAGCAATG
GACGCGCTGG AAGACGGTGA AGTTGATATA GTGTTATCAC ATTTAGTTGC TTCGCCGCCT
CTTAATGATG ACATTGCTGC AACTAATCCT CTGATAATTA CCTTTCCGGC GCTGGTCACT
ACCCTTCACG ATTCAATGCG ACCGCTTACC TCATCAAAAC CAGTAAATAT TGCTCGAGTA
GCAAATTATC CCCCAGACGA GGTAATTCAT CAATCATTTC CAAAAGCAAC AATTATCTCT
TTTACAAATT TATATCAGGC ATTAGCATCC GTCTCAGCCG GACAGAATGA TTACTTTATT
GGTAGTAACA TCATTACCAG CAGTATGATT TCCCGTTATT TTACTCACTC CTTAAATGTA
GTGAAGTATT ATAACTCACC GCGTCAATAT AACTTTTTAT TAACCAGAAA AGATTCAATC
GTTCTTAATG AAGTACTCAA TCGATTTGTT GATGCTTTAA CAAATGAAGT TCGCTATGAA
GTATCACAAA ATTGGCTTGA TACAGGAAAC CTGGCCTTTC TGAACAAACC ATTAGAACTC
ACTGAACATG AAAAACAGTG GATTAAGCAG CATCCCGATT TAAAGGTGCT GGAAAATCCT
TACTCGCCAC CCTATTCTAT GACAGATGAA ACTGGCTCAG TTCGGGGCGT GATGGGTGAT
ATTCTTAATA TTATTACCTT GCAAACAGGT TTAAATTTTT CTCCGATCAC CGTTTCACAC
AATATTCATG CGGGGACACA GCTTAACCCC GGTGGATGGG ATATATTGCC CGCCGCTATT
TATAGTGAAG ATCGAGAAAA TAATGTTTCA TTTGCTGAAG CCTTCATAAC AACGCCTTAC
GTTTTTGTCA TGCAAAAAGC GCCTGACAGT GAACAAACAT TAAAAAAAGG AATGAAAGTT
GCCATTCCAT ATTATTATGA GCTTCATTCG CAATTAAAAG AGATGTATCC GGAGGTAGAG
TGGATAAAAG TCGATAACGC CAGCGCTGCA TTTCACAAGG TCAAGGAAGG TGAACTTGAT
GCTCTGGTCG CGACACAGTT AAATTCGCGT TACATGATCG ACCATTACTA TCCTAATGAA
CTTTATCATT TTCTTATTCC CGGCGTTCAG AATGCATCAC TTTCGTTCGC TTTTCCTCGC
GGAGAACCGG AACTTAAGGA TATTATTAAT AAAGCACTGA ATGCAATTCC CCCAAGCGAA
GTTCTGCGCC TGACCGAAAA ATGGATTAAA ATGCCCAATG TGACCATTGA TACATGGGAC
CTCTATAGCG AGCAATTTTA TATTGTTACG ACATTATCCG TTTTATTAGT TGGCAGTAGC
CTTTTATGGG GATTCTACCT GTTACGCTCA GTTCGTCGTC GTAAAGTTAT TCAGGGTGAT
TTAGAAAACC AAATATCATT CCGGAAAGCA CTCTCGGACT CCTTACCGAA TCCAACTTAT
GTTGTAAACT GGCAAGGTAA TGTCATTAGT CACAATAGTG CATTTGAACA TTATTTCACT
GCTGATTACT ATAAAAGTGC AATGTTACCA TTAGAAAACA GTGAATCCCC CTTTAAAGAT
GTTTTTTCTA ATACGCATGA AGTCACAGCA GAAACGAAAG AAAACCGAAC AATATACACA
CAGGTATTTG AAATTGATAA TGGCATCGAG AAAAGATGCA TTAATCACTG GCATACCTTA
TGTAATCTGC CAGCAAGCGA ACATGCTGTT TATATTTGTG GTTGGCAAGA TATTACCGAG
ACGCGTGATT TAATTCATGC ACTCGAAGTA GAGAGAAATA AAGCGATCAA TGCAACTGTC
GCAAAAAGCC AGTTTCTGGC AACAATGAGT CACGAAATAA GAACACCAAT AAGCTCCATT
ATGGGCTTCC TGGAACTACT GTCGGGTTCT GGTCTTAGCA AGGAGCAACG GGTGGAGGCG
ATTTCACTTG CCTACGCCAC CGGACAATCA CTCCTCGGCT TAATTGGTGA AATCCTTGAT
GTCGACAAAA TTGAATCGGG TAACTATCAA CTTCAACCAC AATGGGTCGA TATCCCTACT
TTAGTCCAGA ACACTTGTCA CTCTTTCGGT GCGATTGCTG CAAGCAAATC CATCGCATTA
AGTTGCAGCA GTACGTTTCC TGAACGCTAC CTGGTTAAGA TCGACCCTCA GGCGTTTAAG
CAGGTCTTAT CTAATTTACT GAGTAATGCT CTCAAATTTA CTACCGAGGG AGCAGTAAAA
ATTACGACCT CCCTGGGTCA CATTGATGAC AACCACGCTG TTATCAAAAT GACGATTATG
GATTCTGGAA GTGGATTATC GCAGGAAGAA CAACAACAAC TGTTTAAACG CTATAGCCAA
ACAAGTGCAG GTCGTCAGCA AACAGGTTCT GGTTTAGGCT TAATGATCTG CAAAGAATTA
ATAAAAAACA TGCAGGGCGA TTTGTCATTA GAAAGTCATC CAGGCATAGG AACGATATTT
ACGATCACAA TCCCGGTAGA AATTACCCAA CAAGTGGCGG CTATAGAGAC AAAAGCAGAA
CAACCTATCA CACTACCTGA AAAGTTGAGC ATATTAATCG CGGATGATCA TCCGACCAAC
AGGCTATTAC TCAAACGCCA GCTAAATCTA TTAGGATATG ATGTTGATGA AGCCACTGAT
GGTGTGCAAG CGCTACACAA AATCAGTATG CAACATTATG ATCTGCTTAT TACTGACGTA
AATATGCCGA ATATGGATGG TTTTGAGTTG ACTCGCAAAC TCCGTGAGCA AAATTCTTCC
TTACCCATCT GGGGGCTTAC AGCCAACGCA CAGGCTAACG AACGTGAAAA AGGGTTAAAT
TGCGGCATGA ACTTATGTTT GTTCAAACCG TTGACTCTGG ATGTACTGAA AACACATTTA
AGTCAGTTAC ACCAGGTTGC GCATATTGCA CCTCAGTATC GCCACCTTGA TATCGAGGCC
CTGAAGAATA ATACGGCGAA TGATCTACAA CTGATGCAGG AGATTCTCAT GACTTTCCAG
CATGAAACGC ATAAAGATCT ACCCGCTGCG TTTCATGCAC TAGAAGCTGG CGATAACAGA
ACTTTCCATC AGTGTATTCA TCGCATCCAC GGTGCGGCTA ACATCCTGAA TTTGCAAAAG
TTGATTAATA TTAGCCATCA GTTAGAAATA ACACCTGTTT CAGATGACAG TAAGCCTGAA
ATTCTTCAGT TGCTGAACTC TGTAAAAGAA CACATTGCAG AGCTGGACCA GGAGATTGCT
GTTTTCTGTC AGAAAAATGA CTAA
 
Protein sequence
MHTDSQQRIR GINADYLNLL KRALNIKLTL REYADHQKAM DALEDGEVDI VLSHLVASPP 
LNDDIAATNP LIITFPALVT TLHDSMRPLT SSKPVNIARV ANYPPDEVIH QSFPKATIIS
FTNLYQALAS VSAGQNDYFI GSNIITSSMI SRYFTHSLNV VKYYNSPRQY NFLLTRKDSI
VLNEVLNRFV DALTNEVRYE VSQNWLDTGN LAFLNKPLEL TEHEKQWIKQ HPDLKVLENP
YSPPYSMTDE TGSVRGVMGD ILNIITLQTG LNFSPITVSH NIHAGTQLNP GGWDILPAAI
YSEDRENNVS FAEAFITTPY VFVMQKAPDS EQTLKKGMKV AIPYYYELHS QLKEMYPEVE
WIKVDNASAA FHKVKEGELD ALVATQLNSR YMIDHYYPNE LYHFLIPGVQ NASLSFAFPR
GEPELKDIIN KALNAIPPSE VLRLTEKWIK MPNVTIDTWD LYSEQFYIVT TLSVLLVGSS
LLWGFYLLRS VRRRKVIQGD LENQISFRKA LSDSLPNPTY VVNWQGNVIS HNSAFEHYFT
ADYYKSAMLP LENSESPFKD VFSNTHEVTA ETKENRTIYT QVFEIDNGIE KRCINHWHTL
CNLPASEHAV YICGWQDITE TRDLIHALEV ERNKAINATV AKSQFLATMS HEIRTPISSI
MGFLELLSGS GLSKEQRVEA ISLAYATGQS LLGLIGEILD VDKIESGNYQ LQPQWVDIPT
LVQNTCHSFG AIAASKSIAL SCSSTFPERY LVKIDPQAFK QVLSNLLSNA LKFTTEGAVK
ITTSLGHIDD NHAVIKMTIM DSGSGLSQEE QQQLFKRYSQ TSAGRQQTGS GLGLMICKEL
IKNMQGDLSL ESHPGIGTIF TITIPVEITQ QVAAIETKAE QPITLPEKLS ILIADDHPTN
RLLLKRQLNL LGYDVDEATD GVQALHKISM QHYDLLITDV NMPNMDGFEL TRKLREQNSS
LPIWGLTANA QANEREKGLN CGMNLCLFKP LTLDVLKTHL SQLHQVAHIA PQYRHLDIEA
LKNNTANDLQ LMQEILMTFQ HETHKDLPAA FHALEAGDNR TFHQCIHRIH GAANILNLQK
LINISHQLEI TPVSDDSKPE ILQLLNSVKE HIAELDQEIA VFCQKND
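
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation is given below. It assumes the input is a complete, unspliced CDS on the coding strand, and the names `CODON_TABLE`, `START_CODONS` and `translate_cds` are illustrative. Table 11 uses the standard codon assignments but permits alternative start codons, which are read as methionine when they initiate translation. That is why this gene, which begins with TTG, yields a protein beginning with M.

```python
# Standard codon assignments (shared by tables 1 and 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# Initiation codons allowed by table 11; each is translated as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks of the listing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # an alternative start codon still initiates with Met
        else:
            amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGCATACCG ATTCGCAGCA ACGGATTCGT"))
```

Applied to the first 30 bases of the gene, the sketch returns MHTDSQQRIR, the first block of the protein sequence listed above.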