Gene EcSMS35_0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0638 
SymboldpiB 
ID6145500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp651634 
End bp653292 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content50% 
IMG OID641615530 
Productsensor histidine kinase DpiB 
Protein accessionYP_001742736 
Protein GI170681994 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.736103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCAGC TTAACGAGAA TAAACAGTTT GCATTTTTCC AAAGACTGGC ATTTCCGCTG 
CGTATCTTTT TGCTGATTCT GGTGTTCTCA ATATTTGTCA TTGCAGCCCT GGCGCAATAT
TTTACAGCCA GTTTTGAGGA CTATTTAACG CTTCATGTGC GCGACATGGC AATGAATCAG
GCAAAAATTA TTGCTTCTAA TGACAGTATC ATCAGCGCAG TGAAAACGCG TGACTACAAG
AGGCTGGCGA CCATCGCTGA CAAATTACAA AGAGATACTG ATTTCGATTA TGTGGTGATT
GGCGATCGGC ACTCGATTCG TCTTTACCAT CCTAATCCGG AGAAAATTGG TTATCCTATG
CAGTTCACCA AACCGGGCGC GCTGGAGAAA GGGGAGAGCT ACTTCATCAC CGGGAAAGGG
TCAATTGGCA TGGCGATGCG TGCCAAAACG CCAATCTTTG ATGACGATGG AAAAGTCATC
GGCGTGGTGT CGATTGGCTA CCTGGTGAGT AAAATCGATA GCTGGCGGGC TGAGTTTTTA
TTACCGATGG CTGGCGTGTT TGTCGTGCTG TTAGGGATTC TGATGTTGCT ATCGTGGTTC
CTGGCCGCGC ATATCCGTCG GCAGATGATG GGCATGGAGC CAAAGCAAAT CGCACGCGTG
GTCCGTCAGC AAGAGGCGCT GTTTAGTTCG GTTTATGAAG GGCTCATTGC GGTGGATCCG
CATGGTTACA TTACCGCCAT CAATCGTAAC GCAAGAAAGA TGCTGGGTCT GAGTTCCCCC
GGACGGCAAT GGTTGGGTAA ACCCATTGCT GAAGTGGTCA GGCCCGCCGA TTTCTTTACC
GAACAGATTG ATGAAAAACG TCAGGATGTG GTGGCGAACT TTAACGGTCT GAGCGTTATT
GCCAACCGGG AAGCTATTCG TTCTGGTGAT GATTTGCTGG GGGCCATTAT CAGCTTTCGT
AGTAAAGACG AAATATCCAC CCTCAATGCG CAACTGACGC AAATTAAACA ATACGTCGAG
AGCCTGCGCA CATTGCGACA CGAGCATCTC AATTGGATGT CGACGCTCAA TGGTCTGTTG
CAGATGAAAG AGTATGATCG CGTGCTGGCG ATGGTGCAGG GGGAGTCTCA GGCCCAGCAA
CAGCTTATTG ATAGCCTGCG CGAGGCGTTT GCCGATCGCC AGGTGGCGGG GCTGCTTTTT
GGTAAAGTGC AGCGCGCCCG CGAACTGGGG CTAAAAATGG TCATCGTCCC CGGAAGTCAG
CTTTCGCAAC TGCCGCCAGG ACTGGACAGC ACCGAGTTTG CAGCCATTGT TGGCAATTTA
CTTGATAACG CCTTCGAAGC CAGCCTACGT AGCGATGAAG GAAACAAGAT CGTTGAATTA
TTCCTCAGCG ATGAAGGCGA TGATGTGGTG ATTGAAGTCG CCGATCAGGG CTGCGGCGTT
CCAGAGTCTC TACGAGACAA AATATTTGAG CAGGGTGTCA GTACGCGTGC TGACGAGCCC
GGCGAACATG GCATTGGGTT GTACTTGATT GCCAGCTACG TAACGCGCTG CGGTGGTGTT
ATCACTCTCG AAGATAATGA TCCCTGCGGT ACCTTATTTT CAATCTATAT TCCGAAAGTG
AAACCTAATG ACAGCTCCAT TAACCCTATT GATCGTTGA
 
Protein sequence
MLQLNENKQF AFFQRLAFPL RIFLLILVFS IFVIAALAQY FTASFEDYLT LHVRDMAMNQ 
AKIIASNDSI ISAVKTRDYK RLATIADKLQ RDTDFDYVVI GDRHSIRLYH PNPEKIGYPM
QFTKPGALEK GESYFITGKG SIGMAMRAKT PIFDDDGKVI GVVSIGYLVS KIDSWRAEFL
LPMAGVFVVL LGILMLLSWF LAAHIRRQMM GMEPKQIARV VRQQEALFSS VYEGLIAVDP
HGYITAINRN ARKMLGLSSP GRQWLGKPIA EVVRPADFFT EQIDEKRQDV VANFNGLSVI
ANREAIRSGD DLLGAIISFR SKDEISTLNA QLTQIKQYVE SLRTLRHEHL NWMSTLNGLL
QMKEYDRVLA MVQGESQAQQ QLIDSLREAF ADRQVAGLLF GKVQRARELG LKMVIVPGSQ
LSQLPPGLDS TEFAAIVGNL LDNAFEASLR SDEGNKIVEL FLSDEGDDVV IEVADQGCGV
PESLRDKIFE QGVSTRADEP GEHGIGLYLI ASYVTRCGGV ITLEDNDPCG TLFSIYIPKV
KPNDSSINPI DR