Gene EcHS_A2251 details


Gene Information

Locus tag: EcHS_A2251
Symbol:
ID: 5591477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2242627
End bp: 2246259
Gene Length: 3633 bp
Protein Length: 1210 aa
Translation table: 11
GC content: 51%
IMG OID: 640921381
Product: molybdate metabolism regulator MolR-like protein
Protein accession: YP_001458917
Protein GI: 157161599
COG category:
COG ID:
TIGRFAM ID:
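
The start, end, gene length and protein length fields above can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2242627
end_bp = 2246259


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Number of codons minus one, assuming the CDS ends in a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3633 1210
```

Under these assumptions the computed values agree with the listed Gene Length (3633 bp) and Protein Length (1210 aa).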


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAGG AATTACCGTG GCTGGCGGAT AACGCCCAAC TGGAACTGAA ATATAAAAAA 
GGCAAAACGC CGCTCAGTCA TCGTCGCTGG CCGGGCGAAC CAGTGTCCGT TATCACTGGA
AGTCTCATCC AGACATTGGG TGATGAATTG CTACAAAAAG CTGAGAAGAA AAAAAACATT
GTCTGGCGTT ATGAGAATTT TTCACTGGAG TGGCAGTCCG CCATCACGCA GGCCATCAAC
TTGATCGGCG AACACAAACC CTCAATCCCG GCCCGGACAA TGGCGGCGCT AGCCTGTATC
GCGCAAAATG ACAGCCAACA GTTGCTCGAC GAAATCGTCC AACAAGAGGG GCTGGAATAT
GCGACTGAGG TGGTGATTGC ACGCCAGTTT ATTGCGCGGT GTTATGAGAG TGATCCTCTG
GTAGTGACAT TGCAGTATCA GGACGAGGAT TATGGCTATG GTTATCGCTC AGAAACCTAT
AACGAATTCG ATCTCCGACT GCGTAAGCAT CTCTCTCTGG CAGAGGAAAG CTGCTGGCAG
CGTTGCGCCG ACAAACTCAT TGCCGCACTA CCAGGAATAA CCAAAGTTCG CCGCCCTTTT
ATTGCGCTGA TCCTCCCGGA AAAACCAGAA ATAGCCAATG AGTTGGTAGG CCTTGAATGC
CCGCGAACTC ATTTTCATTC TAAGGAGTGG TTAAAAGTTG TTGCTAATGA CCCCACAGCG
GTGAGAAAAC TCGAACACTA CTGGAGCCAG GATATATTTA GCGATCGAGA AGCCAGCTAC
ATGTCGCATG AAAACCACTT CGGCTACGCG GCCTGCGCCG CCCTTTTGCG CGAACAAGGA
CTGGCAGCCA TTCCGCGCCT CGCGATGTAT GCCCATAAAG AAGATTGCGG CAGTCTGCTG
GTACAAATTA ACCATCCGCA AGTCATCCGC ACCTTGCTAC TGGTGGCTGA TAAAAACAAA
CCCAGCCTGC AACGTGTAGC TAAATACCAT AAAAACTTCC CCCATGCGAC GCTCGCCGCA
CTGGCAGAAC TGCTGGCGTT AACAGAACCA CCAGCCCGCC CTGGTTATCC AATCATCGAA
GACAAAAAGC TGCCTGCACA GCAAAAAGCA CGCGATGAAT ACTGGCGTAC GCTGTTACAA
ACGCTGATGG CATCGCAGCC ACAACTGGCA GCAGAAGTGA TGCCGTGGTT AAGTACTCAA
CCCCAGTCAG TGCTGAAGAG TTATTTATCG GCACCGCCCA AACCGGTTAT TGATGGCACC
GATAACAGCA ATCTGCCAGA AATCCTCGTT TCACCACCGT GGCGTAGTAA GAAAAAAATG
ACAGCTCCAC GTCTTGATTT GGCACCGCTC GAATTAACTC CGCAAGTTTA CTGGCAACCA
GGCGAACAAG AGAGGCTTGC CGCCACTGAG CCTGCCCGTT ATTTCAGCAC GGAATCTCTT
GCGCAACGCA TGGAACAAAA AAGTGGACGA GTTGTATTAC AGGAACTGGG TTTTGGGGAT
GATGTATGGC TGTTTCTGAA TTATATACTC CCCGGAAAAC TGGATGCTGC ACGCAATTCA
CTCTTTGTTC AGTGGCATTA CTACCAGGGG CGGGTTGAAG AGATCCTGAA TGGCTGGAAC
TCCCCGGAAG CACAATTAGC AGAACAGGCG CTCCGCAGCG GTCACATAGA AGCGTTAATT
AACATATGGG AAAATGACAA CTACTCACAT TATCGTCCGG AAAAGAGTGT CTGGAACCTG
TATTTATTGG CACAGTTGCC GCGTGAGATG GCTTTGACCT TCTGGCTGCG TGTCAATGAG
AAAAAGCATC TGTTCGCGGG TGAGGACTAT TTTCTCAGTA TCCTCGGATT GGATGCGCTA
CCAGGTCTGC TGTTGGCTTT TTCACATCGT CCAAAAGAAA CATTTCCGTT AATTTTAAAT
TTTGGCGCAA CAGAACTGGC GCTGCCTGTT GCCAACGTCT GGCGACGTTT CGCGGCGCAG
CGTGATCTGG CTCGCCAGTG GATTTTACAA TGGCCGGAAC ATACGGCTAG TGCACTTATC
CCTCTTGTCT TTACCAAACC CAGCGATAAT AGCGAAGCCG CATTACTTGC CCTGCGTTTA
CTGTACGAAC AGGGACATGG CGAATTGCTA CAAACCGTGG CAAACCGCTG GCAGCGTACA
GATGTATGGT CTGCCCTGGA GCAGTTGCTT AAACAGGGTC CAATGGACAT TTACCCGGCA
CGCATTCCAA AAGCCCCTGA TTTCTGGCAT CCGGCAATGT GGTCCAGGCC TCGACTTATC
ACTAATAATC AGCCTGTTAC CGGTGACGCT CTGGAAATTA TCGGCGAAAT GCTGCGCTTT
ACCCAGGGGG GACGTTTTTA TAGCGGGCTG GAACAACTGA AAACGTTCTG CCAGCCACAA
ACGCTGGCAG CTTTTGCCTG GGATCTCTTC ACTGCGTGGC AACAAGCTGG TGCCCCCGCA
AAAGACAACT GGGCATTTCT GGCGTTAAGT CTCTTTGGTG ACGAAAGCAC GGCACGGGAT
CTGACGACAC AGATCCTCGC CTGGCCACAA GAAGGCAAAT CTGCCCGTGC TGTCAGCGGC
CTGAACATCC TTACCCTGAT GAATAATGAT ATGGCGCTGA TACAGCTGCA TCATATATCG
CAACGGGCTA AATCCCGCCC CTTACGTGAT AACGCGGCGG AATTTCTTCA GGTGGTCGCA
GAAAATCGCG GGCTAAGCCA GGAAGAGCTA GCGGACAGAT TAGTCCCAAC CCTGGGCCTT
GATGATCCGC AGGCGTTGAG TTTTGATTTT GGTCCCCGGC AGTTTACCGT TCGCTTCGAT
GAAAACCTCA ACCCGGTTAT CTTTGATCAG CAAAACGTTC GCCAGAAAAG CGTTCCCCGT
TTGCGCGCCG ATGACGATCA ACTGAAAGCG CACGAGGCAC TGGCCCGACT AAAAGGGCTA
AAAAAAGATG CTACTCAGGT GAGCAAAAAC CTGCTCCCGC GTCTTGAAGC TGCCCTACGT
ACCACCCGAC GCTGGTCGCT GGCAGATTTT CATTCTCTGT TTGTTAATCA TCCCTTTACC
CGTCTGGTTA CCCAGCGATT AATATGGGGG GGTTATCCGG CAAATGAACC GCGTCGTTTA
CTCAACGCCT TTCGTGTGGC CGCAGAGGGG GAGTTCTGCA ATGCGCAAGA TGAGCCAATT
GACCTGCCTG CGGACGCTCT GATTGGCATT GCCCACCCGT TAGAAATGGC AGTAGAAATG
CGCAGTGAAT TTGCACAGCT TTTTGCCGAT TACGAAATTA TGCCGCCTTT TCGCCAGTTG
TCGCGCCGCA CGGTGCTGCT CACACCTGAC GAGTCAACCA GTAACAGCCT GACTCGCTGG
GAAGGTAAAT CCGCTACCGT TGGGCAACTT ATGGGAATGC GATACAAAGG CTGGGAGTCA
GGCTATGAGG ACACATTTGT CTATGACCTG GGCGAGTACC GGCTGGTCCT TAAGTTTTCA
CCCGGTTTTA ACCACTACAA TGTTGATAGC AAAGCGTTAA TGAGCTGCCG TTCTCTTCGA
GTGTACCGTG ACAATAAATC CGTCACTTTT GCCGAACTTG ATGTGTTTGA TTTGAGTGAG
GCGTTAAGCG CACCTGACGT CATTTTCCAT TAA
 
Protein sequence
MDKELPWLAD NAQLELKYKK GKTPLSHRRW PGEPVSVITG SLIQTLGDEL LQKAEKKKNI 
VWRYENFSLE WQSAITQAIN LIGEHKPSIP ARTMAALACI AQNDSQQLLD EIVQQEGLEY
ATEVVIARQF IARCYESDPL VVTLQYQDED YGYGYRSETY NEFDLRLRKH LSLAEESCWQ
RCADKLIAAL PGITKVRRPF IALILPEKPE IANELVGLEC PRTHFHSKEW LKVVANDPTA
VRKLEHYWSQ DIFSDREASY MSHENHFGYA ACAALLREQG LAAIPRLAMY AHKEDCGSLL
VQINHPQVIR TLLLVADKNK PSLQRVAKYH KNFPHATLAA LAELLALTEP PARPGYPIIE
DKKLPAQQKA RDEYWRTLLQ TLMASQPQLA AEVMPWLSTQ PQSVLKSYLS APPKPVIDGT
DNSNLPEILV SPPWRSKKKM TAPRLDLAPL ELTPQVYWQP GEQERLAATE PARYFSTESL
AQRMEQKSGR VVLQELGFGD DVWLFLNYIL PGKLDAARNS LFVQWHYYQG RVEEILNGWN
SPEAQLAEQA LRSGHIEALI NIWENDNYSH YRPEKSVWNL YLLAQLPREM ALTFWLRVNE
KKHLFAGEDY FLSILGLDAL PGLLLAFSHR PKETFPLILN FGATELALPV ANVWRRFAAQ
RDLARQWILQ WPEHTASALI PLVFTKPSDN SEAALLALRL LYEQGHGELL QTVANRWQRT
DVWSALEQLL KQGPMDIYPA RIPKAPDFWH PAMWSRPRLI TNNQPVTGDA LEIIGEMLRF
TQGGRFYSGL EQLKTFCQPQ TLAAFAWDLF TAWQQAGAPA KDNWAFLALS LFGDESTARD
LTTQILAWPQ EGKSARAVSG LNILTLMNND MALIQLHHIS QRAKSRPLRD NAAEFLQVVA
ENRGLSQEEL ADRLVPTLGL DDPQALSFDF GPRQFTVRFD ENLNPVIFDQ QNVRQKSVPR
LRADDDQLKA HEALARLKGL KKDATQVSKN LLPRLEAALR TTRRWSLADF HSLFVNHPFT
RLVTQRLIWG GYPANEPRRL LNAFRVAAEG EFCNAQDEPI DLPADALIGI AHPLEMAVEM
RSEFAQLFAD YEIMPPFRQL SRRTVLLTPD ESTSNSLTRW EGKSATVGQL MGMRYKGWES
GYEDTFVYDL GEYRLVLKFS PGFNHYNVDS KALMSCRSLR VYRDNKSVTF AELDVFDLSE
ALSAPDVIFH
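
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a minimal translator, not the tool used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons). The helper name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGACAAGG AATTACCGTG GCTGGCGGAT"))  # MDKELPWLAD
# Last line of the gene sequence, which ends in the TAA stop codon.
print(translate("GCGTTAAGCG CACCTGACGT CATTTTCCAT TAA"))  # ALSAPDVIFH
```

Both outputs match the first and last ten residues of the protein sequence listed above.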