Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2251 |
Symbol | |
ID | 5591477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2242627 |
End bp | 2246259 |
Gene Length | 3633 bp |
Protein Length | 1210 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640921381 |
Product | molybdate metabolism regulator MolR-like protein |
Protein accession | YP_001458917 |
Protein GI | 157161599 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAGG AATTACCGTG GCTGGCGGAT AACGCCCAAC TGGAACTGAA ATATAAAAAA GGCAAAACGC CGCTCAGTCA TCGTCGCTGG CCGGGCGAAC CAGTGTCCGT TATCACTGGA AGTCTCATCC AGACATTGGG TGATGAATTG CTACAAAAAG CTGAGAAGAA AAAAAACATT GTCTGGCGTT ATGAGAATTT TTCACTGGAG TGGCAGTCCG CCATCACGCA GGCCATCAAC TTGATCGGCG AACACAAACC CTCAATCCCG GCCCGGACAA TGGCGGCGCT AGCCTGTATC GCGCAAAATG ACAGCCAACA GTTGCTCGAC GAAATCGTCC AACAAGAGGG GCTGGAATAT GCGACTGAGG TGGTGATTGC ACGCCAGTTT ATTGCGCGGT GTTATGAGAG TGATCCTCTG GTAGTGACAT TGCAGTATCA GGACGAGGAT TATGGCTATG GTTATCGCTC AGAAACCTAT AACGAATTCG ATCTCCGACT GCGTAAGCAT CTCTCTCTGG CAGAGGAAAG CTGCTGGCAG CGTTGCGCCG ACAAACTCAT TGCCGCACTA CCAGGAATAA CCAAAGTTCG CCGCCCTTTT ATTGCGCTGA TCCTCCCGGA AAAACCAGAA ATAGCCAATG AGTTGGTAGG CCTTGAATGC CCGCGAACTC ATTTTCATTC TAAGGAGTGG TTAAAAGTTG TTGCTAATGA CCCCACAGCG GTGAGAAAAC TCGAACACTA CTGGAGCCAG GATATATTTA GCGATCGAGA AGCCAGCTAC ATGTCGCATG AAAACCACTT CGGCTACGCG GCCTGCGCCG CCCTTTTGCG CGAACAAGGA CTGGCAGCCA TTCCGCGCCT CGCGATGTAT GCCCATAAAG AAGATTGCGG CAGTCTGCTG GTACAAATTA ACCATCCGCA AGTCATCCGC ACCTTGCTAC TGGTGGCTGA TAAAAACAAA CCCAGCCTGC AACGTGTAGC TAAATACCAT AAAAACTTCC CCCATGCGAC GCTCGCCGCA CTGGCAGAAC TGCTGGCGTT AACAGAACCA CCAGCCCGCC CTGGTTATCC AATCATCGAA GACAAAAAGC TGCCTGCACA GCAAAAAGCA CGCGATGAAT ACTGGCGTAC GCTGTTACAA ACGCTGATGG CATCGCAGCC ACAACTGGCA GCAGAAGTGA TGCCGTGGTT AAGTACTCAA CCCCAGTCAG TGCTGAAGAG TTATTTATCG GCACCGCCCA AACCGGTTAT TGATGGCACC GATAACAGCA ATCTGCCAGA AATCCTCGTT TCACCACCGT GGCGTAGTAA GAAAAAAATG ACAGCTCCAC GTCTTGATTT GGCACCGCTC GAATTAACTC CGCAAGTTTA CTGGCAACCA GGCGAACAAG AGAGGCTTGC CGCCACTGAG CCTGCCCGTT ATTTCAGCAC GGAATCTCTT GCGCAACGCA TGGAACAAAA AAGTGGACGA GTTGTATTAC AGGAACTGGG TTTTGGGGAT GATGTATGGC TGTTTCTGAA TTATATACTC CCCGGAAAAC TGGATGCTGC ACGCAATTCA CTCTTTGTTC AGTGGCATTA CTACCAGGGG CGGGTTGAAG AGATCCTGAA TGGCTGGAAC TCCCCGGAAG CACAATTAGC AGAACAGGCG CTCCGCAGCG GTCACATAGA AGCGTTAATT AACATATGGG AAAATGACAA CTACTCACAT TATCGTCCGG AAAAGAGTGT CTGGAACCTG TATTTATTGG CACAGTTGCC GCGTGAGATG GCTTTGACCT TCTGGCTGCG TGTCAATGAG AAAAAGCATC TGTTCGCGGG TGAGGACTAT TTTCTCAGTA TCCTCGGATT GGATGCGCTA CCAGGTCTGC TGTTGGCTTT TTCACATCGT CCAAAAGAAA CATTTCCGTT AATTTTAAAT TTTGGCGCAA CAGAACTGGC GCTGCCTGTT GCCAACGTCT GGCGACGTTT CGCGGCGCAG CGTGATCTGG CTCGCCAGTG GATTTTACAA TGGCCGGAAC ATACGGCTAG TGCACTTATC CCTCTTGTCT TTACCAAACC CAGCGATAAT AGCGAAGCCG CATTACTTGC CCTGCGTTTA CTGTACGAAC AGGGACATGG CGAATTGCTA CAAACCGTGG CAAACCGCTG GCAGCGTACA GATGTATGGT CTGCCCTGGA GCAGTTGCTT AAACAGGGTC CAATGGACAT TTACCCGGCA CGCATTCCAA AAGCCCCTGA TTTCTGGCAT CCGGCAATGT GGTCCAGGCC TCGACTTATC ACTAATAATC AGCCTGTTAC CGGTGACGCT CTGGAAATTA TCGGCGAAAT GCTGCGCTTT ACCCAGGGGG GACGTTTTTA TAGCGGGCTG GAACAACTGA AAACGTTCTG CCAGCCACAA ACGCTGGCAG CTTTTGCCTG GGATCTCTTC ACTGCGTGGC AACAAGCTGG TGCCCCCGCA AAAGACAACT GGGCATTTCT GGCGTTAAGT CTCTTTGGTG ACGAAAGCAC GGCACGGGAT CTGACGACAC AGATCCTCGC CTGGCCACAA GAAGGCAAAT CTGCCCGTGC TGTCAGCGGC CTGAACATCC TTACCCTGAT GAATAATGAT ATGGCGCTGA TACAGCTGCA TCATATATCG CAACGGGCTA AATCCCGCCC CTTACGTGAT AACGCGGCGG AATTTCTTCA GGTGGTCGCA GAAAATCGCG GGCTAAGCCA GGAAGAGCTA GCGGACAGAT TAGTCCCAAC CCTGGGCCTT GATGATCCGC AGGCGTTGAG TTTTGATTTT GGTCCCCGGC AGTTTACCGT TCGCTTCGAT GAAAACCTCA ACCCGGTTAT CTTTGATCAG CAAAACGTTC GCCAGAAAAG CGTTCCCCGT TTGCGCGCCG ATGACGATCA ACTGAAAGCG CACGAGGCAC TGGCCCGACT AAAAGGGCTA AAAAAAGATG CTACTCAGGT GAGCAAAAAC CTGCTCCCGC GTCTTGAAGC TGCCCTACGT ACCACCCGAC GCTGGTCGCT GGCAGATTTT CATTCTCTGT TTGTTAATCA TCCCTTTACC CGTCTGGTTA CCCAGCGATT AATATGGGGG GGTTATCCGG CAAATGAACC GCGTCGTTTA CTCAACGCCT TTCGTGTGGC CGCAGAGGGG GAGTTCTGCA ATGCGCAAGA TGAGCCAATT GACCTGCCTG CGGACGCTCT GATTGGCATT GCCCACCCGT TAGAAATGGC AGTAGAAATG CGCAGTGAAT TTGCACAGCT TTTTGCCGAT TACGAAATTA TGCCGCCTTT TCGCCAGTTG TCGCGCCGCA CGGTGCTGCT CACACCTGAC GAGTCAACCA GTAACAGCCT GACTCGCTGG GAAGGTAAAT CCGCTACCGT TGGGCAACTT ATGGGAATGC GATACAAAGG CTGGGAGTCA GGCTATGAGG ACACATTTGT CTATGACCTG GGCGAGTACC GGCTGGTCCT TAAGTTTTCA CCCGGTTTTA ACCACTACAA TGTTGATAGC AAAGCGTTAA TGAGCTGCCG TTCTCTTCGA GTGTACCGTG ACAATAAATC CGTCACTTTT GCCGAACTTG ATGTGTTTGA TTTGAGTGAG GCGTTAAGCG CACCTGACGT CATTTTCCAT TAA
|
Protein sequence | MDKELPWLAD NAQLELKYKK GKTPLSHRRW PGEPVSVITG SLIQTLGDEL LQKAEKKKNI VWRYENFSLE WQSAITQAIN LIGEHKPSIP ARTMAALACI AQNDSQQLLD EIVQQEGLEY ATEVVIARQF IARCYESDPL VVTLQYQDED YGYGYRSETY NEFDLRLRKH LSLAEESCWQ RCADKLIAAL PGITKVRRPF IALILPEKPE IANELVGLEC PRTHFHSKEW LKVVANDPTA VRKLEHYWSQ DIFSDREASY MSHENHFGYA ACAALLREQG LAAIPRLAMY AHKEDCGSLL VQINHPQVIR TLLLVADKNK PSLQRVAKYH KNFPHATLAA LAELLALTEP PARPGYPIIE DKKLPAQQKA RDEYWRTLLQ TLMASQPQLA AEVMPWLSTQ PQSVLKSYLS APPKPVIDGT DNSNLPEILV SPPWRSKKKM TAPRLDLAPL ELTPQVYWQP GEQERLAATE PARYFSTESL AQRMEQKSGR VVLQELGFGD DVWLFLNYIL PGKLDAARNS LFVQWHYYQG RVEEILNGWN SPEAQLAEQA LRSGHIEALI NIWENDNYSH YRPEKSVWNL YLLAQLPREM ALTFWLRVNE KKHLFAGEDY FLSILGLDAL PGLLLAFSHR PKETFPLILN FGATELALPV ANVWRRFAAQ RDLARQWILQ WPEHTASALI PLVFTKPSDN SEAALLALRL LYEQGHGELL QTVANRWQRT DVWSALEQLL KQGPMDIYPA RIPKAPDFWH PAMWSRPRLI TNNQPVTGDA LEIIGEMLRF TQGGRFYSGL EQLKTFCQPQ TLAAFAWDLF TAWQQAGAPA KDNWAFLALS LFGDESTARD LTTQILAWPQ EGKSARAVSG LNILTLMNND MALIQLHHIS QRAKSRPLRD NAAEFLQVVA ENRGLSQEEL ADRLVPTLGL DDPQALSFDF GPRQFTVRFD ENLNPVIFDQ QNVRQKSVPR LRADDDQLKA HEALARLKGL KKDATQVSKN LLPRLEAALR TTRRWSLADF HSLFVNHPFT RLVTQRLIWG GYPANEPRRL LNAFRVAAEG EFCNAQDEPI DLPADALIGI AHPLEMAVEM RSEFAQLFAD YEIMPPFRQL SRRTVLLTPD ESTSNSLTRW EGKSATVGQL MGMRYKGWES GYEDTFVYDL GEYRLVLKFS PGFNHYNVDS KALMSCRSLR VYRDNKSVTF AELDVFDLSE ALSAPDVIFH
|
| |