Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2405 |
Symbol | molR |
ID | 5590235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2377975 |
End bp | 2381769 |
Gene Length | 3795 bp |
Protein Length | 1264 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640926068 |
Product | molybdate metabolism regulator |
Protein accession | YP_001463463 |
Protein GI | 157159379 |
COG category | [S] Function unknown |
COG ID | [COG3831] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACACT TTATCTATCA GGGCGAAAAA TCACATAAAT TCTGGGCAGT TGAGCAACAG GGAAACGAGT TGCATATCAG TTGGGGAAAA GTTGGCACCA AAGGGCAAAG CCAGATAAAA AGTTTTTCAG ATGCTGCGGC AGTGGCAAAA GCGGAGCTTA AGCTGATTGC GGAGAAGGTG AAGAAGGGGT ATGTGGAGCA AGCGAAGGAT AATTCTTTGC AACCTTCCCA AACGGTAACG GGCTCTCTCA AGGTAGCGGA CTTATTCACC ATTATTCAGG AACAACCCTC TTTCGTAGCA GAAACCCGTG CGCCTGACAA AAATACAGAT GCTGTTTTAC CGTGGCTGGC AAAAGATATT GCTGTCGTTT TTCCGCCCGA AGTTGTACAC ACCACGTTAA GTCATCGCCG CTTTCCCGGA GTTCCTGTTC AGCAAGCAGA CAAATTGACC CAATTACGTC GCTTAACCTG TAGTGTGTCG CAACGGGATA ATAAAACAGC CACATTTGAC TTCAGCGCCT GTTCTTTAGA ATGGCAAAAC ACCGTCGCCC AGGCGATCAG TCAGATCGAC GGCCTGAAAA CAACACAGCT ACCATCACCA GTAATGGCTG TACTCACGGC ACTTGAAATG AAATGCACAA GATATAAAGT GCGTGAGGAT GTTATGGATC AGATCGTCCA GGAAGGCGGT CTGGAATATG CTACTGATGT AATAATACAC CTTCAACAGA TTGATATTAA ATGGGATTAT GCGAATAATG TCATTATTAT TCTGCCGTCT GGCATTGCAC CTGACTACTT GGAGCAATAT TCCAGATTTG AATTACGCCT ACGTAAACAT TTATCACTGG CGGAAGAGTC TCTCTGGCAA AAATGTACAC AAAAACTTAT TGCCGCAATT CCACATATTC CAGAATGGCG GCAACCATTA ATTGCTTTGT TATTACCCGA AAAACCAGAA ATTGCACATG AAATTGCCCA GCGTCTACTG GGGCAAAAAA AATTACCCTC GCTTGAGTGG TTAAAAATAG TGGCGACTGA TGAGCACATT CTTGCCTCAT TAGAAAAATA TCACGAGCCA TATGCCATTT TTGATGATTA CTATTGTGGT GCGATATGGT CAGCCACCGT ATTACAGGAG CAAGGTGTTG CAGCCCTGCC CCGATTTGCT CCCTATGCCG CAAGTGACTA CTGCGCCGAT GTGTTGCGTC ATATCAATCA TCCGTTCGCA TTGACACTGC TTATACGTGT AGCCGGGCAA ACTAAACGCT GTCACGATCG GATGACGAAA GCCATTGCTG CGTTCCCACA TGCAGCAATG GCGGCACTGA CGGAACTTCT TGGGCAAAAA GAAGAGAACA GTTGGCGCAT TATGCTAATG ACAATGCTTA TCTCACAACC AGCACTGGCA GAACAGGTCA TTCCCTGGCT CTCGACACCC GCAGTTGCCG TACTGAAATC ATGCCAGCAA CAACTGACAC AGCCCTCAAA CCATGCCTGC GCCGATCTAC TGCCTGCCAT AGTAGTCTCC CCACCATGGC TTTCGAAAAA GAAAAAAGCG ACGATTCCGG TGCTGGAGTT AGCGCCATTA GGCATTGAGC CAATCTGTTA TCTGACAGAA GAAATCAGTA ATCAACTTTT GGCGAAATAT ATCTGGTATT CAAAACACAT CACGGTTAGC CATGAAGAAA GTACTGCCAA CCTGTTGGCA AGAATGGGTT TTCAACGACG GATCGCTGGT ACATATATTA AAGCTCCCGA AGCGGTAGTT GAGGCATGGC TAAATGAAGA TTATTCAACC TTACTAAGTG AATTTAAGGT GTTTCATTCA CCTACCGGGC ATTATTGGCA GTTGGGGATT TTGACAACAT TGCCGCTGGA GAAAGCAGTA AAAGCATGGA ATGCCCTTAC CCTATCTCCA CATACCGATA CCGAATACGC CATGTTACAT TTTGGACTCA AAGGGTTACC TGGGTTAGTA AACTCACTTG CACGCTATCC ACAGGAAGCC TTGCCCATCA CGAATTACTT CGCAGCGAGT GAGCTGGCAC CTGCCGTCGC CCGCGCCTTC AACAAACTCA AAACTCTACG ACAAGACGCC CGTAGCTGGC TGCTGAAATA CCCGGAACAC GCCATTACCG GCCTGCTGCC TGCTGCGCTC GGCAAAGCCG ATGAAGCTCA GGATAACGCC CGCGCTGCCT TGCGTATGCT TAGCGAAAAC GGTCATCAGC CATTACTGCA AGAAATCGCC CGGCGTTATA ACCAGCCGGA AGTAACCGAC GCGGTGAACG CTCTGCTTGC GCTCGATCCC TTAGATAATC ACCCGACAAA AATCCCCACT CTTCCGGCCT TTTATCAGCC ATCGCTCTGG ACGCGCCCGG TATTAAAAGC AAACGCCCAA TCACTGCCAG ATAGCGCCCT CCTCCACCTC GGCGAAATGC TCCGCTTCCC TCAGGAAGAG GCTCTGTATC CGGGATTATT GCAGGTGAAA GATGCCTGTA CCGCCGACTC ACTGGCTGAA TTTGCCTGGG ATCTGTTTAC CGCCTGGCAG ACCGCTGGCG CGCCGTCGAG AGAGAGTTGG GCGTTCACTG CGTTAGGCGT TCTCGGTAAC GATGACACCG CCCGCAAACT GACGCCATTA ATCCGCGCCT GGCCTGGCGA ATCTCAGCAT AAACGCGCCA CCGTTGGGTT GGATATTCTC GCCGCTATCG GTAGTGATAT AGCCCTGATG CAGCTTAACG GCATTGCCCA GAAACTGAAA TTCAAAGCAT TACAGGAGCG GGCGAAAGAG AAAATTGCCG ACATTGCCGA AAGTCGCGAA CTGACAGTGG CGGAGCTTGA AGATCGGTTA GCACCAGATC TCGGTCTGGA TGATAACGGT TCGTTGCTGC TGGATTTCGG CCCACGCCAG TTCACCGTCA GCTTTGATGA AACCTTAAAA CCGTTTGTGC GTGACGCTTC CGGCAGCCGT CTGAAAGACC TGCCCAAACC AAACAAAAGC GATGATGAAT CACAGGCGAA CGATGCTGTT AACCGCTACA AACTGCTGAA AAAAGATGCG CGTACCGTCG CCGCCCAGCA GGTAGCAAGG CTGGAATCCG CCATGTGCCT GCGTCGCCGC TGGTCGCCAG AAAACTTTCA GCTCTTCCTG GTTGAACATC CACTGGTACG CCACTTAACC CGCCGTCTGA TTTGGGGCGT TTATAGCACC GAAAACCAGC TACTGACGTG CTTCCGCGTG GCGGAAGATA ACAGCTACAG CACCGCTGAC GATGATCTTT TCACCCTGCC GGAAGGCGAT ATCTCTGTTG GCATTCCTCA CGTTCTGGAA ATATCACCGA CGGATGCTGC CGCCTTTGGT CAGCTTTTTG CCGACTACGA ACTGCTACCA CCGTTCCGTC AGCTTGACCG TAACAGTTAC GCCCTGACAG AAGCCGAGCG CAACGCCAGC GAACTGACCC GCTGGGCAGG CAGAAAATGC CCGAGCGGTC GGGTCATGGG GCTGGCGAAT AAAGGCTGGA TAAAGGGCAC CCCGCAGGAT GGAGGCTGGA TCGGCTGGAT GATCAAACCT CTCGGTCGCT GGTCGTTAAT CATGGAAATC GACGAAGGTT TTGCGGTAGG CATGTCGCCA GCCGAACTGA GTGCCGAGCA GCTCTTAAGC AAGCTGTGGC TATGGGAAGG CAAAGCAGAA AGCTATGGCT GGGGGAGTAA TTCAACACAG GAAGCGCAGT TCTCCGTACT CGATGCCATC ACCGCCAGCG AGCTAATTAA CGATATTGAA GCCCTGTTTG AATAA
|
Protein sequence | MRHFIYQGEK SHKFWAVEQQ GNELHISWGK VGTKGQSQIK SFSDAAAVAK AELKLIAEKV KKGYVEQAKD NSLQPSQTVT GSLKVADLFT IIQEQPSFVA ETRAPDKNTD AVLPWLAKDI AVVFPPEVVH TTLSHRRFPG VPVQQADKLT QLRRLTCSVS QRDNKTATFD FSACSLEWQN TVAQAISQID GLKTTQLPSP VMAVLTALEM KCTRYKVRED VMDQIVQEGG LEYATDVIIH LQQIDIKWDY ANNVIIILPS GIAPDYLEQY SRFELRLRKH LSLAEESLWQ KCTQKLIAAI PHIPEWRQPL IALLLPEKPE IAHEIAQRLL GQKKLPSLEW LKIVATDEHI LASLEKYHEP YAIFDDYYCG AIWSATVLQE QGVAALPRFA PYAASDYCAD VLRHINHPFA LTLLIRVAGQ TKRCHDRMTK AIAAFPHAAM AALTELLGQK EENSWRIMLM TMLISQPALA EQVIPWLSTP AVAVLKSCQQ QLTQPSNHAC ADLLPAIVVS PPWLSKKKKA TIPVLELAPL GIEPICYLTE EISNQLLAKY IWYSKHITVS HEESTANLLA RMGFQRRIAG TYIKAPEAVV EAWLNEDYST LLSEFKVFHS PTGHYWQLGI LTTLPLEKAV KAWNALTLSP HTDTEYAMLH FGLKGLPGLV NSLARYPQEA LPITNYFAAS ELAPAVARAF NKLKTLRQDA RSWLLKYPEH AITGLLPAAL GKADEAQDNA RAALRMLSEN GHQPLLQEIA RRYNQPEVTD AVNALLALDP LDNHPTKIPT LPAFYQPSLW TRPVLKANAQ SLPDSALLHL GEMLRFPQEE ALYPGLLQVK DACTADSLAE FAWDLFTAWQ TAGAPSRESW AFTALGVLGN DDTARKLTPL IRAWPGESQH KRATVGLDIL AAIGSDIALM QLNGIAQKLK FKALQERAKE KIADIAESRE LTVAELEDRL APDLGLDDNG SLLLDFGPRQ FTVSFDETLK PFVRDASGSR LKDLPKPNKS DDESQANDAV NRYKLLKKDA RTVAAQQVAR LESAMCLRRR WSPENFQLFL VEHPLVRHLT RRLIWGVYST ENQLLTCFRV AEDNSYSTAD DDLFTLPEGD ISVGIPHVLE ISPTDAAAFG QLFADYELLP PFRQLDRNSY ALTEAERNAS ELTRWAGRKC PSGRVMGLAN KGWIKGTPQD GGWIGWMIKP LGRWSLIMEI DEGFAVGMSP AELSAEQLLS KLWLWEGKAE SYGWGSNSTQ EAQFSVLDAI TASELINDIE ALFE
|
| |