Gene Information |
Locus tag | EcSMS35_0928 |
Symbol | molR |
ID | 6144730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 940755 |
End bp | 944543 |
Gene Length | 3789 bp |
Protein Length | 1262 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641615816 |
Product | molybdate metabolism regulator MolR |
Protein accession | YP_001743008 |
Protein GI | 170680946 |
COG category | [S] Function unknown |
COG ID | [COG3831] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
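The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(940755, 944543)
print(length_bp)                  # 3789, matching "Gene Length"
print(protein_length(length_bp))  # 1262, matching "Protein Length"
```

The strand field does not change this calculation: for a minus-strand gene the start and end coordinates are still given in ascending order on the replicon.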
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACACT TTATCTTTCA GGACGAAAAA TCACATAAAT TCTGGGCGGT GGAGCAGCAG GGAAACGAAC TGCACATCAA CTGGGGTAAA GTCGGCACCA ACGGACAAAG CCAGGTAAAA AGTTTTGCGG ATGCAGCAGC GGCGGAAAAA GCGGGACTGA AGCTGATTGC GGAGAAGGTG AAGAAGGGAT ATGTGGAGCA AGCGAAGGAT AATTCTTTGC AACCTTCCCA AACGGTAACG GACTCTCTCA AGGTAGCGAA CTTATCCACC ATTATTCAGG AACAACCCTC TTTCGTAGCA GAAACCCGTG CGGCTGACAA AAATACAGAT GCTGTTTTAC CGTGGCTGGC GAAAGATATT GCTGTCGTTT TTCCGCCCGA AGTTGTACAC ACCACGTTAA GTCATCGCCG CTTTCCCGGA GTTCCTGTTC AGCAAGCAGA CAAATTGACC CAATTACGTC GCTTAGCCTG TGGCGTGTCG CAACGGGATA ACAAAACAGC TACATTTGAC TTCAGCGCCT GTACTTTAGA GTGGCAAAAC TTCATCGCTC AGGCGATCGA TCTGATTAAC AGCCCGAAAA CGACAAAATT ACCATCACCG GTAATGGCTG TACTCGCGGC ACTTGAACTG CACTGCACGA GATATAAAGA GCGTGAGGAT GTTATGGATC AGATTATTCA GGAAGGCGGT CTGGAATATG CTACTGATGT AATAATACAT CTTCAACAAA TATCTATTGA ATGGGATTAT GTCAACAATA ATATTGTTTT CCTGTCGTCT GGCATTTCAC CTGATTATTT GCAGCAATAT TCCAGCTTTG AATTACGCCT ACGTAAACAT TTATCACTGG CGGAAGAGTC TCTCTGGCAA AAATGTGCAC AAAAACTTAT TGCCGCAGTT CCACATATTC CAGAATGGCG GCAACCATTA ATTGCTTTGT TATTGCCCGA AAAACCAGAA ATTGCACATG AAATTGCCCA GCGTCTACTG GGGCAAAAGA AATTACCCTC GCTGGAGTGG CTAAAAATAG TGGCGACTGA TGAGCACATT CTTGCCTCAT TAGAAAAATA TCACGAGCCA TATGCCATTT TTGATGATTA CTATTGTGGT GCGATATGGT CAGCCACCGT ATTACAGGAG CAAGGTGTTG CGGCCCTGCC CCGATTTGCT CCCTATGTCG CAAGTGACTA CTGCGCCGAT GTGTTGCGTC ATATCAATCA TCCGTTCGCA TTAACACTGC TTATACGTGT AGCCGGGCAT ACTAAACGCT GTCACGATCG AATGACGAAA GCCTGCGCTG CGTTCCCACA CGCAGCACTG GCAGCACTGG CGGAACTTCT AGTGCAAAAA GAAGAGAATA GTTGGCGCAT TATGCTAATG ACGATGCTTA TCTCGCAACC TACACTAGCA GAACAGGTCA TTCCCTGGCT TTCGACACCC GCAGTTGCAG TGCTGAAATC ATGCCAGCAA CAACTGAAAC AGCCCTCAAA CCATGCCAGT GCCGATCTAC TGCCGGCCAT AGTAGTCTCC CCACCATGGC TTTCGAAAAA GAAAAAATCG CCGATTCCGG TGCTTGATTT AGCACCGCTC AACCTCGAAT CCATATGCAC AATAACCGAC ACAGAAGCTA AAGAGTTTCA AACTCATTGG GACTGGGAAC CGCATAAGCC AGGTGAAGGT GCTAAAGATT TCCTTTACTC GCTGGGGTAT CGCCGCTGGG ATTTTGACAC ATATAAATAT ATCGGCGCTT CTGACTCAGC AATTGACGCA TGGGAACGTG AGGACTTCGC TACACTCATC 
CAAATGTTTA AAGCCCACCA TGCACCGTAT CAGGGGGAGT GGCATCTCAA CTCACTTCCC TTTTTACCTA TGCAAAAAGC GATTAAGCTA TGGGAATTTC TCAGCAAAGA GCCGCATACC GCGATAAAAC CAGTCATGCT TTATTTGCGA CTGGCAGGTA TGAGTGGTTT TCTACACTCG TTTTCACGCT ACCCTCAAGA AGGTTTTGCG GTTGCTAATT ATTTTGCCGC AACGGAACTG GCACCTGCCG TCGCCCGCGC CTTCAACAAA CTCAAAACCC TACGACAAGA CGCCCGCACC TGGCTGCTGA AATACCCGCA ACACGCCATT ACCGGCCTGC TACCTGCGGC ACTCGGCAAA GCCGGTGAAT CCCAGGATAA CGCCCGTGCC GCTTTACGTA TGTTGATTGA AAATGGTTAT CCGTCATTAC TGCAAGAAAT CGCCCAGCGT TATAACCAGC CGGAAGTAAC CGATGCGGTG AACGCTCTAC TTGCGCTCGA TCCCTTAGAT AATCACCCGA CTAAAATCCC GACACTTCCG GCCTTTTATC AGCCATCGCT CTGGACGCGC CCGGTATTAA AAGCAAACGC CCAATCACTG CCAGATAGCG CCCTCTTCCA CCTCGGTGAA ATGCTCCGCT TCCCTCAGGA AGATACGCTG TATCCGGGGT TATTACAGGT GAAAGACGCC TGTACTGCCG ACTCACTGGC TGAGTTTGCC TGGGATCTGT TTACCGCCTG GCAGACCGCT GGCGTGCCGT CGAAAGAGAG CTGGGCGTTC ACTGCATTAG GTGTTCTTGG CAACGATGAC ACCGCCCGCA AACTGACGCC ATTAATCCGC GCCTGGCCTG GCGAATCTCA GCATAAACGT GCCACCGTTG GGTTGGATAT TCTCGCTGCT ATCGGCAGCG ATATCGCCCT GATGCAGCTT AACGGCATCG CCCAGAAACT GAAATTCAAA GCCTTACAGG AGCGGGCGAA AGAAAAAATT GCCGACATTG CCGAAAGTCG CGAACTCACA GTGGCAGAAC TTGAAGATCG GTTAGCACCG GATCTCGGTC TGGATGATAA CGGTTCGCTG CTGCTGGATT TCGGGCCACG GCAGTTCACC GTCAGCTTTG ATGAAACCTT AAAACCGTTT GTGCGTGATG CTTCCGGCAG CCGCCTGAAA GACTTGCCTA AACCGAACAA AAGCGATGAT GAAACCCAAG CGAACGATGC AGTTAACCGC TACAAACTGT TGAAAAAAGA TGCGCGTACC GTTGCCGCCC AGCAGGTAGC AAGGCTGGAA TCCGCCATGT GCCTGCGCCG CCGCTGGTCG CCGGAAAACT TTCAGCTCTT CCTGGTTGAG CATCCGCTGG TTCGCCACTT AACCCGCCGT CTGATTTGGG GCGTTTATAG CACCGAAAAC GAGCTACTGG CTTGCTTTCG CGTGGCGGAA GATAACAGCT ACAGCACCGC AGACGATGAT CTTTTCACCC TGCCGGAAGG CGATATCTCT GTTGGCATTC CTCACGTTCT GGAAATATCA CCCACGGATG CTGCCGCCTT TGGTCAACTT TTTGCCGACT ACGAACTGCT ACCGCCGTTC CGCCAGCTCG ACCGTAACAG TTACGCCCTG ACAGAAGCCG AGCGCAACGC CAGCGAACTG ACCCGCTGGG CAGGCAGAAA ATGCCCGAGC GGTCGGGTAA TGGGGCTGGC GAATAAAGGC TGGATAAAGG GCACCCCGCA GGATGGAGGC TGGATCGGCT GGATGATCAA ACCTCTCGGT CGCTGGTCGT TAATCATGGA AATCAATGAA GGATTTGCAG 
TTGGTATGTC GCCAGCCGAA CTCAGTGCCG AGCAACTCTT AAGTAAGCTG TGGCTATGGG AAGGCAAAGC GGAAAGCTAT GGCTGGGGGA GCAATTCAAC ACAGGAAGCG CAGTTCTCCG TACTCGATGC CATCACCGCC AGCGAGCTAA TTAACGATAT TGAAGCCCTT TTTGAATAA
|
Protein sequence | MRHFIFQDEK SHKFWAVEQQ GNELHINWGK VGTNGQSQVK SFADAAAAEK AGLKLIAEKV KKGYVEQAKD NSLQPSQTVT DSLKVANLST IIQEQPSFVA ETRAADKNTD AVLPWLAKDI AVVFPPEVVH TTLSHRRFPG VPVQQADKLT QLRRLACGVS QRDNKTATFD FSACTLEWQN FIAQAIDLIN SPKTTKLPSP VMAVLAALEL HCTRYKERED VMDQIIQEGG LEYATDVIIH LQQISIEWDY VNNNIVFLSS GISPDYLQQY SSFELRLRKH LSLAEESLWQ KCAQKLIAAV PHIPEWRQPL IALLLPEKPE IAHEIAQRLL GQKKLPSLEW LKIVATDEHI LASLEKYHEP YAIFDDYYCG AIWSATVLQE QGVAALPRFA PYVASDYCAD VLRHINHPFA LTLLIRVAGH TKRCHDRMTK ACAAFPHAAL AALAELLVQK EENSWRIMLM TMLISQPTLA EQVIPWLSTP AVAVLKSCQQ QLKQPSNHAS ADLLPAIVVS PPWLSKKKKS PIPVLDLAPL NLESICTITD TEAKEFQTHW DWEPHKPGEG AKDFLYSLGY RRWDFDTYKY IGASDSAIDA WEREDFATLI QMFKAHHAPY QGEWHLNSLP FLPMQKAIKL WEFLSKEPHT AIKPVMLYLR LAGMSGFLHS FSRYPQEGFA VANYFAATEL APAVARAFNK LKTLRQDART WLLKYPQHAI TGLLPAALGK AGESQDNARA ALRMLIENGY PSLLQEIAQR YNQPEVTDAV NALLALDPLD NHPTKIPTLP AFYQPSLWTR PVLKANAQSL PDSALFHLGE MLRFPQEDTL YPGLLQVKDA CTADSLAEFA WDLFTAWQTA GVPSKESWAF TALGVLGNDD TARKLTPLIR AWPGESQHKR ATVGLDILAA IGSDIALMQL NGIAQKLKFK ALQERAKEKI ADIAESRELT VAELEDRLAP DLGLDDNGSL LLDFGPRQFT VSFDETLKPF VRDASGSRLK DLPKPNKSDD ETQANDAVNR YKLLKKDART VAAQQVARLE SAMCLRRRWS PENFQLFLVE HPLVRHLTRR LIWGVYSTEN ELLACFRVAE DNSYSTADDD LFTLPEGDIS VGIPHVLEIS PTDAAAFGQL FADYELLPPF RQLDRNSYAL TEAERNASEL TRWAGRKCPS GRVMGLANKG WIKGTPQDGG WIGWMIKPLG RWSLIMEINE GFAVGMSPAE LSAEQLLSKL WLWEGKAESY GWGSNSTQEA QFSVLDAITA SELINDIEAL FE
|
| |
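The gene sequence above is printed in blocks of 10 bases and, although the gene lies on the minus strand, it is listed in coding orientation (it begins with ATG and ends with TAA). The following is a minimal sketch of how such a listing could be cleaned up and translated. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code for internal codons, and it does not handle the alternative start codons that table 11 permits. All function names are hypothetical.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence codon by codon, stopping at the first stop."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGAGACACT TTATCTTTCA GGACGAAAAA")
print(translate(first_blocks))  # MRHFIFQDEK, the first ten residues of the protein
```

Applied to the full gene sequence, the same functions could be used to compare the translation against the listed protein sequence and the GC fraction against the reported GC content.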