Gene EcSMS35_0928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0928 
SymbolmolR 
ID6144730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp940755 
End bp944543 
Gene Length3789 bp 
Protein Length1262 aa 
Translation table11 
GC content50% 
IMG OID641615816 
Productmolybdate metabolism regulator MolR 
Protein accessionYP_001743008 
Protein GI170680946 
COG category[S] Function unknown 
COG ID[COG3831] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACACT TTATCTTTCA GGACGAAAAA TCACATAAAT TCTGGGCGGT GGAGCAGCAG 
GGAAACGAAC TGCACATCAA CTGGGGTAAA GTCGGCACCA ACGGACAAAG CCAGGTAAAA
AGTTTTGCGG ATGCAGCAGC GGCGGAAAAA GCGGGACTGA AGCTGATTGC GGAGAAGGTG
AAGAAGGGAT ATGTGGAGCA AGCGAAGGAT AATTCTTTGC AACCTTCCCA AACGGTAACG
GACTCTCTCA AGGTAGCGAA CTTATCCACC ATTATTCAGG AACAACCCTC TTTCGTAGCA
GAAACCCGTG CGGCTGACAA AAATACAGAT GCTGTTTTAC CGTGGCTGGC GAAAGATATT
GCTGTCGTTT TTCCGCCCGA AGTTGTACAC ACCACGTTAA GTCATCGCCG CTTTCCCGGA
GTTCCTGTTC AGCAAGCAGA CAAATTGACC CAATTACGTC GCTTAGCCTG TGGCGTGTCG
CAACGGGATA ACAAAACAGC TACATTTGAC TTCAGCGCCT GTACTTTAGA GTGGCAAAAC
TTCATCGCTC AGGCGATCGA TCTGATTAAC AGCCCGAAAA CGACAAAATT ACCATCACCG
GTAATGGCTG TACTCGCGGC ACTTGAACTG CACTGCACGA GATATAAAGA GCGTGAGGAT
GTTATGGATC AGATTATTCA GGAAGGCGGT CTGGAATATG CTACTGATGT AATAATACAT
CTTCAACAAA TATCTATTGA ATGGGATTAT GTCAACAATA ATATTGTTTT CCTGTCGTCT
GGCATTTCAC CTGATTATTT GCAGCAATAT TCCAGCTTTG AATTACGCCT ACGTAAACAT
TTATCACTGG CGGAAGAGTC TCTCTGGCAA AAATGTGCAC AAAAACTTAT TGCCGCAGTT
CCACATATTC CAGAATGGCG GCAACCATTA ATTGCTTTGT TATTGCCCGA AAAACCAGAA
ATTGCACATG AAATTGCCCA GCGTCTACTG GGGCAAAAGA AATTACCCTC GCTGGAGTGG
CTAAAAATAG TGGCGACTGA TGAGCACATT CTTGCCTCAT TAGAAAAATA TCACGAGCCA
TATGCCATTT TTGATGATTA CTATTGTGGT GCGATATGGT CAGCCACCGT ATTACAGGAG
CAAGGTGTTG CGGCCCTGCC CCGATTTGCT CCCTATGTCG CAAGTGACTA CTGCGCCGAT
GTGTTGCGTC ATATCAATCA TCCGTTCGCA TTAACACTGC TTATACGTGT AGCCGGGCAT
ACTAAACGCT GTCACGATCG AATGACGAAA GCCTGCGCTG CGTTCCCACA CGCAGCACTG
GCAGCACTGG CGGAACTTCT AGTGCAAAAA GAAGAGAATA GTTGGCGCAT TATGCTAATG
ACGATGCTTA TCTCGCAACC TACACTAGCA GAACAGGTCA TTCCCTGGCT TTCGACACCC
GCAGTTGCAG TGCTGAAATC ATGCCAGCAA CAACTGAAAC AGCCCTCAAA CCATGCCAGT
GCCGATCTAC TGCCGGCCAT AGTAGTCTCC CCACCATGGC TTTCGAAAAA GAAAAAATCG
CCGATTCCGG TGCTTGATTT AGCACCGCTC AACCTCGAAT CCATATGCAC AATAACCGAC
ACAGAAGCTA AAGAGTTTCA AACTCATTGG GACTGGGAAC CGCATAAGCC AGGTGAAGGT
GCTAAAGATT TCCTTTACTC GCTGGGGTAT CGCCGCTGGG ATTTTGACAC ATATAAATAT
ATCGGCGCTT CTGACTCAGC AATTGACGCA TGGGAACGTG AGGACTTCGC TACACTCATC
CAAATGTTTA AAGCCCACCA TGCACCGTAT CAGGGGGAGT GGCATCTCAA CTCACTTCCC
TTTTTACCTA TGCAAAAAGC GATTAAGCTA TGGGAATTTC TCAGCAAAGA GCCGCATACC
GCGATAAAAC CAGTCATGCT TTATTTGCGA CTGGCAGGTA TGAGTGGTTT TCTACACTCG
TTTTCACGCT ACCCTCAAGA AGGTTTTGCG GTTGCTAATT ATTTTGCCGC AACGGAACTG
GCACCTGCCG TCGCCCGCGC CTTCAACAAA CTCAAAACCC TACGACAAGA CGCCCGCACC
TGGCTGCTGA AATACCCGCA ACACGCCATT ACCGGCCTGC TACCTGCGGC ACTCGGCAAA
GCCGGTGAAT CCCAGGATAA CGCCCGTGCC GCTTTACGTA TGTTGATTGA AAATGGTTAT
CCGTCATTAC TGCAAGAAAT CGCCCAGCGT TATAACCAGC CGGAAGTAAC CGATGCGGTG
AACGCTCTAC TTGCGCTCGA TCCCTTAGAT AATCACCCGA CTAAAATCCC GACACTTCCG
GCCTTTTATC AGCCATCGCT CTGGACGCGC CCGGTATTAA AAGCAAACGC CCAATCACTG
CCAGATAGCG CCCTCTTCCA CCTCGGTGAA ATGCTCCGCT TCCCTCAGGA AGATACGCTG
TATCCGGGGT TATTACAGGT GAAAGACGCC TGTACTGCCG ACTCACTGGC TGAGTTTGCC
TGGGATCTGT TTACCGCCTG GCAGACCGCT GGCGTGCCGT CGAAAGAGAG CTGGGCGTTC
ACTGCATTAG GTGTTCTTGG CAACGATGAC ACCGCCCGCA AACTGACGCC ATTAATCCGC
GCCTGGCCTG GCGAATCTCA GCATAAACGT GCCACCGTTG GGTTGGATAT TCTCGCTGCT
ATCGGCAGCG ATATCGCCCT GATGCAGCTT AACGGCATCG CCCAGAAACT GAAATTCAAA
GCCTTACAGG AGCGGGCGAA AGAAAAAATT GCCGACATTG CCGAAAGTCG CGAACTCACA
GTGGCAGAAC TTGAAGATCG GTTAGCACCG GATCTCGGTC TGGATGATAA CGGTTCGCTG
CTGCTGGATT TCGGGCCACG GCAGTTCACC GTCAGCTTTG ATGAAACCTT AAAACCGTTT
GTGCGTGATG CTTCCGGCAG CCGCCTGAAA GACTTGCCTA AACCGAACAA AAGCGATGAT
GAAACCCAAG CGAACGATGC AGTTAACCGC TACAAACTGT TGAAAAAAGA TGCGCGTACC
GTTGCCGCCC AGCAGGTAGC AAGGCTGGAA TCCGCCATGT GCCTGCGCCG CCGCTGGTCG
CCGGAAAACT TTCAGCTCTT CCTGGTTGAG CATCCGCTGG TTCGCCACTT AACCCGCCGT
CTGATTTGGG GCGTTTATAG CACCGAAAAC GAGCTACTGG CTTGCTTTCG CGTGGCGGAA
GATAACAGCT ACAGCACCGC AGACGATGAT CTTTTCACCC TGCCGGAAGG CGATATCTCT
GTTGGCATTC CTCACGTTCT GGAAATATCA CCCACGGATG CTGCCGCCTT TGGTCAACTT
TTTGCCGACT ACGAACTGCT ACCGCCGTTC CGCCAGCTCG ACCGTAACAG TTACGCCCTG
ACAGAAGCCG AGCGCAACGC CAGCGAACTG ACCCGCTGGG CAGGCAGAAA ATGCCCGAGC
GGTCGGGTAA TGGGGCTGGC GAATAAAGGC TGGATAAAGG GCACCCCGCA GGATGGAGGC
TGGATCGGCT GGATGATCAA ACCTCTCGGT CGCTGGTCGT TAATCATGGA AATCAATGAA
GGATTTGCAG TTGGTATGTC GCCAGCCGAA CTCAGTGCCG AGCAACTCTT AAGTAAGCTG
TGGCTATGGG AAGGCAAAGC GGAAAGCTAT GGCTGGGGGA GCAATTCAAC ACAGGAAGCG
CAGTTCTCCG TACTCGATGC CATCACCGCC AGCGAGCTAA TTAACGATAT TGAAGCCCTT
TTTGAATAA
 
Protein sequence
MRHFIFQDEK SHKFWAVEQQ GNELHINWGK VGTNGQSQVK SFADAAAAEK AGLKLIAEKV 
KKGYVEQAKD NSLQPSQTVT DSLKVANLST IIQEQPSFVA ETRAADKNTD AVLPWLAKDI
AVVFPPEVVH TTLSHRRFPG VPVQQADKLT QLRRLACGVS QRDNKTATFD FSACTLEWQN
FIAQAIDLIN SPKTTKLPSP VMAVLAALEL HCTRYKERED VMDQIIQEGG LEYATDVIIH
LQQISIEWDY VNNNIVFLSS GISPDYLQQY SSFELRLRKH LSLAEESLWQ KCAQKLIAAV
PHIPEWRQPL IALLLPEKPE IAHEIAQRLL GQKKLPSLEW LKIVATDEHI LASLEKYHEP
YAIFDDYYCG AIWSATVLQE QGVAALPRFA PYVASDYCAD VLRHINHPFA LTLLIRVAGH
TKRCHDRMTK ACAAFPHAAL AALAELLVQK EENSWRIMLM TMLISQPTLA EQVIPWLSTP
AVAVLKSCQQ QLKQPSNHAS ADLLPAIVVS PPWLSKKKKS PIPVLDLAPL NLESICTITD
TEAKEFQTHW DWEPHKPGEG AKDFLYSLGY RRWDFDTYKY IGASDSAIDA WEREDFATLI
QMFKAHHAPY QGEWHLNSLP FLPMQKAIKL WEFLSKEPHT AIKPVMLYLR LAGMSGFLHS
FSRYPQEGFA VANYFAATEL APAVARAFNK LKTLRQDART WLLKYPQHAI TGLLPAALGK
AGESQDNARA ALRMLIENGY PSLLQEIAQR YNQPEVTDAV NALLALDPLD NHPTKIPTLP
AFYQPSLWTR PVLKANAQSL PDSALFHLGE MLRFPQEDTL YPGLLQVKDA CTADSLAEFA
WDLFTAWQTA GVPSKESWAF TALGVLGNDD TARKLTPLIR AWPGESQHKR ATVGLDILAA
IGSDIALMQL NGIAQKLKFK ALQERAKEKI ADIAESRELT VAELEDRLAP DLGLDDNGSL
LLDFGPRQFT VSFDETLKPF VRDASGSRLK DLPKPNKSDD ETQANDAVNR YKLLKKDART
VAAQQVARLE SAMCLRRRWS PENFQLFLVE HPLVRHLTRR LIWGVYSTEN ELLACFRVAE
DNSYSTADDD LFTLPEGDIS VGIPHVLEIS PTDAAAFGQL FADYELLPPF RQLDRNSYAL
TEAERNASEL TRWAGRKCPS GRVMGLANKG WIKGTPQDGG WIGWMIKPLG RWSLIMEINE
GFAVGMSPAE LSAEQLLSKL WLWEGKAESY GWGSNSTQEA QFSVLDAITA SELINDIEAL
FE