Gene ECD_02045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02045 
SymbolmolR 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2091337 
End bp2095131 
Gene Length3795 bp 
Protein Length1264 aa 
Translation table11 
GC content50% 
IMG OID 
Productmolybdate metabolism regulator 
Protein accessionACT43869 
Protein GI253978199 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACACT TTATCTATCA GGACGAAAAA TCACATAAAT TCTGGGCGGT GGAGCAACAG 
GGAAACGAGT TGCATATCAG TTGGGGAAAA GTTGGCACCA AAGGGCAAAG TCAGATAAAA
AGTTTTTCAG ATGCTGCGGC AGCGGAAAAA GCGGAACTTA AGCTGATTGC GGAGAAGGTG
AAGAAGGGGT ATGTGGAGCA AGCGAAGGAT AATTCTTTGC AACCTTCCCA AACGGTAACG
GGCTCTCTCA AGGTAGCGGA CTTATCCACC ATTATTCAGG AACAACCCTC TTTCGTAGCA
GAAACCCGTG CGCCTGACAA AAATACAGAT GCTGTTTTAC CGTGGCTGGC GAAAGATATT
GCTGTCGTTT TTCCGCCCGA AGTTGTACAC ACCACGTTAA GTCATCGCCG CTTTCCCGGA
GTTCCTGTTC AGCAAGCAGA CAAATTGACC CAATTACGTC GCTTAGCCTG TAGTGTGTCG
CAACGGGATA ATAAAACAGC CACATTTGAC TTCAGCGCCT GTTCTTTAGA ATGGCAAAAC
ACCGTCGCCC AGGCGATCAG TCAGATCGAC GGCCTGAAAA CAACACAGTT ACCATCACCA
GTAATGGCTG TACTCACGGC ACTTGAAATG AAATGCACAA GATATAAAGT GCGTGAGGAT
GTTATGGATC AGATCGTCCA GGAAGGCGGT CTGGAATATG CTACTGATGT AATAATACAC
CTTCAACAGA TTGATATTGA ATGGGATTAT GCGAATAATG TCATTATTAT TCTGCCGTCT
GGCATTGCAC CTAGCTACTT GGAGCAATAT TCCAGATTTG AATTACGCCT ACGTAAACAT
TTATCACTGA CGGAAGAGTC TCTCTGGCAA AAATGTGCAC AAAAACTTAT TGCCGCAATT
CCACATATTC CAGAATGGCG GCAACCATTA ATTGCTTTGT TATTACCCGA AAAACCAGAA
ATTGCACATG AAATTGCCCA GCGTCTACTG GGGCAAAAAA AATTACCCTC GCTTGAGTGG
TTAAAAATAG TGGCGACTGA TGAGCACATT CTTGCCTCAT TAGAAAAATA TCACGAACCA
TATGCCATTT TTGATGATTA CTATTGTGGT GCGATATGGT CAGCCACCGT ATTACAGGAG
CAAGGTGTTG CAGCCCTGCC CCGATTTGCT CCCTATGCCG CAAGTGACTA CTGCGCCGAT
GTGTTGCGTC ATATCAATCA TCCGTTCGCA TTGACACTGC TTATACGTGT AGCCGGGCAA
ACTAAACGCT GTCACGATCG GATGACGAAA GCCATTGCTG CGTTCCCACA TGCAGCAATG
GCGGCACTGA CGGAACTTCT TGGGCAAAAA GAAGAGAACA GTTGGCGCAT TATGCTAATG
ACAATGCTTA TCTCACAACC AGCACTGGCA GAACAGGTCA TTCCCTGGCT CTCGACACCC
GCAGTTGCCG TACTGAAATC ATGCCAGCAA CAACTGACAC AGCCCTCAAA CCATGCCAGC
GCCGATCTAC TGCCAGCCGT AGTAGTCTCC CCTCCCTGGC TTTCGAAAAA GAAAAAATCG
CCGATTCCGG TGCTGGATTT AGCGCCATTA GGCATTGAGC CAATCTGTTA TCTGACAGAA
GAAATCAGTA ATCAACTTTT GGCGAAATAT ATCTGGTATT CAAAACACAT CACGGTTAGC
CATGAAGAAA GTACTACCAA CCTGTTGGCA AGGATGGGTT TTCAACGACG GATCGCTGGT
ACATATATTA AAGCTCCCGA AGCGGTAGTT GAGGCATGGC TAAATGAAGA TTATTCAACC
TTACTAAGTG AATTTAAGGT GTTTCATTCA CCTACCGGGC ATTATTGGCA GTTGGGGATT
TTGACAACAT TGCCGCTGGA GAAAGCAGTA AAAGCATGGA ATGCCCTTAC CCTATCTCCA
CATACCGATA CCGAATACTC CATGTTACAT TTTGGACTCA AAGGGTTACC TGGGTTAGTA
AACTCACTTG CACGCTATCC ACAAGAAGCC TTGCCCATCA CGAATTACTT CGCAGCGAGT
GAGCTGGCAC CTGCCGTCGC CCGCGCCTTC AACAAACTCA AAACCCTACG ACAAGACGCC
CGTAGCTGGC TGCTAAAATA CCCGGAACAT GCCATAACCG GCCTGCTACC TGCGGCGCTC
GGCAAAGCCG GTGAAGCCCA GGATAACGCC CGCGCTGCCT TGCGTATGCT TACCGAAAAC
GGTCATCAGC CATTACTGCA AGAAATCGCC CGGCGTTATA ACCAGCCGGA AGTAACAGAC
GCGGTGAACG CTCTGCTTGC GCTCGATCCC TTAGATAATC ACCCGACAAA AATCCCCACT
CTTCCGGCCT TTTATCAGCC ATCGCTCTGG ACGCGCCCGG TATTAAAAGC AAACGCCCAA
TCACTGCCAG ATAACGCCCT CCTCCACCTC GGTGAAATGC TCCGCTTCCC TCAGGAAGAG
GCTCTATATC CGGGATTATT GCAGGTGAAA GATGCCTGTA CCGCCGACTC ACTGGCTGAA
TTTGCCTGGG ATCTGTTTAC CGCCTGGCAG ACCGCTGGCG CGCCGTCGAA AGAAAGCTGG
GCGTTCACTA CGTTAGGCGT TCTCGGTAAC GATGACACCG CCCGCAAACT GACGCCATTA
ATCCGCGCCT GGCCTGGCGA ATCTCAGCAT AAACGTGCCA CCGTTGGGTT GGATATTCTC
GCTGCTATCG GTAGTGATAT CGCCCTGATG CAGCTTAACG GCATCGCCCA GAAACTGAAA
TTCAAAGCAT TACAGGAGCG GGCAAAAGAA AAAATTGCCG ACATTGCCGA AAGTCGCGAA
CTCACGGTGG CGGAGCTTGA AGATCGGTTA GCACCGGATC TCGGTCTGGA TGATAACGGT
TCGCTGCTGC TGGATTTCGG CCCACGTCAG TTCACCGTCA GCTTTGATGA AACCTTAAAA
CCGTTTGTGC GCGATGCTTC CGGCAGCCGC CTGAAAGACC TGCCCAAACC AAACAAAAGC
GATGATGAAT CGCGGGCTGA TGAGGCGGTT AACCGCTACA AACTGCTGAA AAAAGATGCG
CGTACCGTCG CCGCCCAGCA GGTGGCAAGG CTTGAATCCG CCATGTGCCT GCGCCGCCGC
TGGTCACCAG AAAACTTCCA GCTCTTCCTG GTTGAGCACC CGATGGTTCG CCACTTAACC
CGACGTCTGA TTTGGGGCGT TTATAGCGCC GACAACCAGC TACAGGCGTG CTTTAGGGTA
GCGGAAGATA ACAGCTACAG CACCGCTGAC GATGATCTTT TCACCCTGCC GGAAGGCGAT
ATCTCTCTTG GTATTCCTCA CGTTCTGGAA ATATCACCGA CGGATGCTGC CGCCTTTGGT
CAGCTTTTTG CCGACTACGA ACTGCTACCA CCGTTCCGTC AGCTTGACCG TAACAGTTAC
GCCCTGACAG AAGCCGAGCG CAATGCCAGC GAATTGATCC GCTGGGCAGG CAGAAAATGC
CCGAGTGGTC GGGTAATGGG GCTGGCGAAT AAAGGCTGGA TAAAGGGCAC CCCGCAGGAT
GCAGGCTGGA TCGGCTGGAT GATCAAACCT CTCGGTCGCT GGTCGTTAAT CATGGAAATC
GATGAAGGCT TTGCGGTAGG CATGTCGCCA GCCGAACTCA GCGCTGAGCA GCTCTTAAGC
AAGCTGTGGC TATGGGAAGG CAAAGCAGAA AGATATGGCT GGGGGAGTAA TTCAACACAG
GAAGCGCAGT TCTCCGTAAT CGATGCCATC ACCGCCAGCG AGCTAATTAA CGATATTGAA
GCCCTGTTTG AATAA
 
Protein sequence
MRHFIYQDEK SHKFWAVEQQ GNELHISWGK VGTKGQSQIK SFSDAAAAEK AELKLIAEKV 
KKGYVEQAKD NSLQPSQTVT GSLKVADLST IIQEQPSFVA ETRAPDKNTD AVLPWLAKDI
AVVFPPEVVH TTLSHRRFPG VPVQQADKLT QLRRLACSVS QRDNKTATFD FSACSLEWQN
TVAQAISQID GLKTTQLPSP VMAVLTALEM KCTRYKVRED VMDQIVQEGG LEYATDVIIH
LQQIDIEWDY ANNVIIILPS GIAPSYLEQY SRFELRLRKH LSLTEESLWQ KCAQKLIAAI
PHIPEWRQPL IALLLPEKPE IAHEIAQRLL GQKKLPSLEW LKIVATDEHI LASLEKYHEP
YAIFDDYYCG AIWSATVLQE QGVAALPRFA PYAASDYCAD VLRHINHPFA LTLLIRVAGQ
TKRCHDRMTK AIAAFPHAAM AALTELLGQK EENSWRIMLM TMLISQPALA EQVIPWLSTP
AVAVLKSCQQ QLTQPSNHAS ADLLPAVVVS PPWLSKKKKS PIPVLDLAPL GIEPICYLTE
EISNQLLAKY IWYSKHITVS HEESTTNLLA RMGFQRRIAG TYIKAPEAVV EAWLNEDYST
LLSEFKVFHS PTGHYWQLGI LTTLPLEKAV KAWNALTLSP HTDTEYSMLH FGLKGLPGLV
NSLARYPQEA LPITNYFAAS ELAPAVARAF NKLKTLRQDA RSWLLKYPEH AITGLLPAAL
GKAGEAQDNA RAALRMLTEN GHQPLLQEIA RRYNQPEVTD AVNALLALDP LDNHPTKIPT
LPAFYQPSLW TRPVLKANAQ SLPDNALLHL GEMLRFPQEE ALYPGLLQVK DACTADSLAE
FAWDLFTAWQ TAGAPSKESW AFTTLGVLGN DDTARKLTPL IRAWPGESQH KRATVGLDIL
AAIGSDIALM QLNGIAQKLK FKALQERAKE KIADIAESRE LTVAELEDRL APDLGLDDNG
SLLLDFGPRQ FTVSFDETLK PFVRDASGSR LKDLPKPNKS DDESRADEAV NRYKLLKKDA
RTVAAQQVAR LESAMCLRRR WSPENFQLFL VEHPMVRHLT RRLIWGVYSA DNQLQACFRV
AEDNSYSTAD DDLFTLPEGD ISLGIPHVLE ISPTDAAAFG QLFADYELLP PFRQLDRNSY
ALTEAERNAS ELIRWAGRKC PSGRVMGLAN KGWIKGTPQD AGWIGWMIKP LGRWSLIMEI
DEGFAVGMSP AELSAEQLLS KLWLWEGKAE RYGWGSNSTQ EAQFSVIDAI TASELINDIE
ALFE