Gene EcSMS35_0927 details

Gene Information

Locus tag: EcSMS35_0927
Symbol:
ID: 6143772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 937083
End bp: 940745
Gene Length: 3663 bp
Protein Length: 1220 aa
Translation table: 11
GC content: 50%
IMG OID: 641615815
Product: molybdate metabolism regulator MolR-like protein
Protein accession: YP_001743007
Protein GI: 170680560
COG category:
COG ID:
TIGRFAM ID:
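
The coordinate and length fields above are consistent with each other, which can be confirmed with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates (the record does not state the convention) and that the final codon is a stop codon excluded from the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(937083, 940745)
print(length_bp)                  # 3663
print(protein_length(length_bp))  # 1220
```

Under these assumptions the computed values match the Gene Length (3663 bp) and Protein Length (1220 aa) fields of the record.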


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAGG AATTACCGTG GCTGGCGGAT AACGCGCAAC TGGAACTGAA ATATAAAAAG 
GGTAAAACAC CACTCAGCCA TCGCAATTGG CCGGGTGAAC CGGTGCCTGT TATCACTGAA
AGCATCATCC AGACACTGGG TGATGAATTG CTACACATTG CTGAGAAGAA AAAAAACATT
GTCTGGCGTT ATGATAATTT TTCACTGGAG TGGCAGTCCG CCATCACGCA GGCCATCAAC
TTGATCGGCG AACACAAACC CTCAATCCCG GCCCGGACAA TGGCGGCGCT AGCCTGTATC
GCGCAAAATG ATAGCCAACA GTTACTCGAC GAAATCGTCC AACAAGAGGG ACTGGAATAT
GCGACTGAAG TGGTGATTGC ACGCCAGTTT ATTGTACGGT GTTATGAGAG TGATCCTCTG
GTAGTAACAT TGCAGTATCA GAAGGAGGAC TATGGCTATG GCTATGGCTA TGGCTATGGC
TATGGCTATG GTTATCGCTC AGAAACCTAT AACGAATTCA ATCTCCGACT GCGTAAGCAT
CTCTCTCTGG CAGAGGAAAG CTGCTGGCAG CGTTGCGCCG ACAAACTCAT TGCCGCACTG
CCAGGACTAC CCCAAGTTCG CCGTCCTTTT ATTGCACTCA TCCTCCCGGA AAAACCAGAA
ATCGCCAATG AGTTGGTAGG CCTTGAATGC CCGTGGACTC ATTTTCATTC TAAGGAGTGG
TTAAAAGTTG TTGCTACTGA CCACAGGGCA GTGGGAAAAC TCGAACGCTA CTGGAGCCAG
GATATATTTA GCGATCGCGA AGCCAGCTAC ATGTCGCATG AAAACCACTT CGGCTACGCG
GCCTGCGCCG CCCTTTTGCG CGAACAAGGA ATTGCAGCCG TTCCACGCCT CGCGATGTAT
GCCCATAAAG AAGATTGCGG TAGCCTGCTG GTACAAATTA ACCATCCGCA GGTCATCCGC
ACCCTGCTGT TAGTTGCTGA TAAAAACAAA CCCAGCCTGC AACGTGTGGC TAAATACAGT
AAAAACTTCC CCCATGCGAC GCTTGCCGCA CTGGCAGAGC TTCTGGCGTT AAAAGAACCG
CCTGCACGCC CTGGTTATCC AATCATCGAA GACAAAAAGC TGCCTGCACA GCAAAAAGCT
CGAGATGAAT ACTGGCGTAC GCTGTTACAA ACGCTGATGG CATCGCAGCC ACAACTGGCA
GAAGAGGTGA TGCCGTGGTT AAGTACTCAA GCCCAGGCAG TGCTGAATAG TTATTTATCG
GCACCGCCCA AAACGGTTAT TGATAGCACC GATAACATCC AAATGCCTGA AATCCTCGTT
TCACCACCGT GGCGTAGTAA GAAAAAAATG ACAGTTCCAC GTCTTGATTT GGCACCGTTC
GAATTAACGC CGCAAGTTTA CTGGCAACCA GGCGAACGAG AGAGGCTTGC CGCCACTGAG
TCTGCCCGTT ATTTCAGCAC GGAATCTCTT GCGGAACGCA TGGAACAAAA AAGTGGACGA
GTTGTATTAC AGGAGCTGGG TTTTGGAGAT GATGTATGGC TGTTTTTGAA TTATATACTC
CCCGGAAAAC TGGATGCTGC ACGCAATTCA CTCATTGTTC AGTGGCATTA CTACCCAGGG
CGGGTTGAAG AGATCATGAA TGGCTGGAGC TCCCCGGAAG CTCAATTAGC AGAACAGGCG
CTTCGCAACG GTCACGTAGA AGTGTTAATT AACATATGGG AAAATGACAG CTACTCACGC
TATCGTCGGG AAAAGAGTAT CTGGAACCTG TATTTATTGG CGCAGTTGCC GCGTGAGATG
GCTTTGACTT TCTGGCTACG TATCAATGAG AAAAAGCATC TGTCTGCGGG TGAGGACTAT
TTTCTCAGTA TCTTCGGATT GGATGCTCTA CCAGGTCTGC TGTTGGCTTT TTCACATCGT
CCAAAAGAAA CATTTCCATT AATTTTAAAT TTCGGCGCAA CAGAACTGGC CCTGCCCGTT
GCCCGCGTCT GGCGACGTTT CGCGGCGCAG CGTGATCTGG CTCGCCAGTG GATTTTACAT
TGGCCGGAAC ATACAGCTAC TGCACTTATC CCTCTTGTCT TCACCAAATC CAGCGATAAT
AGCGAAGCTG CATTACTTGC CCTGCGTTTA CTTTACGAAC AGGGACATGG CGAATTACTG
CAAACCGTGG CAAACCGCTG GCAGCATACA GATGTATGGC CTGCCCTGGA GCAATTACTT
AAACAGGGTC CAATGGACAT TTACCCGGCA CGCATTCCCA AAGCCCCTGA TTTCTGGCAT
CCGACAATGT GGTCCAGACC GAGACTTATC ACGAATAATC AGCCTGTTAC CGATGATGCT
CTGGAAATTA TCGGCGACAT GCTGCGCTTT ACCCAGGGCG GACGTTTTTA TTGCGGGCTG
GAACAACTGA AAACGTTCTG CCAGCCACAA ACGCTGGCAG CTTTTGCCTG GGATCTCTTC
ACCGCGTGGC AACAAGTTGG CGCCCCCGCA AAAGACAACT GGGCATTTCT GGCGTTAAGT
CTCTTCGGTG ACGAAAGCAC GGCTCGGGAT CTGACAACAC AGATCCTCGC CTGGCCACAG
GAAGGCAAAT CTGCCCGTGC CGTCAGTGGC CTGAACATAC TTACTCAGAT GAATAATGAT
ATGGCGCTGA TACAGCTGCA TCATATATCG CAACGGGCGA AATCCCGCCC CTTACGTGAT
AACGCGGCGG AATTTCTTCA GGTGGTCGCA GAAAATCGTA GGCTAAGCCA GGAACAGTTA
GCGGACAGAT TAGTCCCAAC ACTGGGTCTT GATGATCCGC AGGCGTTGAG TTTTGATTTT
GGTCCACGGC AATTTACCGT TCGCTTTGAT GAAAACCTTA ACCCGGTTAT CTTTGATCAG
CAAAACGTTC GCCAGAAAAG CGTTCCCCGG TTGCGCGCCG ATGACGATCA ACTGAAAGCG
CCCGAGGCAC TGGCCCGACT AAAAGGGCTA AAAAAAGATG CGACTCAGGT GAGCAAAAAC
CTGCTCCCGC GTCTTGAAAC GGCCCTACGC ACCACCCGAC GCTGGTCGCT GGCAGATTTT
CATACTCTGT TTGTTAATCA TCCCTTTACC CGCCTGGTTA CTCAGCGATT AATATGGGGC
GTGTATCCGG CAAATGAACC GCGTCGTTTA CTCAACGCCT TTCGTGTGGC CGCAGAGGGG
GAGTTCTGCA ATGCACAAGA TGAGCCCATT GACCTGCCTG CGGATGCTTT GATTGGCATT
GCCCACCCGT TAGAAATGAC AGCAGAAATG CGCAGTGAAT TTGCACAGCT TTTTGCCGAT
TACGAAATTA TGCCGCCTTT TCGCCAGTTG ACGAGGCGCA CAGTGCTGCT CACGCCTGAC
GAGTCAGCCA GTAACAGCCT GACTCGCTGG GAAGGTAAAT CCGCTACCGT TGGGCAACTT
ATGGGAATGC GATACAAAGG CTGGGAGTCA GGCTATGAGG ACGCATTTGT CTATGACCTG
GGCGAGTACC GGCTGGTCCT TAAGTTTTCA TCCGGTTTTA ACCACTACAA TGTTGATAGC
AAAGCGCTAA TGAGCTTCCG TTCTCTTCAT TTATACCGTG ACAATAAATC TGTCACTTTT
GCCGAACTTG ATGTGTTTGA TTTGAGTGAA GCGTTAAGCG CACCCGACGT CATTTTCCAT
TAA
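
The gene sequence above is displayed in blocks of 10 bases with line breaks, so it has to be joined into one string before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation; the helper names are hypothetical, and the demonstration uses only the first line of the sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence (spaces, line breaks) into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a small demonstration.
first_line = "ATGGACAAGG AATTACCGTG GCTGGCGGAT AACGCGCAAC TGGAACTGAA ATATAAAAAG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of these 60 bases only
```

Passing the full block of sequence lines to `clean_sequence` should, if the record is intact, yield a string whose length equals the Gene Length field.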
 
Protein sequence
MDKELPWLAD NAQLELKYKK GKTPLSHRNW PGEPVPVITE SIIQTLGDEL LHIAEKKKNI 
VWRYDNFSLE WQSAITQAIN LIGEHKPSIP ARTMAALACI AQNDSQQLLD EIVQQEGLEY
ATEVVIARQF IVRCYESDPL VVTLQYQKED YGYGYGYGYG YGYGYRSETY NEFNLRLRKH
LSLAEESCWQ RCADKLIAAL PGLPQVRRPF IALILPEKPE IANELVGLEC PWTHFHSKEW
LKVVATDHRA VGKLERYWSQ DIFSDREASY MSHENHFGYA ACAALLREQG IAAVPRLAMY
AHKEDCGSLL VQINHPQVIR TLLLVADKNK PSLQRVAKYS KNFPHATLAA LAELLALKEP
PARPGYPIIE DKKLPAQQKA RDEYWRTLLQ TLMASQPQLA EEVMPWLSTQ AQAVLNSYLS
APPKTVIDST DNIQMPEILV SPPWRSKKKM TVPRLDLAPF ELTPQVYWQP GERERLAATE
SARYFSTESL AERMEQKSGR VVLQELGFGD DVWLFLNYIL PGKLDAARNS LIVQWHYYPG
RVEEIMNGWS SPEAQLAEQA LRNGHVEVLI NIWENDSYSR YRREKSIWNL YLLAQLPREM
ALTFWLRINE KKHLSAGEDY FLSIFGLDAL PGLLLAFSHR PKETFPLILN FGATELALPV
ARVWRRFAAQ RDLARQWILH WPEHTATALI PLVFTKSSDN SEAALLALRL LYEQGHGELL
QTVANRWQHT DVWPALEQLL KQGPMDIYPA RIPKAPDFWH PTMWSRPRLI TNNQPVTDDA
LEIIGDMLRF TQGGRFYCGL EQLKTFCQPQ TLAAFAWDLF TAWQQVGAPA KDNWAFLALS
LFGDESTARD LTTQILAWPQ EGKSARAVSG LNILTQMNND MALIQLHHIS QRAKSRPLRD
NAAEFLQVVA ENRRLSQEQL ADRLVPTLGL DDPQALSFDF GPRQFTVRFD ENLNPVIFDQ
QNVRQKSVPR LRADDDQLKA PEALARLKGL KKDATQVSKN LLPRLETALR TTRRWSLADF
HTLFVNHPFT RLVTQRLIWG VYPANEPRRL LNAFRVAAEG EFCNAQDEPI DLPADALIGI
AHPLEMTAEM RSEFAQLFAD YEIMPPFRQL TRRTVLLTPD ESASNSLTRW EGKSATVGQL
MGMRYKGWES GYEDAFVYDL GEYRLVLKFS SGFNHYNVDS KALMSFRSLH LYRDNKSVTF
AELDVFDLSE ALSAPDVIFH