Gene EcSMS35_1735 details

Gene Information

Locus tag: EcSMS35_1735
Symbol:
ID: 6145643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1739827
End bp: 1741233
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 51%
IMG OID: 641616611
Product: GntR family transcriptional regulator
Protein accession: YP_001743789
Protein GI: 170682466
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
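
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1739827
end_bp = 1741233
gene_length_bp = 1407
protein_length_aa = 468

# Assumption: coordinates are inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the span equals the stated gene length, and the codon count minus one equals the stated protein length.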


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.453488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0000000000634797
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAT ACCAGCAGCT TGCAGAACAA TTACGCGAGC AGATTGCGTC GGGTATCTGG 
CAACCCGGAG ATCGTTTGCC TTCGTTGCGT GACCAGGTGG CACTTTCTGG CATGAGCTTT
ATGACCGTGA GCCATGCCTA TCAGTTGCTC GAAAGTCAGG GATATATTAT CGCACGACCG
CAGTCGGGTT ATTACGTTGC GCCACAGGCA ATAAAAATGC CGAAAGCGCC AGTCATTCCA
GTCACTCGAG ATGAAGCAGT CGATATCAAC ACTTATATTT TTGATATGTT GCAGGCCAGT
CGCGATCCGT CGGTCGTTCC GTTTGCCTCG GCCTTTCCCG ACCCGCGACT TTTCCCCCTC
CAACAACTAA ACCGCTCGCT GGCGCAGGTA AGCAAAACCG CCACAGCGAT GAGCGTGATT
GAAAACTTAC CGCCAGGAAA CGCAGAACTG CGTCAGGCTA TTGCTCGTCG CTATGCCTTA
CAGGGCATCA CTATTTCTCC AGATGAAATT GCCATTACCG CCGGGGCGTT AGAGGCATTA
AACCTCAGTT TGCAAGTGGT AACTGAACCG GGCGATTGGG TGATAGTAGA GAATCCTTGT
TTCTACGGCG CTTTGCAGGC GCTGGAGCGG TTACGGCTGA AGGCGTTATC GGTGGCGACG
GACGTTAAAG AAGGGATCGA TCTTCAGGCG CTGGAACTGG CGTTGCAGGA GTATCCGGTG
AAAGCGTGCT GGCTGATGAC TAATAGCCAG AATCCACTCG GATTTACCTT AACACCGCAA
AAAAAAGCAC AACTGGTGGC GTTGCTCAAT CAGTACAACG TAACGCTGAT TGAAGATGAC
GTTTACAGCG AACTTTATTT TGGACGGGAA AAACCGCTGC CTGCGAAAGC GTGGGATCAC
CACGATGGTG TTTTGCATTG CTCTTCGTTT TCGAAATGTC TGGTGCCTGG TTTTCGTATT
GGTTGGGTCG CCGCCGGAAA ACATGCACGT AAAATTCAAC GCTTGCAGTT GATGAGTACG
CTTTCCACCA GCTCACCGAT GCAGCTTGCG CTGGTGGATT ACCTTTCTAC TCGCCGATAC
GATGCTCATC TTCGTCGTCT GCGTCGCCAG CTTGCGGAAC GTAAACAACG CGCCTGGCAG
GCTTTGCTGC GTTATCTGCC TGCGGAAGTC AAAATTCATC ACAATGACAG CGGTTATTTT
CTCTGGCTGG AGCTCCCCGA GCCGTTAGAT GCAGGAGAAT TAAGCCTGGT GGCACTGACG
CATCATATCA GTATTGCGCC GGGTAAAATG TTTTCTACCG GTGAAAACTG GTCACGTTTT
TTTCGTTTTA ATACCGCGTG GCAGTGGGGA GAACGTGAAG AACAGGCGGT AAAACAATTA
GGCAAACTTA TTCAAGAACG GCTGTAA
 
Protein sequence
MKKYQQLAEQ LREQIASGIW QPGDRLPSLR DQVALSGMSF MTVSHAYQLL ESQGYIIARP 
QSGYYVAPQA IKMPKAPVIP VTRDEAVDIN TYIFDMLQAS RDPSVVPFAS AFPDPRLFPL
QQLNRSLAQV SKTATAMSVI ENLPPGNAEL RQAIARRYAL QGITISPDEI AITAGALEAL
NLSLQVVTEP GDWVIVENPC FYGALQALER LRLKALSVAT DVKEGIDLQA LELALQEYPV
KACWLMTNSQ NPLGFTLTPQ KKAQLVALLN QYNVTLIEDD VYSELYFGRE KPLPAKAWDH
HDGVLHCSSF SKCLVPGFRI GWVAAGKHAR KIQRLQLMST LSTSSPMQLA LVDYLSTRRY
DAHLRRLRRQ LAERKQRAWQ ALLRYLPAEV KIHHNDSGYF LWLELPEPLD AGELSLVALT
HHISIAPGKM FSTGENWSRF FRFNTAWQWG EREEQAVKQL GKLIQERL
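
The gene and protein sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch, not the database's own method, of how such blocks could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which, as far as we know, has the same amino-acid assignments as translation table 11 (the two differ in permitted start codons).

```python
from itertools import product

# Standard amino-acid assignments, with codons ordered T, C, A, G at each position.
# Assumption: translation table 11 uses these same assignments.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
fragment = "ATGAAAAAAT ACCAGCAGCT TGCAGAACAA"
print(translate(fragment))
print(round(gc_fraction(fragment), 3))
```

Passing the complete gene sequence to `translate` and `gc_fraction` allows the result to be compared with the listed protein sequence and the GC content field.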