Gene EcSMS35_2793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2793 
Symbol 
ID6143066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2874146 
End bp2875480 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content53% 
IMG OID641617662 
ProductGntR family transcriptional regulator 
Protein accessionYP_001744822 
Protein GI170681638 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.102841 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCT ATCAGGATAT CGCCCGTCAG TTAAAAACGG CTATTGAGCA AGGGGAACTG 
AAACCCGGCG CAAGGCTGCC TTCAAGTCGT ACCTGGTCGC AGGAGCTGGG GGTGTCTCGA
TCCACTGTCG AAAATGCGTA TGCTGAGCTG GTGGCGCAAG GATGGCTGAT CAGGCGGGGA
CAGGCTGGCA CATTTGTCAG TGAGCGGATA TATCCGCAAC AATCTACTGT ACAAGTTGTA
GCTTTTGCCG GTGAAAGTCA GCAACCGTTG CCATTTCAAA TGGGATTACC CGCACTCGAT
CTCTTTCCGC GAGAGCTGTG GGCGCGGGTG ATGGGGCGTC GCCTTCGTAC CCAGACGCGC
TTTGATTTGG CGTTAGGCGA TGTCTGCGGT GAGGCGGCGT TGCGCGAGGC AATAGTTGAT
TACTTACGCG TTTCACGTGG GATTGATTGT CAGCCAGAGC AGGTCTTTAT CACTCACGGT
TATGCGGCCT CAATAGCTTT AATTCTGCAC GCGCTGGCGA AACCGGGAAA CGGGATGTGG
ATAGAAGATC CCGGCTTTCC ACTGATTCGC CCGATTGTCA CTCGCCATGA TGTGGAAATT
TTGCCTGTGC CGGTTGATGA CAATGGACTG GATATCACAA GCGGAATACA AAATTATCCT
GATGCGCGTT TTGCCCTGAT AACACCAGCA CACCAAAGTC CGCTGGGTGT GGCGCTCTCT
TTAGCGCGCA GGCATCAGAT ACTGGAATGG GCAGATCGTA GTCAGGCATG GATTATTGAA
GATGATTATG ACAGTGAGTT TCGCTATCAC GGTAAGCCGT TACCGGCGTT GAAAAGTCTC
GACGCACCGC AGCGGGTAAT TTATGCCGGA ACATTCAGCA AAGCGCTATT TCCTGCATTG
CGCTGTGCGT GGCTGGTGGT GCCGGTGAAG CAAATTGCAC AATTCCGCCA CCAGGCGTCA
CTGGCTCCAT GTGCGGTACC TGTTCTATGG CAGAACACAC TGGCAGACTT TCTTCGCGAG
GGGCATTTCT GGCGGCATCT GAAGAAAATG CGTCAGCATT ATGCTCAACG TCGGCAATGG
ATTGAGCAAG CGTTGACGCA GCAAGGATTT CAGGTTGTGC CGCAGAAAGG TGGTATCCAG
ATGGTGATCA GAATGATAGG CGATGATATT GCCCATGCGC GTAAAGCCAA TGCTGCAGGC
CTTGCGGTGC AGGCACTTAG CGACTGGCGT ATCCGTTCAA GTGGGGAAGG TGGATTACTG
CTTTCGTTTA CTAATATCGT TAACGAAGGT ATGGCGCGAC AGGTAGCACA ACAATTACGT
AAAGCCTTAA GCTAA
 
Protein sequence
MPRYQDIARQ LKTAIEQGEL KPGARLPSSR TWSQELGVSR STVENAYAEL VAQGWLIRRG 
QAGTFVSERI YPQQSTVQVV AFAGESQQPL PFQMGLPALD LFPRELWARV MGRRLRTQTR
FDLALGDVCG EAALREAIVD YLRVSRGIDC QPEQVFITHG YAASIALILH ALAKPGNGMW
IEDPGFPLIR PIVTRHDVEI LPVPVDDNGL DITSGIQNYP DARFALITPA HQSPLGVALS
LARRHQILEW ADRSQAWIIE DDYDSEFRYH GKPLPALKSL DAPQRVIYAG TFSKALFPAL
RCAWLVVPVK QIAQFRHQAS LAPCAVPVLW QNTLADFLRE GHFWRHLKKM RQHYAQRRQW
IEQALTQQGF QVVPQKGGIQ MVIRMIGDDI AHARKANAAG LAVQALSDWR IRSSGEGGLL
LSFTNIVNEG MARQVAQQLR KALS