Gene EcSMS35_4886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4886 
Symbol 
ID6144120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5003289 
End bp5004701 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content55% 
IMG OID641619690 
ProductGntR family transcriptional regulator 
Protein accessionYP_001746797 
Protein GI170680887 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.344602 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.838171 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGTT ATCAACATCT GGCGACTTTG CTTGCCGAGC GGATTGAGCA AGGGCTGTAT 
CGTCACGGGG AGAAATTGCC GTCGGTGCGC AGCTTAAGTC AGGAGCACGG CGTCAGCATC
AGTACCGTGC AGCAGGCGTA CCAGACGCTG GAAACGATGA AGCTCATCAC TCCGCAGCCG
CGTTCGGGTT ATTTTGTCGC ACAACGTAAA GCCCAGCCGC CAGTACCGCC GATGACGCGT
CCGGTGCAGC GCCCGGTGGA AATTACCCAG TGGGATCAGG TGCTGGATAT GCTGGAGGCG
CATAGCGACA GTTCCATTGT TCCGTTAAGT AAAAGCACGC CGGATGTCGA AACGCCCAGC
CTGAAACCGC TGTGGCGGGA GCTAAGCCGG GTGGTGCAGC ATAATCTGCA AACCGTTCTC
GGTTATGACT TGTTAGCCGG CCAGCGGGTA TTGCGCGAGC AGATTGCCCG CCTGATGCTC
GACAGCGGCT CGGTGGTCAC CGCCGATGAC ATCATCATCA CCAGCGGCTG CCATAACTCG
ATGTCGCTGG CGTTGATGGC AGTGTGTAAA CCGGGCGATA TTGTCGCGGT CGAATCCCCC
TGTTATTACG GTTCGATGCA GATGCTGCGC GGCATGGGCG TGAAAGTGAT TGAAATCCCC
ACCGATCCAG AAACAGGCAT CAGCGTTGAA GCGCTGGAAC TGGCGCTGGA GCAGTGGCCG
ATTAAAGGCA TCATTCTGGT GCCAAACTGT AATAATCCGC TGGGATTTAT TATGCCGGAC
GCGCGCAAAC GGGCCGTTCT CTCTCTCGCT CAGCGTCATG ATATTGTGAT TTTTGAAGAT
GATGTCTACG GCGAACTGGC GACGGAGTAT CCGCGCCCGC GGACCATTCA TTCCTGGGAT
ATCGACGGGC GAGTGCTGTT GTGCAGCTCG TTCAGTAAAA GTATTGCTCC AGGCCTGCGC
GTGGGTTGGG TCGCACCGGG GCGCTATCAC GATAAACTGC TGCATATGAA ATATGCCATC
AGCAGCTTTA ATGTGCCGTC CACGCAAATG GCGGCGGCAA CGTTTGTGCT GGAAGGTCAC
TATCATCGCC ATATCCGGCG GATGCGGCAG ATCTATCAGC GCAATTTGGC GCTTTATACC
TGCTGGATAC GGGAATATTT TCCCTGCGAA ATCTGTATTA CGCGCCCGAA AGGCGGATTT
TTACTGTGGA TCGAATTGCC TGAACAGGTC GATATGGTTT GCGTTGCGCG GCAGCTATAC
CGCATGAAAA TCCAGGTGGC GGCAGGCTCG ATTTTCTCAG CTTCCGGCAA ATACCGTAAT
TGTCTGCGCA TCAACTGCGC TTTGCCGCTC AGCGAAACCT ATCGCGAAGC ACTGAAGCAA
ATTGGCGAAG CCGTGTATCG GGCAATGGAA TGA
 
Protein sequence
MTRYQHLATL LAERIEQGLY RHGEKLPSVR SLSQEHGVSI STVQQAYQTL ETMKLITPQP 
RSGYFVAQRK AQPPVPPMTR PVQRPVEITQ WDQVLDMLEA HSDSSIVPLS KSTPDVETPS
LKPLWRELSR VVQHNLQTVL GYDLLAGQRV LREQIARLML DSGSVVTADD IIITSGCHNS
MSLALMAVCK PGDIVAVESP CYYGSMQMLR GMGVKVIEIP TDPETGISVE ALELALEQWP
IKGIILVPNC NNPLGFIMPD ARKRAVLSLA QRHDIVIFED DVYGELATEY PRPRTIHSWD
IDGRVLLCSS FSKSIAPGLR VGWVAPGRYH DKLLHMKYAI SSFNVPSTQM AAATFVLEGH
YHRHIRRMRQ IYQRNLALYT CWIREYFPCE ICITRPKGGF LLWIELPEQV DMVCVARQLY
RMKIQVAAGS IFSASGKYRN CLRINCALPL SETYREALKQ IGEAVYRAME