Gene EcSMS35_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0074 
SymbolsgrR 
ID6145976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp83496 
End bp85154 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content56% 
IMG OID641614975 
Producttranscriptional regulator SgrR 
Protein accessionYP_001742191 
Protein GI170681588 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.492998 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCTG CTCGTCTGCA ACAACAGTTC ATCCGCCTGT GGCAATGCTG CGAGGGTAAA 
TCGCAGGACA CAACGCTCAA CGAACTGGCA GCGTTATTGA GCTGCTCGCG TCGTCATATG
CGCACCCTGC TCAACACCAT GCAGGATCGC GGCTGGCTGA CGTGGGAAGC GGAAGTCGGG
CGCGGTAAAC GCTCGCGTCT GACATTCCTC TATACCGGGC TGGCGCTTCA GCAACAGCGG
GCGGAAGACC TGCTGGAGCA GGATCGTATC GATCAACTGG TGCAACTGGT TGGCGACAAA
GCGACTGTGC GGCAAATGCT GGTTTCTCAT CTGGGCCGCA GCTTCCGCCA GGGGCGACAC
ATCCTGCGTG TGCTCTACTA TCGTCCGTTG CGTAATCTGC TACCTGGCAG CGCATTACGC
CGTTCCGAAA CCCATATCGC CCGGCAAATC TTCAGTTCGC TAACGCGCAT AAATGAGGAA
AATGGGGAAC TGGAAGCAGA CATCGCCCAC CACTGGCAGC AAATATCACC GCTTCACTGG
CGTTTCTTTT TGCGTCCAGG AGTCCATTTT CATCATGGTC GTGAACTGGA AATGGACGAT
GTGATCGCCT CTTTAAAACG GATCAATACG CTGCCGCTCT ATTCGCATAT TGCTGAAATT
GTATCGCCGA CGCCCTGGAC GCTGGATATC CACCTCACGC AGCCGGATCG CTGGTTGCCG
TTACTGCTGG GGCAAGTTCC GGCGATGATC CTGCCGCGCG AATGGGAAAC CCTCAGTAAC
TTTGCCAGCC ATCCCATCGG CACCGGTCCG TATGCGGTGA TCCGCAACAG CACCAATCAA
CTGAAAATTC AGGCATTCGA TGACTTCTTC GGTTACCGGG CATTAATCGA CGAAGTTAAC
GTCTGGGTTC TGCCGGAAAT TGCCGACGAG CCAGCCGGAG GGCTGATGCT AAAAGGGCCA
CAGGGCGAGG AAAAAGAGAT TGAAAGCCGC CTGGAGGAAG GTTGCTACTA TTTACTGTTC
GACAGCCGCA CCCATCGCGG GGCGAATCAG CAAGTCAGGG ACTGGGTAAG CTATGTGCTT
TCTCCAACTA ATCTGGTCTA TTTCGCTGAG GAACAGTACC AGCAACTGTG GTTCCCGGCT
TATGGACTGC TCCCCCGTTG GCACCATGCT CGTCCGACAC ATTGCGAAAA ACCCGCCGGG
CTGGAAAGCC TCACCCTGAC CTTTTATCAG GATCATATTG AGCATCGAGT GATTGCCGGG
ATCATGCAGC AGATTCTGGC AAGTCACCAG GTCACGCTGG AAATCAAAGA GATCAGCTAC
GATCAGTGGC ATGAAGGAGA GATCGAGAGC GATATCTGGC TTAACAGCGC CAACTTTACG
CTGCCGCTGG ATTTTTCGCT GTTCGCGCAC TTGTGCGAGG TGCCGCTGCT CCAACACTGT
CTTCCGATCG ACTGGCAAGC CGACGCCGCC CGCTGGCGCA ATGGCGAAAT GAACCTGGCG
AACTGGTGCC AGCAACTGGT CGCCAGCAAA GCAATGGTGC CGCTTATCCA CCACTGGCTG
ATCATTCAGG GGCAACGCAG TATGCGCGGT CTACGTATGA ACACCCTCGG CTGGTTCGAT
TTTAAATCAG CGTGGTTTGC GCCGCCGGAT CCAGAGTAG
 
Protein sequence
MPSARLQQQF IRLWQCCEGK SQDTTLNELA ALLSCSRRHM RTLLNTMQDR GWLTWEAEVG 
RGKRSRLTFL YTGLALQQQR AEDLLEQDRI DQLVQLVGDK ATVRQMLVSH LGRSFRQGRH
ILRVLYYRPL RNLLPGSALR RSETHIARQI FSSLTRINEE NGELEADIAH HWQQISPLHW
RFFLRPGVHF HHGRELEMDD VIASLKRINT LPLYSHIAEI VSPTPWTLDI HLTQPDRWLP
LLLGQVPAMI LPREWETLSN FASHPIGTGP YAVIRNSTNQ LKIQAFDDFF GYRALIDEVN
VWVLPEIADE PAGGLMLKGP QGEEKEIESR LEEGCYYLLF DSRTHRGANQ QVRDWVSYVL
SPTNLVYFAE EQYQQLWFPA YGLLPRWHHA RPTHCEKPAG LESLTLTFYQ DHIEHRVIAG
IMQQILASHQ VTLEIKEISY DQWHEGEIES DIWLNSANFT LPLDFSLFAH LCEVPLLQHC
LPIDWQADAA RWRNGEMNLA NWCQQLVASK AMVPLIHHWL IIQGQRSMRG LRMNTLGWFD
FKSAWFAPPD PE