Gene EcSMS35_0346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0346 
Symbol 
ID6144396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp357618 
End bp358706 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content46% 
IMG OID641615242 
ProductLuxR-family transcriptional regulator 
Protein accessionYP_001742450 
Protein GI170679658 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCAT GTGATTTTCG TGTTTTTCTG CAAGAGTTCG GTACAACGGT TCATTTGTCA 
TTGCCTGGTA GCGTATCCGA GAAAGAACGA CTGCTACTCA AGCTGCTGAT GCAGGGAATG
TCTGTAACAG AAATATCACA GTACAGAAAT CGCAGTGCAA AGACCATTTC ACATCAAAAG
AAACAGCTCT TTGAGAAACT GGGGATTCAG AGCGATATTA CTTTCTGGCG GGATATTTTC
TTTCAGTACA ATCCGGAGAT CATATCCGCC ACGGGTAATA ATAGTCACAA ATATATTAAT
GATAATCACT ATCACCATAT CGTCACGCCT GAAGCCATCA GTCTGGCGTT GGAAAACCAT
GAATTTAAAC CGTGGATCCA ACCGGTTTTC TGCGCGCAGA CTGGGGTACT GACGGGCTGT
GAGGTGCTTG TCCGCTGGGA ACATCCACAA ACGGGAATTA TCCCACCGGA TCAGTTTATT
CCTCTGGCGG AGTCATCTGG TCTTATCGTC ATAATGACTC GCCAGTTGAT GAAACAGACT
GCGGATATTC TGATGCCGGT AAAACATTTG CTGCCGGACA ATTTCCATAT TGGCATCAAC
GTCTCGGCGG GTTGTTTTTT GGCCGCAGGA TTTGAAAAAG AGTGTCTGAA CCTGGTTAAG
AAATTAGGTA ACGATAAAAT CAAACTGGTT CTTGAGCTGA CGGAACGTAA CCCTATTCCG
GTAACGCCAG AAGCCAGAGC GATATTTGAC AGCCTTCATC AGCACAACAT TACCTTTGCG
CTGGATGACT TTGGTACGGG TTATGCGACC TATCGTTACT TGCAGGCGTT CCCGGTCGAT
TTTATTAAGA TCGATAAGTC ATTTGTGCAA ATGGCGAGCG TGGACGAAAT ATCCGGTCAT
ATTGTGGACA ATATTGTCGA ACTGGCGCGT AAGCCTGGTC TGAGTATCGT GGCGGAAGGG
GTAGAAACCC AGGAGCAGGC GGATTTAATG ATCGGCAAAG GAGTTCACTT TTTGCAGGGC
TATTTGTACT CTCCGCCAGT ACCGGGTAAT AAATTTATCT CTGAATGGGT AATGAAAGCA
GGTGGTTGA
 
Protein sequence
MNSCDFRVFL QEFGTTVHLS LPGSVSEKER LLLKLLMQGM SVTEISQYRN RSAKTISHQK 
KQLFEKLGIQ SDITFWRDIF FQYNPEIISA TGNNSHKYIN DNHYHHIVTP EAISLALENH
EFKPWIQPVF CAQTGVLTGC EVLVRWEHPQ TGIIPPDQFI PLAESSGLIV IMTRQLMKQT
ADILMPVKHL LPDNFHIGIN VSAGCFLAAG FEKECLNLVK KLGNDKIKLV LELTERNPIP
VTPEARAIFD SLHQHNITFA LDDFGTGYAT YRYLQAFPVD FIKIDKSFVQ MASVDEISGH
IVDNIVELAR KPGLSIVAEG VETQEQADLM IGKGVHFLQG YLYSPPVPGN KFISEWVMKA
GG