Gene EcSMS35_4722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4722 
SymboltreR 
ID6147153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4821519 
End bp4822466 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content54% 
IMG OID641619538 
Producttrehalose repressor 
Protein accessionYP_001746646 
Protein GI170680573 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID[TIGR02405] trehalose operon repressor, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.557711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAATC GGCTGACCAT CAAAGACATC GCGCGCTTAA GCGGCGTGGG GAAATCTACG 
GTTTCCCGGG TGCTGAATAA CGAAAGCGGC GTGAGCCAGC GCACGCGCGA GCGTGTTGAA
GCAGTGATGA ATCAGCATGG ATTTTCCCCT TCCCGCTCTG CGCGCGCTAT GCGTGGGCAA
AGCGATAAAG TGGTCGCCAT CATTGTTACC CGTCTGGATT CGTTGTCAGA AAATCTCGCC
GTTCAAACCA TGCTGCCAGC GTTCTATGAA CAAGGTTACG ACCCAATCAT GATGGAAAGT
CAGTTTTCCC CGCAATTAGT TGCCGAACAT TTGGGGGTCC TGAAACGGCG TAATATCGAC
GGCGTAGTGC TGTTCGGTTT TACCGGCATA ACAGAAGAAA TGTTAGCCCA CTGGCAGTCA
TCGCTGGTTC TGCTGGCGCG TGACGCAAAA GGCTTTGCTT CAGTCTGTTA TGACGACGAA
GGGGCAATCA AGATCCTGAT GCAACGGCTG TATGACCAGG GGCATCGTAA TATCAGTTAT
CTCGGCGTGC CGCATAGTGA CGTGACGACC GGTAAGCGAC GTCACGAAGC CTACCTGGCG
TTCTGCAAAG CGCATAAATT GCATCCCGTT GCCGCTCTGC CAGGGCTTGC TATGAAGCAA
GGCTATGAGA ACGTAGCAAA AGTGATTACG CCTGAAACTA CCGCGTTACT GTGCGCAACC
GACACGCTGG CACTTGGCGC AAGTAAATAC CTGCAAGAGC AACGCATCGA CACCTTGCAA
CTGGCGAGCG TCGGTAATAC ACCGTTAATG AAATTCCTCC ATCCGGAGAT CGTAACCGTA
GATCCCGGTT ACGCCGAAGC TGGACGCCAG GCGGCCTGCC AGTTGATCGC GCAGGTCACC
GGGCGCAGCG AACCGCAACA AATCATCATC CCCGCCACCC TGTCCTGA
 
Protein sequence
MQNRLTIKDI ARLSGVGKST VSRVLNNESG VSQRTRERVE AVMNQHGFSP SRSARAMRGQ 
SDKVVAIIVT RLDSLSENLA VQTMLPAFYE QGYDPIMMES QFSPQLVAEH LGVLKRRNID
GVVLFGFTGI TEEMLAHWQS SLVLLARDAK GFASVCYDDE GAIKILMQRL YDQGHRNISY
LGVPHSDVTT GKRRHEAYLA FCKAHKLHPV AALPGLAMKQ GYENVAKVIT PETTALLCAT
DTLALGASKY LQEQRIDTLQ LASVGNTPLM KFLHPEIVTV DPGYAEAGRQ AACQLIAQVT
GRSEPQQIII PATLS