Gene EcSMS35_1819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1819 
SymbolpspF 
ID6145159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1837937 
End bp1838914 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content52% 
IMG OID641616695 
Productphage shock protein operon transcriptional activator 
Protein accessionYP_001743873 
Protein GI170684159 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1221] Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain 
TIGRFAM ID[TIGR02974] psp operon transcriptional activator PspF 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.235357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAAT ACAAAGATAA TTTACTTGGT GAGGCGAATA GCTTTCTCGA AGTGCTGGAA 
CAGGTTTCGC ATCTCGCACC ACTGGATAAA CCGGTGCTCA TCATCGGCGA ACGCGGCACC
GGTAAAGAGT TGATTGCCAG CCGCCTGCAT TATCTCTCCT CCCGTTGGCA AGGGCCGTTT
ATTTCCCTTA ACTGCGCGGC GTTAAATGAA AATCTGCTGG ATTCCGAACT GTTTGGTCAC
GAAGCGGGGG CGTTTACCGG TGCGCAAAAA CGTCATCCTG GGAGATTTGA ACGTGCCGAT
GGCGGTACGC TATTTCTTGA TGAACTCGCT ACGGCACCGA TGATGGTGCA GGAAAAATTA
TTGCGTGTGA TTGAGTACGG TGAACTGGAG CGCGTTGGCG GTAGTCAGCC ATTGCAGGTG
AATGTGCGGT TGGTATGCGC GACGAATGCC GATCTCCCGG CGATGGTCAA TGAAGGTACT
TTTCGCGCTG ACCTGCTCGA CCGGCTGGCT TTCGATGTAG TGCAACTGCC GCCCCTGCGC
GAGCGCGAAA GCGACATTAT GCTGATGGCA GAACACTTTG CCATTCAGAT GTGTCGGGAA
ATCAAGCTGC CTCTGTTCCC GGGTTTTACG GAGCGCGCCA GAGAAACATT GCTGAATTAT
CGCTGGCCGG GAAATATTCG TGAATTGAAA AACGTGGTGG AACGTTCAGT GTATCGCCAC
GGCACCAGCG ATTATCCGCT TGATGACATC ATTATTGATC CCTTTAAACG GCGTCCGCCT
GAAGACGCTA TCGCCGTTTC AGAAACCACC TCGCTTCCAA CACTGCCGCT GGATTTACGT
GAGTTTCAGA TGCAGCAGGA AAAAGAGTTG CTGCAACTCA GTTTGCAACA GGGGAAATAT
AACCAGAAGC GCGCGGCTGA ATTACTGGGG TTAACCTATC ATCAGTTCCG CGCGTTGTTG
AAAAAGCACC AGATTTAG
 
Protein sequence
MAEYKDNLLG EANSFLEVLE QVSHLAPLDK PVLIIGERGT GKELIASRLH YLSSRWQGPF 
ISLNCAALNE NLLDSELFGH EAGAFTGAQK RHPGRFERAD GGTLFLDELA TAPMMVQEKL
LRVIEYGELE RVGGSQPLQV NVRLVCATNA DLPAMVNEGT FRADLLDRLA FDVVQLPPLR
ERESDIMLMA EHFAIQMCRE IKLPLFPGFT ERARETLLNY RWPGNIRELK NVVERSVYRH
GTSDYPLDDI IIDPFKRRPP EDAIAVSETT SLPTLPLDLR EFQMQQEKEL LQLSLQQGKY
NQKRAAELLG LTYHQFRALL KKHQI