Gene EcSMS35_4133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4133 
Symbol 
ID6146654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4228237 
End bp4229757 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content51% 
IMG OID641618956 
Productputative ATP-dependent protease 
Protein accessionYP_001746088 
Protein GI170680083 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0606] Predicted ATPase with chaperone activity 
TIGRFAM ID[TIGR00368] Mg chelatase-related protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.815544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.463834 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTGT CAATTGTTCA TACCCGCGCA GCCCTGGGAG TAAATGCGCC CCCAATCACT 
GTTGAGGTAC ATATCAGTAA AGGTCTACCC GGCTTAACGA TGGTGGGCTT ACCAGAAACA
ACGGTAAAAG AAGCCCGCGA TCGCGTGCGC AGCGCCATTA TCAATAGCGG ATATGAATAT
CCGGCGAAAA AAATCACCAT CAACCTTGCT CCAGCCGATC TGCCAAAAGA AGGGGGACGA
TATGATTTAC CTATCGCCAT TGCGTTGCTG GCAGCCTCAG AACAGCTTAC AGCCAATAAG
TTAGATGAAT ATGAATTAGT CGGAGAACTG GCGCTTACAG GCGCTCTGCG TGGCGTTCCC
GGCGCAATCT CCAGTGCAAC TGAAGCCATT AAGTCGGGCA GAAAAATTAT CGTCGCGAAA
GATAACGAGG ATGAAGTGGG GCTAATTAAC GGTGAAGGAT GCCTGATAGC CGATCATCTG
CAGGCTGTCT GTGCGTTTCT GGAGGGTAAG CACGCTCTCG AACGCCCGAA ACCAACTGAT
GCAGTATCCC GGGCGCTACA ACATGATCTC AGTGATGTTG TCGGTCAGGA GCAAGGAAAG
CGAGGACTGG AAATTACCGC CGCTGGCGGG CACAACCTTT TACTGATTGG ACCGCCGGGA
ACAGGTAAAA CAATGCTCGC CAGCCGTATT AATGGCCTTT TGCCAGATTT AAGCAATGAA
GAGGCACTGG AGAGCGCTGC GATATTAAGT CTGGTAAATG CTGAATCAGT ACAAAAACAA
TGGCGGCAGC GCCCGTTCCG CTCACCTCAT CACAGTGCGT CGTTAACTGC GATGGTGGGC
GGTGGTGCAA TTCCAGGGCC AGGTGAAATT TCGCTGGCGC ATAACGGCGT GCTTTTTCTT
GATGAGCTAC CTGAATTTGA ACGGCGTACA CTGGATGCCT TGCGAGAGCC GATTGAATCC
GGGCAGATCC ATCTTTCACG CACACGAGCA AAAATAACCT ATCCAGCCCG TTTCCAGCTT
GTCGCGGCGA TGAATCCCAG CCCTACCGGA CATTATCAGG GAAACCATAA CCGCTGCACA
CCAGAACAGA CATTACGTTA TCTCAACCGG CTCTCGGGGC CCTTTCTCGA CCGCTTCGAT
CTCTCACTAG AGATCCCATT ACCGCCCCCC GGCATTTTGA GTAAAACGGT AGTGCAGGGA
GAAAACAGCA CCACCGTTAA ACAACGTGTA ATGGCCGCCA GAGAGCGCCA ATTTAAGCGG
CAGAATAAAC TGAACGCCTG GCTGGATAAT CCGGAAATAC GCCAATTCTG CAAGCTTGAG
AGCGAAGATG CGCAGTGGCT GGAAGAAACG CTGATCCATC TGGGGTTATC GATTCGTGCC
TGGCAGCGGT TATTGAAAGT TGCACGAACC ATTGCTGATA TTGATCAGTC TGACATTATC
ACACGTCAGC ATTTGCAGGA GGCAGTTAGC TATCGTGCGA TTGACCGTTT GCTCATCCAT
CTGCAAAAAC TACTGACATA A
 
Protein sequence
MSLSIVHTRA ALGVNAPPIT VEVHISKGLP GLTMVGLPET TVKEARDRVR SAIINSGYEY 
PAKKITINLA PADLPKEGGR YDLPIAIALL AASEQLTANK LDEYELVGEL ALTGALRGVP
GAISSATEAI KSGRKIIVAK DNEDEVGLIN GEGCLIADHL QAVCAFLEGK HALERPKPTD
AVSRALQHDL SDVVGQEQGK RGLEITAAGG HNLLLIGPPG TGKTMLASRI NGLLPDLSNE
EALESAAILS LVNAESVQKQ WRQRPFRSPH HSASLTAMVG GGAIPGPGEI SLAHNGVLFL
DELPEFERRT LDALREPIES GQIHLSRTRA KITYPARFQL VAAMNPSPTG HYQGNHNRCT
PEQTLRYLNR LSGPFLDRFD LSLEIPLPPP GILSKTVVQG ENSTTVKQRV MAARERQFKR
QNKLNAWLDN PEIRQFCKLE SEDAQWLEET LIHLGLSIRA WQRLLKVART IADIDQSDII
TRQHLQEAVS YRAIDRLLIH LQKLLT