Gene EcSMS35_0484 details

Gene Information

Locus tag: EcSMS35_0484
Symbol: ppiD
ID: 6145900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 490217
End bp: 492088
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 52%
IMG OID: 641615378
Product: peptidyl-prolyl cis-trans isomerase (rotamase D)
Protein accession: YP_001742585
Protein GI: 170681699
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0760] Parvulin-like peptidyl-prolyl isomerase
TIGRFAM ID:
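
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 490217
end_bp = 492088

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; a complete CDS should leave no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is the stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1872 0 623
```

Under these assumptions the result agrees with the listed gene length (1872 bp) and protein length (623 aa).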


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00718349
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGACA GCTTACGCAC GGCTGCAAAC AGTCTCGTGC TCAAGATTAT TTTCGGTATC 
ATTATCGTGT CGTTCATATT GACCGGCGTG AGTGGTTACC TGATTGGCGG AGGCAATAAC
TACGCCGCAA AAGTGAATGA CCAGGAAATC AGCCGTGGGC AGTTCGAGAA TGCCTTCAAC
AGCGAGCGTA ATCGCATGCA GCAACAGCTG GGCGATCAAT ACTCTGAGCT TGCAGCGAAC
GAAGGCTATA TGAAAACCCT GCGTCAACAG GTGCTGAATC GTCTGATCGA CGAGGCGCTT
CTGGATCAGT ACGCTCGTGA GCTGAAACTG GGTATCAGTG ATGAGCAGGT TAAACAGGCG
ATTTTCGCGA CCCCAGCCTT CCAGGTTGAC GGCAAATTTG ATAACAGCCG CTATAACGGT
ATCCTCAACC AGATGGGGAT GACCGCCGAT CAGTACGCCC AGGCGCTACG TAACCAGCTC
ACTACCCAAC AGCTGATTAA CGGCGTTGCC GGTACCGATT TTATGCTGAA AGGTGAAACC
GACGAGCTGG CGGCACTGGT AGCTCAACAA CGCGTGGTAC GTGAAGCGAC TATCGATGTT
AACGCGCTGG CGGCGAAGCA GCCTGTGACC GAACAGGAAA TTGCCAGCTA CTACGAACAA
AACAAAAACA ATTTCATGAC GCCGGAACAA TTCCGCGTGA GTTACATCAA GCTGGATGCC
GCAACGATGC AGCAACCGGT TAGCGATGCG GATATCCAGA GCTACTACGA TCAGCATCAG
GATCAATTCA CCCAGCCGCA GCGTACCCGC TACAGCATCA TCCAGACCAA AACTGAAGAT
GAAGCGAAAG CGGTACTTGA TGAGCTGAAT AAAGGCGGTG ATTTTGCTGC GTTAGCCAAA
GAAAAATCTG CCGATATTAT CTCTGCCCGT AACGGCGGCG ATATGGGTTG GTTAGAAGAT
GCCACTATCC CGGACGAACT GAAAAATGCT GGTCTGAAAG AAAAAGGCCA ACTCTCTGGT
GTCATCAAAT CTTCGGTCGG TTTCCTGATT GTACGTCTGG ACGACATTCA GCCAGCGAAA
GTGAAGTCGT TAGACGAAGT GCGTGACGAT GTCGCGGCGA AAGTGAAACA CGAAAAAGCC
CTCGATGCGT ACTACGCACT GCAGCAGAAA GTGAGCGATG CAGCAAGCAA TGATACCGAG
TCTCTGGCCG GTGCAGAGCA AGCTGCCGGC GTTAAAGCCA CTCAGACGGG TTGGTTCAGC
AAAGATAACC TGCCGGAAGA GTTGAACTTC AAGCCGGTTG CTGACGCTAT CTTCAACGGC
GGTCTGGTAG GTGAAAACGG TGCGCCGGGC ATCAACTCTG ACATCATCAC CGTAGACGGC
GACCGCGCAT TCGTGCTGCG CATCAGCGAG CACAAACCGG AAGCGGTGAA ACCGTTGGCA
GATGTTCAGG AACAAGTTAA GGCACTGGTT CAGCACAACA AAGCTGAACA ACAGGCGAAA
GTGGATGCTG AGAAACTGCT GGTTGATTTG AAAGCCGGCA AAGGTGCGGA AGCTATGCAG
GCTGCCGGTC TGAAATTTGG CGAGCCGAAA ACCTTAAGCC GTTCCGGTCG TGACCCGATT
AGCCAGGCGG CGTTTGCACT GCCACTGCCA GCGAAAGACA AACCGAGCTA CGGTATGGCG
ACCGATATGC AAGGTAATGT GGTTCTGCTG GCGCTGGACG AAGTGAAACA AGGTTCAATG
CCGGAAGATC AGAAAAAAGC GATGGTGCAG GGTATCACCC AGAACAACGC ACAAATCGTC
TTTGAAGCGC TGATGAGTAA CCTGCGTAAA GAGGCGAAAA TCAAAATTGG CGATGCGCTG
GAACAGCAAT AA
 
Protein sequence
MMDSLRTAAN SLVLKIIFGI IIVSFILTGV SGYLIGGGNN YAAKVNDQEI SRGQFENAFN 
SERNRMQQQL GDQYSELAAN EGYMKTLRQQ VLNRLIDEAL LDQYARELKL GISDEQVKQA
IFATPAFQVD GKFDNSRYNG ILNQMGMTAD QYAQALRNQL TTQQLINGVA GTDFMLKGET
DELAALVAQQ RVVREATIDV NALAAKQPVT EQEIASYYEQ NKNNFMTPEQ FRVSYIKLDA
ATMQQPVSDA DIQSYYDQHQ DQFTQPQRTR YSIIQTKTED EAKAVLDELN KGGDFAALAK
EKSADIISAR NGGDMGWLED ATIPDELKNA GLKEKGQLSG VIKSSVGFLI VRLDDIQPAK
VKSLDEVRDD VAAKVKHEKA LDAYYALQQK VSDAASNDTE SLAGAEQAAG VKATQTGWFS
KDNLPEELNF KPVADAIFNG GLVGENGAPG INSDIITVDG DRAFVLRISE HKPEAVKPLA
DVQEQVKALV QHNKAEQQAK VDAEKLLVDL KAGKGAEAMQ AAGLKFGEPK TLSRSGRDPI
SQAAFALPLP AKDKPSYGMA TDMQGNVVLL ALDEVKQGSM PEDQKKAMVQ GITQNNAQIV
FEALMSNLRK EAKIKIGDAL EQQ
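
The gene and protein sequences above can be cross-checked by translating the DNA codon by codon. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Codon table built in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final row of the gene sequence above.
first_blocks = "ATGATGGACA GCTTACGCAC GGCTGCAAAC"
last_row = "GAACAGCAAT AA"

print(translate(first_blocks))  # MMDSLRTAAN
print(translate(last_row))      # EQQ
```

The two fragments translate to the first ten and last three residues of the protein sequence shown above. Passing the full gene sequence to `translate` and `gc_percent` would be the way to compare against the listed 623 aa length and 52% GC content; that full comparison is not carried out here.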