Gene EcolC_3191 details

Gene Information

Locus tag: EcolC_3191
Symbol:
ID: 6066612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3495859
End bp: 3497730
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 52%
IMG OID: 641602606
Product: peptidyl-prolyl cis-trans isomerase (rotamase D)
Protein accession: YP_001726140
Protein GI: 170021186
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0760] Parvulin-like peptidyl-prolyl isomerase
TIGRFAM ID:
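
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3495859
end_bp = 3497730
gene_length_bp = 1872
protein_length_aa = 623


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 3497730 - 3495859 + 1 gives 1872 bp, and 1872 / 3 gives 624 codons, one of which is the stop codon, leaving 623 residues.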


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.102497
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0107866
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGACA GCTTACGCAC GGCTGCAAAC AGTCTCGTGC TCAAGATTAT TTTCGGTATC 
ATTATCGTGT CGTTCATATT GACCGGCGTG AGTGGTTACC TGATTGGCGG AGGCAATAAC
TACGCCGCAA AAGTGAATGA CCAGGAAATC AGCCGTGGGC AATTCGAGAA CGCCTTCAAC
AGCGAGCGTA ATCGCATGCA GCAACAGCTG GGCGATCAAT ACTCCGAGCT GGCAGCGAAC
GAAGGCTATA TGAAAACCCT GCGTCAACAG GTGCTGAATC GTCTGATCGA CGAGGCGCTG
CTGGATCAGT ACGCACGTGA GCTGAAACTG GGTATCAGCG ATGAGCAGGT TAAACAGGCG
ATTTTCGCGA CCCCAGCCTT CCAGGTTGAC GGCAAATTTG ATAACAGCCG CTATAACGGT
ATCCTCAACC AGATGGGGAT GACCGCCGAT CAGTACGCCC AGGCGCTGCG TAACCAGCTC
ACTACCCAAC AGCTGATTAA CGGCGTTGCC GGTACCGATT TTATGCTGAA AGGTGAAACC
GACGAGCTGG CGGCACTGGT CGCGCAACAA CGCGTGGTGC GTGAGGCGAC TATCGATGTT
AACGCGCTGG CGGCGAAGCA GCCTGTGACC GAACAGGAGA TTGCCAGCTA CTACGAACAA
AACAAAAACA ATTTCATGAC GCCGGAACAA TTCCGCGTGA GTTACATCAA GCTGGATGCC
GCAACGATGC AGCAACCGGT TAGCGATGCG GATATCCAGA GCTACTACGA CCAGCATCAG
GATCAATTCA CCCAGCCGCA GCGTACCCGC TACAGCATCA TCCAGACCAA AACTGAAGAT
GAAGCGAAAG CGGTACTTGA TGAGCTGAAT AAAGGCGGTG ATTTTGCTGC ATTAGCCAAA
GAAAAATCTG CCGATATTAT CTCTGCTCGT AACGGCGGCG ATATGGGTTG GTTAGAAGAT
GCCACTATCC CGGACGAACT GAAAAATGCT GGTCTGAAAG AAAAAGGCCA ACTGTCTGGT
GTCATCAAAT CTTCGGTCGG TTTCCTGATT GTACGTCTGG ACGACATTCA GCCAGCGAAA
GTGAAATCGT TAGACGAAGT ACGTGACGAC ATTGCGGCGA AAGTGAAACA CGAAAAAGCC
CTCGATGCGT ACTACGCGCT GCAGCAGAAA GTGAGCGATG CGGCAAGCAA CGACACCGAG
TCTCTGGCCG GTGCAGAGCA AGCTGCCGGC GTTAAAGCCA CTCAGACGGG TTGGTTCAGC
AAAGATAACC TGCCGGAAGA GTTGAACTTC AAGCCGGTTG CCGACGCTAT CTTTAACGGC
GGTCTGGTAG GTGAAAACGG CGCGCCGGGC ATCAACTCTG ACATCATCAC CGTAGACGGC
GACCGCGCAT TCGTGCTGCG CATCAGCGAG CACAAACCGG AAGCGGTGAA ACCGTTGGCA
GATGTTCAGG AACAAGTTAA GGCACTGGTT CAGCACAACA AAGCTGAACA ACAGGCGAAA
GTGGATGCTG AGAAACTGCT GGTTGATTTG AAAGCCGGCA AAGGTGCGGA AGCTATGCAG
GCTGCCGGTC TGAAATTTGG AGAGCCGAAA ACCTTAAGCC GTTCCGGTCG TGACCCGATT
AGCCAGGCGG CGTTTGCACT GCCACTGCCA GCGAAAGACA AACCGAGCTA CGGTATGGCG
ACCGATATGC AAGGCAATGT GGTTCTGCTG GCGCTGGATG AAGTGAAACA AGGTTCAATG
CCGGAAGATC AGAAAAAAGC GATGGTGCAG GGTATCACCC AGAACAACGC ACAAATCGTC
TTTGAAGCTC TGATGAGTAA CCTGCGTAAA GAGGCGAAAA TCAAAATTGG CGATGCGCTG
GAACAGCAAT AA
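
The sequence above is grouped in blocks of ten bases, so the whitespace has to be stripped before any calculation. As a rough sketch of how a GC content figure like the one in the Gene Information fields could be recomputed, the helper names below are hypothetical, and the example input is only the first two blocks of the gene, not the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Illustration only: the first two ten-base blocks of the gene sequence.
first_blocks = "ATGATGGACA GCTTACGCAC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

To compare against the value reported in this record, the full gene sequence would be passed to `clean_sequence` in place of `first_blocks`.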
 
Protein sequence
MMDSLRTAAN SLVLKIIFGI IIVSFILTGV SGYLIGGGNN YAAKVNDQEI SRGQFENAFN 
SERNRMQQQL GDQYSELAAN EGYMKTLRQQ VLNRLIDEAL LDQYARELKL GISDEQVKQA
IFATPAFQVD GKFDNSRYNG ILNQMGMTAD QYAQALRNQL TTQQLINGVA GTDFMLKGET
DELAALVAQQ RVVREATIDV NALAAKQPVT EQEIASYYEQ NKNNFMTPEQ FRVSYIKLDA
ATMQQPVSDA DIQSYYDQHQ DQFTQPQRTR YSIIQTKTED EAKAVLDELN KGGDFAALAK
EKSADIISAR NGGDMGWLED ATIPDELKNA GLKEKGQLSG VIKSSVGFLI VRLDDIQPAK
VKSLDEVRDD IAAKVKHEKA LDAYYALQQK VSDAASNDTE SLAGAEQAAG VKATQTGWFS
KDNLPEELNF KPVADAIFNG GLVGENGAPG INSDIITVDG DRAFVLRISE HKPEAVKPLA
DVQEQVKALV QHNKAEQQAK VDAEKLLVDL KAGKGAEAMQ AAGLKFGEPK TLSRSGRDPI
SQAAFALPLP AKDKPSYGMA TDMQGNVVLL ALDEVKQGSM PEDQKKAMVQ GITQNNAQIV
FEALMSNLRK EAKIKIGDAL EQQ