Gene EcSMS35_0057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0057 
SymbolsurA 
ID6143017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp59952 
End bp61238 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content54% 
IMG OID641614958 
Productpeptidyl-prolyl cis-trans isomerase SurA 
Protein accessionYP_001742174 
Protein GI170682162 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0760] Parvulin-like peptidyl-prolyl isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0283729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.558643 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACT GGAAAACGCT GCTTCTCGGT ATCGCCATGA TCGCGAATAC CAGTTTCGCT 
GCCCCCCAGG TAGTCGATAA AGTCGCAGCC GTCGTCAATA ACGGCGTCGT GCTGGAAAGC
GACGTTGATG GATTAATGCA GTCGGTAAAA CTGAACGCTG CTCAGGCAAG GCAGCAACTT
CCTGATGACG CGACGCTGCG CCACCAAATC ATGGAACGTT TGATCATGGA TCAAATCATC
CTGCAGATGG GGCAGAAAAT GGGAGTGAAA ATCTCCGATG AGCAGCTGGA TCAGGCGATT
GCTAACATCG CGAAACAGAA CAACATGACG CTGGATCAGA TGCGCAGCCG TCTGGCTTAC
GATGGTCTGA ACTACAACAC CTATCGTAAC CAGATCCGCA AAGAGATGAT TATCTCTGAA
GTGCGTAACA ACGAGGTGCG TCGTCGCATC ACCATCCTGC CGCAGGAAGT CGAATCCCTG
GCGCAGCAGG TGGGTAACCA AAACGACGCC AGCACTGAGC TGAACCTGAG CCACATCCTG
ATCCCGCTGC CGGAAAACCC GACCTCTGAT CAAGTGAACG AAGCGGAAAG CCAGGCGCGT
GCCATTGTCG ATCAGGCGCG TAACGGCGCT GATTTCGGTA AGCTGGCGAT TGCTCATTCT
GCCGACCAGC AGGCGCTGAA CGGCGGCCAG ATGGGCTGGG GCCGTATTCA GGAGTTGCCG
GGGATCTTCG CCCAGGCATT AAGCACCGCG AAGAAAGGCG ACATTGTTGG CCCGATTCGT
TCCGGCGTTG GCTTCCATAT TCTGAAAGTT AACGACCTGC GCGGCGAAAG CAAAAATATC
TCGGTGACCG AAGTTCATGC TCGCCATATT CTGCTGAAAC CGTCGCCGAT CATGACTGAC
GAACAGGCCC GCGTGAAACT GGAACAGATT GCTGCCGATA TCAAGAGTGG TAAAACGACT
TTTGCTGCCG CAGCGAAAGA GTTCTCTCAG GATCCAGGCT CTGCTAACCA GGGCGGTGAT
CTCGGTTGGG CTACACCAGA TATTTTCGAT CCGGCCTTCC GTGACGCCCT GACCCGCCTG
AACAAAGGTC AAATGAGTGC ACCGGTTCAC TCTTCATTCG GCTGGCATTT AATCGAACTA
CTGGATACCC GTAATGTCGA TAAAACCGAC GCGGCGCAGA AAGATCGTGC GTACCGCATG
CTGATGAACC GTAAGTTCTC GGAAGAAGCA GCAAGCTGGA TGCAAGAACA ACGTGCCAGC
GCCTACGTTA AAATCCTGAG CAACTAA
 
Protein sequence
MKNWKTLLLG IAMIANTSFA APQVVDKVAA VVNNGVVLES DVDGLMQSVK LNAAQARQQL 
PDDATLRHQI MERLIMDQII LQMGQKMGVK ISDEQLDQAI ANIAKQNNMT LDQMRSRLAY
DGLNYNTYRN QIRKEMIISE VRNNEVRRRI TILPQEVESL AQQVGNQNDA STELNLSHIL
IPLPENPTSD QVNEAESQAR AIVDQARNGA DFGKLAIAHS ADQQALNGGQ MGWGRIQELP
GIFAQALSTA KKGDIVGPIR SGVGFHILKV NDLRGESKNI SVTEVHARHI LLKPSPIMTD
EQARVKLEQI AADIKSGKTT FAAAAKEFSQ DPGSANQGGD LGWATPDIFD PAFRDALTRL
NKGQMSAPVH SSFGWHLIEL LDTRNVDKTD AAQKDRAYRM LMNRKFSEEA ASWMQEQRAS
AYVKILSN