Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0057 |
Symbol | surA |
ID | 6143017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 59952 |
End bp | 61238 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641614958 |
Product | peptidyl-prolyl cis-trans isomerase SurA |
Protein accession | YP_001742174 |
Protein GI | 170682162 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0760] Parvulin-like peptidyl-prolyl isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0283729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.558643 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACT GGAAAACGCT GCTTCTCGGT ATCGCCATGA TCGCGAATAC CAGTTTCGCT GCCCCCCAGG TAGTCGATAA AGTCGCAGCC GTCGTCAATA ACGGCGTCGT GCTGGAAAGC GACGTTGATG GATTAATGCA GTCGGTAAAA CTGAACGCTG CTCAGGCAAG GCAGCAACTT CCTGATGACG CGACGCTGCG CCACCAAATC ATGGAACGTT TGATCATGGA TCAAATCATC CTGCAGATGG GGCAGAAAAT GGGAGTGAAA ATCTCCGATG AGCAGCTGGA TCAGGCGATT GCTAACATCG CGAAACAGAA CAACATGACG CTGGATCAGA TGCGCAGCCG TCTGGCTTAC GATGGTCTGA ACTACAACAC CTATCGTAAC CAGATCCGCA AAGAGATGAT TATCTCTGAA GTGCGTAACA ACGAGGTGCG TCGTCGCATC ACCATCCTGC CGCAGGAAGT CGAATCCCTG GCGCAGCAGG TGGGTAACCA AAACGACGCC AGCACTGAGC TGAACCTGAG CCACATCCTG ATCCCGCTGC CGGAAAACCC GACCTCTGAT CAAGTGAACG AAGCGGAAAG CCAGGCGCGT GCCATTGTCG ATCAGGCGCG TAACGGCGCT GATTTCGGTA AGCTGGCGAT TGCTCATTCT GCCGACCAGC AGGCGCTGAA CGGCGGCCAG ATGGGCTGGG GCCGTATTCA GGAGTTGCCG GGGATCTTCG CCCAGGCATT AAGCACCGCG AAGAAAGGCG ACATTGTTGG CCCGATTCGT TCCGGCGTTG GCTTCCATAT TCTGAAAGTT AACGACCTGC GCGGCGAAAG CAAAAATATC TCGGTGACCG AAGTTCATGC TCGCCATATT CTGCTGAAAC CGTCGCCGAT CATGACTGAC GAACAGGCCC GCGTGAAACT GGAACAGATT GCTGCCGATA TCAAGAGTGG TAAAACGACT TTTGCTGCCG CAGCGAAAGA GTTCTCTCAG GATCCAGGCT CTGCTAACCA GGGCGGTGAT CTCGGTTGGG CTACACCAGA TATTTTCGAT CCGGCCTTCC GTGACGCCCT GACCCGCCTG AACAAAGGTC AAATGAGTGC ACCGGTTCAC TCTTCATTCG GCTGGCATTT AATCGAACTA CTGGATACCC GTAATGTCGA TAAAACCGAC GCGGCGCAGA AAGATCGTGC GTACCGCATG CTGATGAACC GTAAGTTCTC GGAAGAAGCA GCAAGCTGGA TGCAAGAACA ACGTGCCAGC GCCTACGTTA AAATCCTGAG CAACTAA
|
Protein sequence | MKNWKTLLLG IAMIANTSFA APQVVDKVAA VVNNGVVLES DVDGLMQSVK LNAAQARQQL PDDATLRHQI MERLIMDQII LQMGQKMGVK ISDEQLDQAI ANIAKQNNMT LDQMRSRLAY DGLNYNTYRN QIRKEMIISE VRNNEVRRRI TILPQEVESL AQQVGNQNDA STELNLSHIL IPLPENPTSD QVNEAESQAR AIVDQARNGA DFGKLAIAHS ADQQALNGGQ MGWGRIQELP GIFAQALSTA KKGDIVGPIR SGVGFHILKV NDLRGESKNI SVTEVHARHI LLKPSPIMTD EQARVKLEQI AADIKSGKTT FAAAAKEFSQ DPGSANQGGD LGWATPDIFD PAFRDALTRL NKGQMSAPVH SSFGWHLIEL LDTRNVDKTD AAQKDRAYRM LMNRKFSEEA ASWMQEQRAS AYVKILSN
|
| |