Gene Information |
Locus tag | SbBS512_E0047 |
Symbol | surA |
ID | 6269855 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 45957 |
End bp | 47243 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641724306 |
Product | peptidyl-prolyl cis-trans isomerase SurA |
Protein accession | YP_001878866 |
Protein GI | 187733861 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0760] Parvulin-like peptidyl-prolyl isomerase |
TIGRFAM ID | |
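
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 45957
end_bp = 47243

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the gene length comes out to 1287 bp with no leftover bases, and the protein length to 428 aa, in agreement with the listed values.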
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00000013551 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAACT GGAAAACGCT GCTTCTCGGT ATCGCCATGA TCGCGAATAC CAGTTTCGCT GCCCCCCAGG TAGTCGATAA AGTCGCAGCC GTCGTCAATA ACGGCGTCGT GCTGGAAAGC GACGTTGATG GATTAATGCA GTCGGTAAAA CTGAACGCTG CGCAGGCAAG GCAGCAGCTT CCTGATGACG CGACGCTGCG CCACCAAATC ATGGAACGTT TGATCATGGA TCAAATCATC CTGCAGATGG GGCAGAAAAT GGGAGTGAAA ATCTCCGATG AGCAGCTGGA TCAGGCGATT GCTAACATCG CGAAACAGAA CAACATGACG CTGGATCAGA TGCGCAGCCG TCTGGCTTAC GATGGTCTGA ACTACAACAC CTATCGTAAC CAGATCCGCA AAGAGATGAT TATCTCTGAA GTGCGTAACA ACGAGGTGCG TCGTCGCATC ACCATCCTGC CGCAGGAAGT CGAATCCCTG GCGCAGCAGG TGGGTAACCA AAACGACGCC AGCACTGAGC TGAACCTGAG CCACATCCTG ATCCCGCTGC CGGAAAACCC GACTTCTGAT CAGGTGAACG AAGCGGAAAG CCAGGCGCGC GCCATTGTCG ATCAGGTGCG TAACGGCGCT GATTTCGGTA AGCTGGCGAT TGCTCATTCT GCCGACCAGC AGGCGCTGAA CGGCGGCCAG ATGGGCTGGG GCCGTATTCA GGAGTTGCCG GGTATCTTCG CCCAGGCATT AAGCACCGCG AAGAAAGGCG ACATTGTTGG CCCGATTCGT TCCGGCGTTG GCTTCCATAT TCTGAAAGTT AACGACCTGC GCGGCGAAAG CAAAAATATC TCGGTGACCG AAGTTCATGC TCGCCATATT CTGCTGAAAC CGTCGCCGAT CATGACTGAC GAACAGGCCC GTGTGAAACT GGAACAGATT GCTGCCGATA TCAAGAGTGG TAAAACGACT TTTGCTGCCG CAGCGAAAGA GTTCTCTCAG GATCCAGACT CTGCTAACCA GGGCGGTGAT CTCGGCTGGG CTACACCAGA TATTTTCGAT CCGGCCTTCC GTGACGCCCT GACCCGCCTG AACAAAGGTC AAATGAGTGC ACCGGTTCAC TCTTCATTCG GCTGGCATTT AATCGAACTG CTGGATACCC GTAATGTCGA TAAAACCGAC GCTGCGCAGA AGGATCGTGC ATACCGCATG CTGATGAACC GTAAGTTCTC GGAAGAAGCA GCAAGCTGGA TGCAGGAACA ACGCGCCAGC GCCTACGTTA AAATCCTGAG CAACTAA
Protein sequence | MKNWKTLLLG IAMIANTSFA APQVVDKVAA VVNNGVVLES DVDGLMQSVK LNAAQARQQL PDDATLRHQI MERLIMDQII LQMGQKMGVK ISDEQLDQAI ANIAKQNNMT LDQMRSRLAY DGLNYNTYRN QIRKEMIISE VRNNEVRRRI TILPQEVESL AQQVGNQNDA STELNLSHIL IPLPENPTSD QVNEAESQAR AIVDQVRNGA DFGKLAIAHS ADQQALNGGQ MGWGRIQELP GIFAQALSTA KKGDIVGPIR SGVGFHILKV NDLRGESKNI SVTEVHARHI LLKPSPIMTD EQARVKLEQI AADIKSGKTT FAAAAKEFSQ DPDSANQGGD LGWATPDIFD PAFRDALTRL NKGQMSAPVH SSFGWHLIEL LDTRNVDKTD AAQKDRAYRM LMNRKFSEEA ASWMQEQRAS AYVKILSN
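
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of plain Python. This is a hedged illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table holds the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits). Although the record places the gene on the minus strand, the gene sequence as printed begins with ATG and ends with TAA, so the sketch assumes it is already given in coding orientation.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 18 bases of the gene sequence listed above.
first_bases = "ATGAAGAACT GGAAAACGCT GCTTCTCGGT"
last_bases = "A AAATCCTGAG CAACTAA"

print(translate(first_bases))  # MKNWKTLLLG
print(translate(last_bases))   # KILSN
```

The two fragments translate to the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete 428-residue protein and the reported GC content.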