Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00056 |
Symbol | surA |
ID | 8114307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 57612 |
End bp | 58898 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644846350 |
Product | hypothetical protein |
Protein accession | YP_002997923 |
Protein GI | 251783619 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0760] Parvulin-like peptidyl-prolyl isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0286102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAACT GGAAAACGCT GCTTCTCGGT ATCGCCATGA TCGCGAATAC CAGTTTCGCT GCCCCCCAGG TAGTCGATAA AGTCGCAGCC GTCGTCAATA ACGGCGTCGT GCTGGAAAGC GACGTTGATG GATTAATGCA GTCGGTAAAA CTGAACGCTG CTCAGGCAAG GCAGCAACTT CCTGATGACG CGACGCTGCG CCACCAAATC ATGGAACGTT TGATCATGGA TCAAATCATC CTGCAGATGG GGCAGAAAAT GGGAGTGAAA ATCTCCGATG AGCAGCTGGA TCAGGCGATT GCTAACATCG CGAAACAGAA CAACATGACG CTGGATCAGA TGCGCAGCCG TCTGGCTTAC GATGGACTGA ACTACAACAC CTATCGTAAC CAGATCCGCA AAGAGATGAT TATCTCTGAA GTGCGTAACA ACGAGGTGCG TCGTCGCATC ACCATCCTGC CGCAGGAAGT CGAATCCCTG GCGCAGCAGG TGGGTAACCA AAACGACGCC AGCACTGAGC TGAACCTGAG CCACATCCTG ATCCCGCTGC CGGAAAACCC GACCTCTGAT CAGGTGAACG AAGCGGAAAG CCAGGCGCGC GCCATTGTCG ATCAGGCGCG TAACGGCGCT GATTTCGGTA AGCTGGCGAT TGCTCATTCT GCCGACCAGC AGGCGCTGAA CGGCGGCCAG ATGGGCTGGG GCCGTATTCA GGAGTTGCCC GGGATCTTCG CCCAGGCATT AAGCACCGCG AAGAAAGGCG ACATTGTTGG CCCGATTCGT TCCGGCGTTG GCTTCCATAT TCTGAAAGTT AACGACCTGC GCGGCGAAAG CAAAAATATC TCGGTGACCG AAGTTCATGC TCGCCATATT CTGCTGAAAC CGTCGCCGAT CATGACTGAC GAACAGGCCC GTGTGAAACT GGAACAGATT GCTGCTGATA TCAAGAGTGG TAAAACGACT TTTGCTGCCG CAGCGAAAGA GTTCTCTCAG GATCCAGGCT CTGCTAACCA GGGCGGCGAT CTCGGCTGGG CTACACCAGA TATTTTCGAT CCGGCTTTCC GTGACGCCCT GACCCGCCTG AACAAAGGTC AAATGAGTGC ACCGGTTCAC TCTTCATTCG GCTGGCATTT AATCGAACTG CTGGATACCC GAAATGTCGA TAAAACCGAC GCTGCGCAGA AAGATCGTGC ATACCGCATG CTGATGAACC GTAAGTTCTC GGAAGAAGCA GCAAGCTGGA TGCAGGAACA ACGTGCCAGC GCCTACGTTA AAATCCTGAG CAACTAA
|
Protein sequence | MKNWKTLLLG IAMIANTSFA APQVVDKVAA VVNNGVVLES DVDGLMQSVK LNAAQARQQL PDDATLRHQI MERLIMDQII LQMGQKMGVK ISDEQLDQAI ANIAKQNNMT LDQMRSRLAY DGLNYNTYRN QIRKEMIISE VRNNEVRRRI TILPQEVESL AQQVGNQNDA STELNLSHIL IPLPENPTSD QVNEAESQAR AIVDQARNGA DFGKLAIAHS ADQQALNGGQ MGWGRIQELP GIFAQALSTA KKGDIVGPIR SGVGFHILKV NDLRGESKNI SVTEVHARHI LLKPSPIMTD EQARVKLEQI AADIKSGKTT FAAAAKEFSQ DPGSANQGGD LGWATPDIFD PAFRDALTRL NKGQMSAPVH SSFGWHLIEL LDTRNVDKTD AAQKDRAYRM LMNRKFSEEA ASWMQEQRAS AYVKILSN
|
| |