Gene Information |
Locus tag | SbBS512_E3933 |
Symbol | |
ID | 6269314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3661285 |
End bp | 3662781 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641727784 |
Product | peptidase, M16 (pitrilysin) family |
Protein accession | YP_001882217 |
Protein GI | 187734067 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
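The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; both are assumptions about how this record reports its fields, and the variable names are illustrative only.

```python
# Values copied from the Gene Information rows above.
start_bp = 3661285
end_bp = 3662781

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows.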
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGGCA CAAAAATTCG ACTTTTAGCG GGCGGTTTGC TGATGATGGC CACTGCTGGC TATGTGCAGG CAGATGCGCT CCAGCCTGAT CCAGCATGGC AACAGGGGAC GCTTTCCAAC GGTTTACAGT GGCAAGTGCT GACCACCCCC CAGCGTCCCA GCGATCGTGT TGAAATTCGC CTGCTGGTTA ATACCGGTTC GCTCGCCGAA AGTACACAAC AGAGCGGTTA CAGTCACGCC ATCCCTCGTA TTGCGCTAAC GCAAAGCGGT GGCCTTGACG CAGCACAGGC GCGTTCATTG TGGCAGCAGG GGATCGACCC TAAACGCCCG ATGCCGCCGG TAATTGTCTC TTATGACACC ACGCTGTTTA ATCTGAGTTT GCCCAATAAC CGTAACGATT TGCTGAAAGA AGCGCTCTCT TATCTGGCAA ATGCCACTGG CAAATTGACC ATCACACCAG AAACCATCAA CCACGCGCTG CAAAGTCAGG ACATGGTGGC AACCTGGCCT GCCGATACTA AAGAGGGCTG GTGGCGCTAT CGTCTGAAAG GGTCAACCTT GTTAGGTCAC GATCCTGCCG ATCCGCTGAA ACAACCCGTT GAAGCGGAAA AAATTAAAGA TTTCTATCAG AAATGGTACA CCCCGGATGC AATGACGCTA CTGGTGGTGG GAAACGTGGA TGCGCGCTCG GTTGTCGACC AAATCAACAA AACGTTTGGC GAACTGAAAG GCAAACGTGA AACGCCAGCT CCGGTGCCGA CGCTTTCTCC GCTGCGTGCG GAAGCGGTGA GTATTATGAC TGACGCGGTG CGTCAGGACC GGTTATCTAT CATGTGGGAT ACGCCGTGGC AGCCGATTCG TGAAGCAGCC GCACTGCTGC GCTACTGGCG TGCGGACCTA GCCCGCGAGG CGCTGTTCTG GCATGTTCAG CAAGCGTTAA GCGCCAGTAA CAGCAAAGAC ATCGGTCTTG GATTTGACTG CCGTGTGCTG TATCTGCGTG CGCAGTGTGC CATCAACATC GAATCACCAA ACGACAAGCT GAACAGCAAC CTTAATCTGG TGGCGCGTGA ACTGGCGAAG GTTCGCGATA AAGGTCTGCC GGAAGAAGAG TTCAATGCGT TAGTGGCGCA AAAGAAACTG GAGCTGCAGA AACTGTTTGC CGCCTATGCG CGGGCTGATA CCGATATTCT GATGGGTCAG CGGATGCGTT CGTTGCAAAA TCAGGTCGTC GATATCGCGC CGGAGCAGTA TCAGAAACTG CGGCAGGATT TCCTTAACAG CCTGACGGTG GAGATGTTAA ATCAGGATCT GCGTCAGCAG TTGTCGAATG ATATGGCGTT AATACTGCTG CAGCCGAAAG GCGAGCCGGA ATTTAACATG AAAGCGTTGC AGGCGGCCTG GGATCAAATC ATGGCCCCAT CGACTGCTGC TGCCGCCACC TCTGTCGCCA CGGATGACGT ACATCCTGAA GTGACGGATA TTCCACCTGC ACAGTAA
|
Protein sequence | MQGTKIRLLA GGLLMMATAG YVQADALQPD PAWQQGTLSN GLQWQVLTTP QRPSDRVEIR LLVNTGSLAE STQQSGYSHA IPRIALTQSG GLDAAQARSL WQQGIDPKRP MPPVIVSYDT TLFNLSLPNN RNDLLKEALS YLANATGKLT ITPETINHAL QSQDMVATWP ADTKEGWWRY RLKGSTLLGH DPADPLKQPV EAEKIKDFYQ KWYTPDAMTL LVVGNVDARS VVDQINKTFG ELKGKRETPA PVPTLSPLRA EAVSIMTDAV RQDRLSIMWD TPWQPIREAA ALLRYWRADL AREALFWHVQ QALSASNSKD IGLGFDCRVL YLRAQCAINI ESPNDKLNSN LNLVARELAK VRDKGLPEEE FNALVAQKKL ELQKLFAAYA RADTDILMGQ RMRSLQNQVV DIAPEQYQKL RQDFLNSLTV EMLNQDLRQQ LSNDMALILL QPKGEPEFNM KALQAAWDQI MAPSTAAAAT SVATDDVHPE VTDIPPAQ
|
| |
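The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It is an illustration, not the database's own method: the helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical, and the codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). Only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in an ungrouped DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGCAGGGCA CAAAAATTCG ACTTTTAGCG")
print(translate(first_blocks))
print(round(100 * gc_content(first_blocks), 1), "% GC in this fragment")
```

Passing the full gene sequence through the same helpers would allow a comparison against the Protein sequence and GC content rows; the GC value of a short fragment is not expected to match the value reported for the whole gene.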