Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3592 |
Symbol | hflB |
ID | 6268739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3341641 |
End bp | 3343584 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727462 |
Product | ATP-dependent metalloprotease |
Protein accession | YP_001881907 |
Protein GI | 187731936 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000114537 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTGACA TGGCGAAAAA CCTAATACTC TGGCTGGTCA TTGCCGTTGT GCTGATGTCC GTATTCCAGA GCTTTGGGCC CAGCGAGTCT AATGGCCGTA AGGTGGATTA CTCTACCTTC CTACAAGAGG TCAATAACGA CCAGGTTCGT GAAGCGCGTA TCAACGGACG TGAAATCAAC GTTACCAAGA AAGATAGTAA CCGTTATACC ACTTACATTC CGGTTCAGGA TCCGAAATTA CTGGATAACC TGTTGACCAA GAACGTCAAG GTTGTCGGTG AACCGCCTGA AGAACCAAGC CTGCTGGCTT CTATCTTCAT CTCTTGGTTC CCGATGCTGT TGCTGATTGG TGTCTGGATC TTCTTCATGC GTCAAATGCA GGGCGGCGGT GGCAAAGGTG CCATGTCGTT TGGTAAGAGC AAAGCGCGCA TGCTGACGGA AGATCAGATC AAAACGACCT TTGCTGACGT TGCGGGCTGC GACGAAGCAA AAGAAGAAGT TGCTGAACTG GTAGAGTATC TGCGCGAGCC GAGCCGCTTC CAGAAACTCG GCGGTAAGAT CCCGAAAGGC GTCCTGATGG TCGGTCCTCC GGGTACCGGT AAAACGTTGC TGGCGAAAGC GATTGCAGGC GAAGCGAAAG TTCCGTTCTT TACTATCTCC GGTTCTGACT TCGTAGAAAT GTTCGTCGGT GTGGGTGCAT CCCGTGTTCG TGACATGTTC GAACAGGCGA AGAAAGCGGC ACCGTGCATC ATCTTTATCG ATGAAATCGA CGCCGTAGGC CGCCAGCGTG GCGCTGGTCT GGGCGGTGGT CACGATGAAC GTGAACAGAC TCTGAACCAG ATGCTGGTTG AGATGGATGG CTTCGAAGGT AACGAAGGTA TCATCGTTAT CGCCGCGACT AACCGTCCGG ACGTTCTTGA CCCGGCCCTG CTGCGTCCTG GCCGTTTCGA CCGTCAGGTT GTGGTTGGCT TGCCAGATGT TCGCGGTCGT GAGCAGATCC TGAAAGTTCA CATGCGTCGC GTACCATTGG CACCCGATAT CGACGCGGCA ATCATTGCCC GTGGTACTCC TAGTTTCTCC GGTGCTGACC TGGCGAACCT GGTAAACGAA GCGGCACTGT TCGCTGCTCG TGGCAACAAA CGCGTTGTGT CGATGATTGA GTTCGAAAAA GCGAAAGATA AAATCATGAT GGGTGCGGAA CGTCGCTCCA TGGTGATGAC GGAAGCGCAG AAAGAATCGA CGGCTTACCA CGAAGCGGGT CATGCGATTA TCGGTCGCCT GGTGCCGGAA CACGATCCGG TGCACAAAGT GACGATTATC CCACGCGGTC GTGCGCTGGG TGTGACTTTC TTCTTGCCTG AGGGCGACGC AATCAGCGCC AGCCGTCAGA AACTGGAAAG CCAGATTTCT ACGCTGTACG GTGGTCGTCT GGCAGAAGAG ATCATCTACG GGCCGGAACA TGTTTCTACC GGTGCGTCCA ACGATATTAA AGTTGCGACC AACCTGGCGC GTAACATGGT GACTCAGTGG GGCTTCTCTG AGAAATTGGG TCCACTGCTG TACGCGGAAG AAGAAGGTGA AGTGTTCCTC GGCCGTAGCG TAGCAAAAGC GAAACATATG TCCGATGAAA CTGCACGTAT CATCGACCAG GAAGTGAAAG CACTGATTGA GCGTAACTAT AATCGTGCGC GTCAGCTTCT GACCGACAAT ATGGATATTC TGCATGCGAT GAAAGATGCT CTCATGAAAT ATGAGACTAT CGACGCACCG CAGATTGATG ACCTGATGGC ACGTCGCGAT GTACGTCCGC CAGCGGGCTG GGAAGAACCA GGCGCTTCTA ACAATTCTGG CGACAATGGT AGTCCAAAGG CTCCTCGTCC GGTTGATGAA CCGCGTACGC CGAACCCGGG TAACACCATG TCAGAGCAGT TAGGCGACAA GTAA
|
Protein sequence | MSDMAKNLIL WLVIAVVLMS VFQSFGPSES NGRKVDYSTF LQEVNNDQVR EARINGREIN VTKKDSNRYT TYIPVQDPKL LDNLLTKNVK VVGEPPEEPS LLASIFISWF PMLLLIGVWI FFMRQMQGGG GKGAMSFGKS KARMLTEDQI KTTFADVAGC DEAKEEVAEL VEYLREPSRF QKLGGKIPKG VLMVGPPGTG KTLLAKAIAG EAKVPFFTIS GSDFVEMFVG VGASRVRDMF EQAKKAAPCI IFIDEIDAVG RQRGAGLGGG HDEREQTLNQ MLVEMDGFEG NEGIIVIAAT NRPDVLDPAL LRPGRFDRQV VVGLPDVRGR EQILKVHMRR VPLAPDIDAA IIARGTPSFS GADLANLVNE AALFAARGNK RVVSMIEFEK AKDKIMMGAE RRSMVMTEAQ KESTAYHEAG HAIIGRLVPE HDPVHKVTII PRGRALGVTF FLPEGDAISA SRQKLESQIS TLYGGRLAEE IIYGPEHVST GASNDIKVAT NLARNMVTQW GFSEKLGPLL YAEEEGEVFL GRSVAKAKHM SDETARIIDQ EVKALIERNY NRARQLLTDN MDILHAMKDA LMKYETIDAP QIDDLMARRD VRPPAGWEEP GASNNSGDNG SPKAPRPVDE PRTPNPGNTM SEQLGDK
|
| |