Gene YpAngola_A3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3985 
SymbolhflB 
ID5802463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4238500 
End bp4240443 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content47% 
IMG OID641341770 
ProductATP-dependent metalloprotease 
Protein accessionYP_001608280 
Protein GI162419672 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000069949 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTGACA TGGCGAAAAA CCTAATTCTC TGGTTAGTTA TTGCAGTCGT CTTGATGTCT 
GTATTCCAGA GCTTTGGTCC CAGCGAATCG AATGGCCGTA GAGTGGATTA CTCTACTTTC
ATGTCCGACG TAACCCAAGA TCAAGTGCGT GAAGCACGTA TCAACGGACG TGAAATTAAC
GTTAGTAAGA AAGATAACAG CAAATATACG ACTTTTATTC CGGTCAATGA TCCAAAGCTG
CTAGATACCT TATTGACTAA AAATGTGAAA GTTGTTGGTG AGCCTCCAGA AGAGCAAAGC
TTACTGGCAT CTATCTTTAT ATCTTGGTTC CCAATGTTGT TATTGATTGG GGTCTGGATC
TTCTTTATGC GTCAAATGCA GGGCGGCGGT GGCAAAGGAG CAATGTCCTT TGGTAAAAGC
AAAGCTCGAA TGCTGACGGA AGATCAGATA AAAACCTCGT TTGCTGATGT TGCTGGTTGT
GACGAAGCAA AAGAAGAGGT CAGTGAATTA GTTGACTACC TGCGTGAGCC AAGCCGTTTC
CAGAAATTGG GCGGTAAAAT TCCTAAAGGC GTGTTGATGG TAGGCCCTCC GGGGACGGGT
AAAACCTTGC TGGCGAAAGC CATTGCAGGT GAAGCTAAAG TGCCATTCTT CACAATTTCT
GGTTCTGACT TCGTAGAAAT GTTCGTTGGT GTCGGTGCAT CCCGTGTCCG TGACATGTTT
GAACAGGCTA AAAAAGCTGC GCCTTGTATC ATCTTCATTG ATGAAATCGA TGCGGTTGGC
CGTCAACGTG GCGCTGGTCT GGGTGGGGGT CATGACGAAC GTGAACAGAC GCTGAACCAA
ATGCTGGTTG AAATGGATGG CTTCGAAGGT AATGAAGGCA TCATTGTTAT TGCGGCAACT
AACCGCCCAG ACGTTCTGGA TCCTGCGTTA TTGCGCCCAG GCCGTTTTGA CCGTCAGGTT
GTCGTTGGTT TACCTGATGT TCGTGGTCGT GAACAAATTC TTAAAGTTCA CATGCGCCGT
GTGCCATTAG ATACCGATAT TGATGCTTCA GTGATCGCTC GTGGTATTCC AGGCTTCTCT
GGTGCTGATT TGGCGAACCT GGTAAACGAA GCTGCATTGT TTGCCGCCCG CGGTAACAAA
CGCGTTGTTT CTATGGTTGA GTTCGAAAAA GCGAAAGACA AAATTATGAT GGGTGCGGAA
CGTCGCTCCA TGGTAATGAC AGAAGCGCAG AAAGAATCTA CGGCCTACCA TGAAGCAGGG
CATGCCATTA TTGGTCGTTT AGTGCCAGAG CATGATCCAG TGCATAAAGT GACGATCATT
CCTCGTGGCC GTGCTCTGGG TGTCACCTTC TTCTTGCCGG AAGGCGATGC AATCAGTGCT
AGCCGCCAGA AGTTGGAAAG TCAGATTTCT ACCTTGTACG GTGGTCGTCT TGCAGAAGAG
ATCATTTATG GCCCGGAAAA AGTGTCTACC GGTGCTTCGA ATGATATCAA AGTGGCAACG
TCTATTGCGC GTAACATGGT AACGCAGTGG GGCTTCTCCG AAAAACTGGG GCCGTTGCTG
TATGCTGAAG AAGAGGGCGA AATTTTCCTC GGCCGTTCTG TAGCGAAAGC TAAGCATATG
TCTGATGAGA CTGCGCGTAT CATCGATCAG GAAGTTAAAT TACTTGTTGA GCGTAACTAT
CAGCGTGCAC GTAAATTGCT GTTAGAAAAT ATGGATGTTT TACACTCCAT GAAAGACGCG
TTGATGAAGT ATGAAACTAT TGATGCGCCA CAGATTGATG ACTTGATGAA TCGCAAAGAA
GTTCGCCCGC CAGCGGGTTG GGACAATGTG ACCAAAAATA AATCATCTGA CAATGACAAT
ACACCAACGG CAACCATGCC GGCTGATGAA CCGAATACTC CAACGTCGGG CAATACAGTG
TCAGAACAGT TGGGTGATAA GTAA
 
Protein sequence
MSDMAKNLIL WLVIAVVLMS VFQSFGPSES NGRRVDYSTF MSDVTQDQVR EARINGREIN 
VSKKDNSKYT TFIPVNDPKL LDTLLTKNVK VVGEPPEEQS LLASIFISWF PMLLLIGVWI
FFMRQMQGGG GKGAMSFGKS KARMLTEDQI KTSFADVAGC DEAKEEVSEL VDYLREPSRF
QKLGGKIPKG VLMVGPPGTG KTLLAKAIAG EAKVPFFTIS GSDFVEMFVG VGASRVRDMF
EQAKKAAPCI IFIDEIDAVG RQRGAGLGGG HDEREQTLNQ MLVEMDGFEG NEGIIVIAAT
NRPDVLDPAL LRPGRFDRQV VVGLPDVRGR EQILKVHMRR VPLDTDIDAS VIARGIPGFS
GADLANLVNE AALFAARGNK RVVSMVEFEK AKDKIMMGAE RRSMVMTEAQ KESTAYHEAG
HAIIGRLVPE HDPVHKVTII PRGRALGVTF FLPEGDAISA SRQKLESQIS TLYGGRLAEE
IIYGPEKVST GASNDIKVAT SIARNMVTQW GFSEKLGPLL YAEEEGEIFL GRSVAKAKHM
SDETARIIDQ EVKLLVERNY QRARKLLLEN MDVLHSMKDA LMKYETIDAP QIDDLMNRKE
VRPPAGWDNV TKNKSSDNDN TPTATMPADE PNTPTSGNTV SEQLGDK