Gene YpsIP31758_3602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3602 
SymbolhflB 
ID5386962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4067720 
End bp4069654 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content47% 
IMG OID640866622 
ProductATP-dependent metalloprotease 
Protein accessionYP_001402556 
Protein GI153946925 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00000683259 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAA ACCTAATTCT CTGGTTAGTT ATTGCAGTCG TCTTGATGTC TGTATTCCAG 
AGCTTTGGTC CCAGCGAATC GAATGGCCGT AGAGTGGATT ACTCTACTTT CATGTCCGAC
GTAACCCAAG ATCAAGTGCG TGAAGCACGT ATCAACGGAC GTGAAATTAA CGTTAGTAAG
AAAGATAACA GCAAATATAC GACTTTTATT CCGGTCAATG ATCCAAAGCT GCTAGATACC
TTATTGACTA AAAATGTGAA AGTTGTTGGT GAGCCTCCAG AAGAGCAAAG CTTACTGGCA
TCTATCTTTA TATCTTGGTT CCCAATGTTG TTATTGATTG GGGTCTGGAT CTTCTTTATG
CGTCAAATGC AGGGCGGCGG TGGCAAAGGA GCAATGTCCT TTGGTAAAAG CAAAGCTCGA
ATGCTGACGG AAGATCAGAT AAAAACCTCG TTTGCTGATG TTGCTGGTTG TGACGAAGCA
AAAGAAGAGG TCAGTGAATT AGTTGACTAC CTGCGTGAGC CAAGCCGTTT CCAGAAATTG
GGCGGTAAAA TTCCTAAAGG CGTGTTGATG GTAGGCCCTC CGGGGACGGG TAAAACCTTG
CTGGCGAAAG CCATTGCAGG TGAAGCTAAA GTGCCATTCT TCACAATTTC TGGTTCTGAC
TTCGTAGAAA TGTTCGTTGG TGTCGGTGCA TCCCGTGTCC GTGACATGTT TGAACAGGCT
AAAAAAGCTG CGCCTTGTAT CATCTTCATT GATGAAATCG ATGCGGTTGG CCGTCAACGT
GGCGCTGGTC TGGGTGGGGG TCATGACGAA CGTGAACAGA CGCTGAACCA AATGCTGGTT
GAAATGGATG GCTTCGAAGG TAATGAAGGC ATCATTGTTA TTGCGGCAAC TAACCGCCCA
GACGTTCTGG ATCCTGCGTT ATTGCGCCCA GGCCGTTTTG ACCGTCAGGT TGTCGTTGGT
TTACCTGATG TTCGTGGTCG TGAACAAATT CTGAAAGTTC ACATGCGCCG TGTGCCATTA
GATACCGATA TTGATGCTTC AGTGATCGCT CGTGGTACTC CAGGCTTCTC TGGTGCTGAT
TTGGCGAACC TGGTAAACGA AGCTGCATTG TTTGCCGCCC GCGGTAACAA ACGCGTTGTT
TCTATGGTTG AGTTCGAAAA AGCGAAAGAC AAAATTATGA TGGGTGCGGA ACGTCGCTCC
ATGGTAATGA CAGAAGCTCA GAAAGAATCT ACGGCCTACC ATGAAGCAGG GCATGCCATT
ATTGGTCGTT TAGTGCCAGA GCATGATCCA GTGCATAAAG TGACGATCAT TCCTCGTGGC
CGTGCTCTGG GTGTCACCTT CTTCTTGCCG GAAGGCGATG CAATCAGTGC TAGCCGCCAG
AAGTTGGAAA GTCAGATTTC TACCTTGTAC GGTGGTCGTC TTGCAGAAGA GATCATTTAT
GGCCCGGAAA AAGTGTCTAC CGGTGCTTCG AATGATATCA AAGTGGCAAC GTCTATTGCG
CGTAATATGG TAACGCAGTG GGGCTTCTCC GAAAAACTGG GGCCGTTGCT GTATGCTGAA
GAAGAGGGCG AAATTTTCCT CGGCCGTTCT GTAGCGAAAG CTAAGCATAT GTCTGATGAG
ACTGCGCGTA TCATCGATCA GGAAGTTAAA TTACTTGTTG AGCGTAACTA TCAGCGTGCA
CGTAAATTGT TGTTAGAAAA TATGGATGTT TTACACTCCA TGAAAGACGC GTTGATGAAG
TATGAAACTA TTGATGCGCC ACAGATTGAT GACTTGATGA ATCGCAAAGA AGTTCGCCCG
CCAGCGGGTT GGGACAATGT GACCAAAAAT AAATCATCTG ACAATGACAA TACACCAACG
GCAACCATGC CGGCTGATGA ACCGAATACT CCAACGTCGG GCAATACAGT GTCAGAACAG
TTGGGTGATA AGTAA
 
Protein sequence
MAKNLILWLV IAVVLMSVFQ SFGPSESNGR RVDYSTFMSD VTQDQVREAR INGREINVSK 
KDNSKYTTFI PVNDPKLLDT LLTKNVKVVG EPPEEQSLLA SIFISWFPML LLIGVWIFFM
RQMQGGGGKG AMSFGKSKAR MLTEDQIKTS FADVAGCDEA KEEVSELVDY LREPSRFQKL
GGKIPKGVLM VGPPGTGKTL LAKAIAGEAK VPFFTISGSD FVEMFVGVGA SRVRDMFEQA
KKAAPCIIFI DEIDAVGRQR GAGLGGGHDE REQTLNQMLV EMDGFEGNEG IIVIAATNRP
DVLDPALLRP GRFDRQVVVG LPDVRGREQI LKVHMRRVPL DTDIDASVIA RGTPGFSGAD
LANLVNEAAL FAARGNKRVV SMVEFEKAKD KIMMGAERRS MVMTEAQKES TAYHEAGHAI
IGRLVPEHDP VHKVTIIPRG RALGVTFFLP EGDAISASRQ KLESQISTLY GGRLAEEIIY
GPEKVSTGAS NDIKVATSIA RNMVTQWGFS EKLGPLLYAE EEGEIFLGRS VAKAKHMSDE
TARIIDQEVK LLVERNYQRA RKLLLENMDV LHSMKDALMK YETIDAPQID DLMNRKEVRP
PAGWDNVTKN KSSDNDNTPT ATMPADEPNT PTSGNTVSEQ LGDK