Gene SNSL254_A3555 details

Gene Information

Locus tag: SNSL254_A3555
Symbol: hflB
ID: 6485513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3446749
End bp: 3448692
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 56%
IMG OID: 642738835
Product: ATP-dependent metalloprotease
Protein accession: YP_002042552
Protein GI: 194442507
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
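
The coordinates and lengths listed above appear mutually consistent. A minimal sketch of the arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither assumption is stated in the record itself):

```python
# Values copied from the Gene Information fields above.
start_bp = 3446749
end_bp = 3448692

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the listed Gene Length (1944 bp) and Protein Length (647 aa).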


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00594001
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 93
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGTGACA TGGCGAAAAA CCTAATACTC TGGCTGGTCA TTGCCGTTGT GCTGATGTCA 
GTATTCCAGA GCTTTGGGCC CAGCGAGTCT AATGGCCGTA AGGTGGATTA CTCTACCTTC
CTGCAAGAGG TCAATCAGGA CCAGGTTCGC GAAGCGCGTA TCAACGGACG TGAGATCAAC
GTTACCAAGA AAGATAGTAA CCGTTACACG ACTTACATTC CGATTAATGA TCCGAAGCTG
CTTGATAACC TGCTGACTAA AAACGTCAAG GTTGTTGGCG AACCACCTGA AGAGCCAAGC
CTGCTGGCTT CTATCTTCAT TTCCTGGTTC CCGATGCTGT TGCTGATCGG CGTCTGGATC
TTCTTCATGC GTCAGATGCA GGGCGGCGGT GGCAAAGGCG CCATGTCGTT CGGTAAGAGC
AAGGCGCGTA TGCTGACGGA AGATCAGATC AAAACCACGT TTGCTGACGT CGCAGGTTGT
GACGAAGCGA AAGAAGAAGT GGCGGAACTG GTCGAATACC TGCGTGAACC GAGCCGTTTC
CAGAAGCTGG GCGGTAAAAT TCCGAAAGGC GTCCTGATGG TCGGCCCGCC GGGTACCGGT
AAAACCCTGC TGGCAAAAGC CATTGCGGGT GAAGCGAAGG TGCCATTCTT TACGATTTCC
GGTTCTGACT TTGTGGAAAT GTTCGTCGGT GTCGGCGCGT CTCGTGTGCG TGACATGTTC
GAACAGGCCA AGAAAGCGGC GCCGTGCATT ATCTTCATCG ATGAAATCGA CGCCGTAGGT
CGCCAGCGTG GCGCAGGTCT GGGCGGTGGT CACGATGAAC GTGAGCAGAC GTTGAACCAG
ATGCTGGTTG AGATGGACGG CTTCGAAGGT AACGAAGGTA TCATCGTTAT CGCCGCAACT
AACCGTCCGG ACGTCCTTGA CCCGGCGCTG CTGCGTCCAG GCCGTTTTGA CCGTCAGGTG
GTGGTAGGCC TGCCAGATGT TCGCGGTCGT GAGCAGATTC TGAAGGTGCA TATGCGTCGC
GTACCGTTAG CGACGGATAT TGATGCGGCG ATCATTGCAC GCGGCACGCC GGGCTTCTCC
GGTGCGGACC TGGCGAACCT GGTCAACGAA GCGGCGCTGT TTGCCGCACG CGGCAACAAA
CGCGTAGTGT CGATGGTTGA GTTCGAGAAA GCGAAAGACA AAATCATGAT GGGCGCCGAA
CGTCGCTCCA TGGTGATGAC GGAAGCGCAG AAAGAGTCGA CCGCGTACCA CGAAGCGGGC
CACGCGATTA TCGGTCGCCT GGTGCCGGAA CACGATCCGG TGCACAAAGT GACGATTATC
CCGCGTGGTC GTGCGCTGGG CGTGACCTTC TTCCTGCCTG AAGGCGACGC GATCAGCGCC
AGCCGTCAGA AGCTGGAAAG CCAAATCAGC ACGCTGTACG GCGGCCGTCT GGCGGAAGAG
ATTATCTACG GCGTTGAGCA TGTTTCCACC GGCGCGTCGA ACGACATTAA AGTCGCGACT
AACCTGGCGC GTAACATGGT CACCCAGTGG GGCTTCTCGG AGAAACTCGG TCCGTTGCTG
TATGCGGAAG AAGAGGGCGA AGTGTTCCTC GGCCGTAGCG TCGCAAAAGC GAAACATATG
TCTGATGAAA CTGCGCGTAT CATCGACCAG GAAGTGAAAG CGCTGATTGA ACGTAACTAC
AATCGCGCTC GTCAGATCCT GACTGACAAT ATGGATATTC TGCACGCGAT GAAAGATGCG
CTGATGAAAT ATGAAACCAT CGATGCGCCG CAGATTGATG ACCTGATGGC GCGTCGTGAA
GTGCGTCCGC CTGCGGGCTG GGAAGATCCA AACGGCACCA ATAACTCTGA CAGCAATGGT
ACGCCTCAGG CGCCGCGTCC GGTTGATGAA CCACGCACGC CGAACCCGGG CAACACGATG
TCAGAGCAGC TGGGCGACAA ATAA
 
Protein sequence
MSDMAKNLIL WLVIAVVLMS VFQSFGPSES NGRKVDYSTF LQEVNQDQVR EARINGREIN 
VTKKDSNRYT TYIPINDPKL LDNLLTKNVK VVGEPPEEPS LLASIFISWF PMLLLIGVWI
FFMRQMQGGG GKGAMSFGKS KARMLTEDQI KTTFADVAGC DEAKEEVAEL VEYLREPSRF
QKLGGKIPKG VLMVGPPGTG KTLLAKAIAG EAKVPFFTIS GSDFVEMFVG VGASRVRDMF
EQAKKAAPCI IFIDEIDAVG RQRGAGLGGG HDEREQTLNQ MLVEMDGFEG NEGIIVIAAT
NRPDVLDPAL LRPGRFDRQV VVGLPDVRGR EQILKVHMRR VPLATDIDAA IIARGTPGFS
GADLANLVNE AALFAARGNK RVVSMVEFEK AKDKIMMGAE RRSMVMTEAQ KESTAYHEAG
HAIIGRLVPE HDPVHKVTII PRGRALGVTF FLPEGDAISA SRQKLESQIS TLYGGRLAEE
IIYGVEHVST GASNDIKVAT NLARNMVTQW GFSEKLGPLL YAEEEGEVFL GRSVAKAKHM
SDETARIIDQ EVKALIERNY NRARQILTDN MDILHAMKDA LMKYETIDAP QIDDLMARRE
VRPPAGWEDP NGTNNSDSNG TPQAPRPVDE PRTPNPGNTM SEQLGDK
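
The gene sequence begins with TTG rather than ATG, while the protein begins with M. This is expected under translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 60 nucleotides only. The `translate` helper and the `START_CODONS` set (restricted here to the three most common bacterial start codons) are illustrative assumptions, not part of the record.

```python
from itertools import product

# Codon assignments in NCBI base order (T, C, A, G); table 11 shares these
# amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the three most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognised start codon as M (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt = 20 codons).
prefix = "TTGAGTGACA TGGCGAAAAA CCTAATACTC TGGCTGGTCA TTGCCGTTGT GCTGATGTCA"
print(translate(prefix))
```

The output can be compared with the first 20 residues of the protein sequence listed above.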