Gene BURPS668_A1574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1574 
SymbolftsH 
ID4887200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1505993 
End bp1507993 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content70% 
IMG OID640131513 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001062570 
Protein GI126442831 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.580758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGCG AAACCGGATA CATGGGATTC GTCGTCGTGC TCGTGTTCAT GGTGCTGCTC 
GCGCTGCAGC TCGCCACGCT CAGCGCGCCG GCCACGCAGA TCGCGTACAG CGACTTTCGC
AAGCTCGCCG CGGCCGCGCA GCTCGACGAT CTCGAAGTCA GCCCGACGCG CATCACGGGC
GTGCTGCGCA GTGCGTCCGC GGCGGCGGCG CTGCCCGCCT CCGACGCGGA GGCGATCAAG
CGCGCGGGCA CGCCGTGGCG CTTCTCGACA AAGCGCGTGA CCGACGAGCG CCTGATCGAC
ACGCTCGCCG CGACGGGCAC CCGCTATCGC GGCGCCGACG ACGACACGTG GATCGGCACG
CTCGCATCGT GGATCGTGCC GATCGCGGTG TTCGCGCTCG TCTGGAACCT GATGCTGCGG
CGCCCGCGCG GCGGCCTGCA GGACTGGTCG GGCGTCGGCA AGAGCAAGCC GCGCGTCTAT
GTGGAGGCGA AGACCGGCAT CGATTTCGAC GACATCGCGG GCATCGACGA GGCGAAGGCC
GAGCTCCAGC AGATCGTCGC GTTTCTGCGC GCGCCCGCGC GCTACCAGCG GCTCGGCGGC
AAGATCCCGA AGGGCGTGCT GATCGTCGGC GCGCCCGGCA CCGGCAAGAC GCTGCTCGCG
AAGGCGGTGG CGGGCGAGGC GGGCGTGCCG TTCTTCTCGA CGAGCGGCTC GTCGTTCGTC
GAGATGTTCG TCGGCGTCGG CGCGGCGCGC GTACGCGATC TGTTCGAGCA GGCGCAGCAA
AAGGCGCCGT GCATCATCTT CATCGACGAG CTCGACGCGC TCGGCAAGGT GCGCGGCGCG
GGGCTCGCGT CGGGCAACGA CGAGCGCGAG CAGACCCTGA ACCAGTTGCT CGTGGAGATG
GACGGCTTCC AGGCGAACTC CGGCGTGATC CTCATGGCGG CGACCAATCG TCCGGAGATT
CTCGATCCCG CGCTGCTGCG CCCGGGCCGC TTCGACCGCC ACATCGCGAT CGACCGGCCG
GACTTGACGG GGCGCCGGCA GATCCTGTCG GTCCACGTGA AGCACGTGAA GCTCGGCCCG
GACGTCGATC TCGGCGAGCT CGCGTCGCGC ACGCCCGGCT TCGTCGGCGC GGATCTCGCG
AACATCGTCA ACGAGGCGGC GCTGCACGCG GCCGAGCTCG ACAAGCCCGC GATCGACATG
TCCGATTTCG ACGAGGCGAT CGACCGCGCG ATGACCGGCA TGGAACGCAA GAGCCGCGTG
ATGAGCGAGC GCGAGAAGAT CACGATCGCG CATCACGAGG CGGGGCACGC GCTGATCGCG
CAGACGCGCG CGCACAGCGA TCCGGTGAAG AAGGTGTCGA TCATTCCGCG CGGCATCGCG
GCGCTCGGCT ACACGCAGCA GGTGCCGACC GAGGATCGCT ACGTGCTGCG CAAGAGCGAG
CTGCTCGACC GGCTCGACGT GCTGCTCGGC GGGCGCGTCG CCGAGGAGAT CGTGTTCGGC
GACGTGTCGA CGGGCGCGGA GAACGATCTC GAGCGCGCGA CCGAAATGGC GCGGCACATG
GTCGCCCGCT ACGGGATGAG CGAGCGGATC GGCCTCGCGA CGTTCGGCGA CGCGGACACC
CAGGGGCTGT CGCCCCTCGT CTGGCGGCGC GGCGGCGAGC GCTGCAGCGA GAGCACCGCG
ACGCGGATCG ACGACGAGAT CCAGCGGCTC CTCGCCGAGG CGCACGATCG CGTGTCGCGT
ACGCTGAAGG AGCGGCGCGG CGCGCTCGAA CGGATCGCCG GGTATCTGCT CGAGCACGAG
GTGGTCGATC ACGACAAGCT CGTGAGGCTC GTCAACGACG AGCCGACGCC CGAGCCCGGC
GCGCGCGATC CGGGCGGGGA CGCGGCGAAG CGAAGCGGCA TCGGCGCCGC GCCGGCGAAG
CCGCCGGCGG AAGTCGGGAG CGCCGAGCTT CGCGATCCGG CTCGAAAGGC CGACAACGCG
GACCACTCCG TGCCGCAGTG A
 
Protein sequence
MKSETGYMGF VVVLVFMVLL ALQLATLSAP ATQIAYSDFR KLAAAAQLDD LEVSPTRITG 
VLRSASAAAA LPASDAEAIK RAGTPWRFST KRVTDERLID TLAATGTRYR GADDDTWIGT
LASWIVPIAV FALVWNLMLR RPRGGLQDWS GVGKSKPRVY VEAKTGIDFD DIAGIDEAKA
ELQQIVAFLR APARYQRLGG KIPKGVLIVG APGTGKTLLA KAVAGEAGVP FFSTSGSSFV
EMFVGVGAAR VRDLFEQAQQ KAPCIIFIDE LDALGKVRGA GLASGNDERE QTLNQLLVEM
DGFQANSGVI LMAATNRPEI LDPALLRPGR FDRHIAIDRP DLTGRRQILS VHVKHVKLGP
DVDLGELASR TPGFVGADLA NIVNEAALHA AELDKPAIDM SDFDEAIDRA MTGMERKSRV
MSEREKITIA HHEAGHALIA QTRAHSDPVK KVSIIPRGIA ALGYTQQVPT EDRYVLRKSE
LLDRLDVLLG GRVAEEIVFG DVSTGAENDL ERATEMARHM VARYGMSERI GLATFGDADT
QGLSPLVWRR GGERCSESTA TRIDDEIQRL LAEAHDRVSR TLKERRGALE RIAGYLLEHE
VVDHDKLVRL VNDEPTPEPG ARDPGGDAAK RSGIGAAPAK PPAEVGSAEL RDPARKADNA
DHSVPQ