Gene BURPS1106A_A2031 details


Gene Information

Locus tag: BURPS1106A_A2031
Symbol:
ID: 4906072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1998898
End bp: 2001954
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 73%
IMG OID: 640145136
Product: ATP-dependent Clp protease, ATP-binding subunit ClpB
Protein accession: YP_001076064
Protein GI: 126458158
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR03345] type VI secretion ATPase, ClpV1 family
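
The coordinate, gene-length, and protein-length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1998898
end_bp = 2001954
gene_length_bp = 3057
protein_length_aa = 1018

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which does not encode a residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)  # 3057 1019 1018
```

Under these assumptions the three fields agree: 3057 bp is 1019 codons, and 1019 codons minus one stop codon gives 1018 amino acids.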


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCTGG TTGACCTGAA ACCGTTGATC GAACAGCTCA ACCCCTATTG CCGGAACGCG 
CTCGAAAGCG CCGTCGGCGC GTGCGTCGCG CGCCGGCACG ACGACGTCGC CGTCGAGCAC
CTGCTCGCGC GGCTGTGCGA CGAGCCGTCG GCCGACGTCG CGCTGTTGCT GCGCGCGTGC
GGCGCGGACG CGGCGCGCCT GCGCCGGCAG GCCGACGCCG CGCTCGACGC GCGCCCGGCG
GGCGACGGCG GCCGCCCCGC GTTCGCGCCG TCGCTGCTCG CGCTGTTGCA GGACGCGTGG
CTGATCGCGT CGCTCGAGCT CGGCGACACG CACATCCGCT CGGCGGCCGT GCTCGCCGCC
GCGGTCGCCC GCGCGGCGCG GCAGCCGTCG CCCGGCGGCG ACGACGTGCT GCAGTCGCTG
CGCAAGGACG CGCTCGTCGC GCGCTTTGCG AGCGGCGCGT GCGCGCGATC GATCGAATCG
CGCGCGTCCG GCGCGCTCGA CGCCGCGCCG GGCGAGGCGA GCGACACGCG CGACGCATCG
AGCGCGATCG CCCGCTACTG CGAGGACTTC ACCGCGAAAG CGCGCGCCGG CGGGATCGAT
CCGGTCTTCG GTCGCGACGC GGAAATCCGC CAGATCGTCG ACATCCTCGC GCGGCGGCGC
AAGAACAATC CGGTGTGCGT CGGCGAGCCG GGCGTGGGCA AGACGGCCGT CGTCGAAGGG
CTCGCGCTGC GGATCGCCGA AGGCGACGTG CCGGCGACGC TGCGCGGCGC GACGCTGCTC
GGCCTCGACC TCGGCATGCT GCAGGCGGGC GCGAGCGTGA AGGGCGAATT CGAGCAGCGG
CTCAAGCGCG TGATCGCCGA GATCCGCGCG TCGCAAACGC CCGTCGTGCT GTTCATCGAC
GAAGCGCACA CGCTGATCGG CGCGGGCGGC GCGGCGGGCG CGTCCGACGC GGCGAACCTG
CTGAAGCCCG CGCTCGCGCG CGGCGAGCTG CGCACGATCG CAGCCACCAC GTGGAGCGAA
TACAAGAAGT ATTTCGAGAA GGACGCGGCG CTCGCGCGGC GCTTCCAGCC GGTGAAGCTC
GACAGCCCGG ACGTCGCGAC GTCGGTGATG ATCCTGCGCG GGCTGAAGGA GCGCTATCAG
GACGCGCACG GCGTGACGAT CCGCGACGAC GCGCTCGTCG CCGCGGCCGA GCTGTCGGCG
CGCTACATCA CCGGCCGGCA GTTGCCGGAC AAGGCGATCG ACCTGCTCGA CACCGCGTGC
GCGCGCGTGA AGGTGCGCCA GCAGACCAAG CCCGCCGCGC TGGAGGACGC GCAGCGCGCG
ATCCAGGCGC TCGAGCGCGA GCGGCGCGCG CTGCGCGACG AATTGGCCGA GCGCTGCGCG
CCGGATACGC CGCGCGTCGC CGACATCGAT CGCGAGCTCG CCGCACTGTC CGCGCGCGCC
GGCGCGCTGC GCGACGCGTG GGCGGCGCAG CGCGACGCCG CGCAGGCGCT CGTCGACGCA
CGCCGCGCGT GCCGGGCGGC GGCGGATGCA ACGGCTGCAA CGGATGCGGC TGGAACGACC
GGAACGGCCC ATGCCGGCGA GCTCGCCGAT AGTGCCGACG TTTCCGATAT CGCCGATGTC
GCCAAGATGG CCGATGCGGC CGACGCCGCC CCTGCCGATG CCGAAACCGG CTCGAACGCC
CGCGACGCCG ACGCACGCAT GCCCGCCGGG ACGTGCGCGC GCATCGCCCC GAACCCGGAT
GCGCGGGCGC GGCCCCCTCT GCGCGCGCCG CACGCCGACG CGCGCGAGCG CGCGGCTCAT
GCGCTCGCCG AAGCGACGCG CCGCTTCGAG CACGCGCAGC GCGACACGCC GCTCGTGCGA
ATCGACGTCG ATCCGGACGC GATCGCCGAC GTCGTGGCCG ACTGGACCGG CATCGCGGCC
GGCAAGCTGC GCCGCGATCG CGCGAACGTG ATGCTGCGGC TCGCCGATAC GCTGCGCCGC
CGCATCCGCG GCCAGGATCA CGCGATCGAA CAGATCTCCG AAGCCGTGAA GGCGGGCGCG
GCCGGCGTCC ACGATCCGCG CCGCCCGCTC GGCGTGTTCC TGCTCGCGGG GCCGTCGGGC
ACCGGCAAGA CCGAGACCGC GCTCGCGGTC GCCGATGCGC TGTTCGGCGA CGAGCGCTCG
ATCGTCGTCG TCAACATGAG CGAATTCCAG GAGCGGCACG ACGTGAGCCG CCTGATCGGC
TCGCCGCCGG GCTACGTCGG CTACGGCGAG GGCGGGATGC TGACCGAGGC GGTGCGCCAG
CGGCCCTATT CGGTCGTGCT GCTAGACGAA GTCGAGAAGG CGCACCCCGA CGTGCTGAAC
CTGTTCTACC AGGTGTTCGA CAAGGGTTCG CTGTCCGACG GCGAAGGCAA GGAGGTCGAC
TTCGCGAACA CGGTGATCTT CCTCACGTCG AATCTCGGCG CGGACATCAT CGCCGACATC
GCCGCGCGCG GCGCGCGGCC CGATCCGGAC GCGATGCGCG CGGCGGTACG CCCGGCGCTG
TCGCGCCATT TCAAGCCGGC GCTGCTCGCG CGGATGACCG AGATTCCGTA TGCGCCGCTC
GCGCCCGATA CGCTCGCCGA CATCGCGCGG CTGAAGCTCG AGCGCATCGC GGCGCGCGTC
GCCGCGCAGC ACGCGACGCG CATCGTCTAC GACGACGCCG TCGTCGCGCA CGTCGCCGCG
CGCTGCACCG AAGTCGAATC GGGCGCGCGC AACGTCGATT TCATCCTGCA GCGCCACGTG
CTGCCCGCGC TCGCGCAGCA CGTGCTCGTG TGCGCGGGCA ACGCGGCGCC GATGCCTGCG
ATTCGCGTCG CCATCGACGC CGGCGGCCGC TTCGTCGTCT CGAACGACGC GCCGGCCGAC
TCCGATGCAC GGGACCGGAG CGATGCGTCC GATGCGTCCG ATGCGTCTGA TGCGTCTGAT
GCGATCGACG CGGTCGACGC GGTCGATGCG GTCGATGCAC CCAACGCACC CAACGCACCC
AACGCACCCA ACGCACCCAA CGCACCCAAC GCACCCAACG CACCCAACGC ACGCTGA
 
Protein sequence
MLLVDLKPLI EQLNPYCRNA LESAVGACVA RRHDDVAVEH LLARLCDEPS ADVALLLRAC 
GADAARLRRQ ADAALDARPA GDGGRPAFAP SLLALLQDAW LIASLELGDT HIRSAAVLAA
AVARAARQPS PGGDDVLQSL RKDALVARFA SGACARSIES RASGALDAAP GEASDTRDAS
SAIARYCEDF TAKARAGGID PVFGRDAEIR QIVDILARRR KNNPVCVGEP GVGKTAVVEG
LALRIAEGDV PATLRGATLL GLDLGMLQAG ASVKGEFEQR LKRVIAEIRA SQTPVVLFID
EAHTLIGAGG AAGASDAANL LKPALARGEL RTIAATTWSE YKKYFEKDAA LARRFQPVKL
DSPDVATSVM ILRGLKERYQ DAHGVTIRDD ALVAAAELSA RYITGRQLPD KAIDLLDTAC
ARVKVRQQTK PAALEDAQRA IQALERERRA LRDELAERCA PDTPRVADID RELAALSARA
GALRDAWAAQ RDAAQALVDA RRACRAAADA TAATDAAGTT GTAHAGELAD SADVSDIADV
AKMADAADAA PADAETGSNA RDADARMPAG TCARIAPNPD ARARPPLRAP HADARERAAH
ALAEATRRFE HAQRDTPLVR IDVDPDAIAD VVADWTGIAA GKLRRDRANV MLRLADTLRR
RIRGQDHAIE QISEAVKAGA AGVHDPRRPL GVFLLAGPSG TGKTETALAV ADALFGDERS
IVVVNMSEFQ ERHDVSRLIG SPPGYVGYGE GGMLTEAVRQ RPYSVVLLDE VEKAHPDVLN
LFYQVFDKGS LSDGEGKEVD FANTVIFLTS NLGADIIADI AARGARPDPD AMRAAVRPAL
SRHFKPALLA RMTEIPYAPL APDTLADIAR LKLERIAARV AAQHATRIVY DDAVVAHVAA
RCTEVESGAR NVDFILQRHV LPALAQHVLV CAGNAAPMPA IRVAIDAGGR FVVSNDAPAD
SDARDRSDAS DASDASDASD AIDAVDAVDA VDAPNAPNAP NAPNAPNAPN APNAPNAR
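
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides codon by codon. The sketch below is one way to do that in plain Python. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and are not part of any database tooling.

```python
# Codon table built from the compact TCAG ordering of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in this record.
first_line = "ATGCTGCTGG TTGACCTGAA ACCGTTGATC"
print(translate(first_line))  # MLLVDLKPLI
```

The ten residues produced match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.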