Gene BURPS668_A2129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2129 
Symbol 
ID4886783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2064116 
End bp2067136 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content73% 
IMG OID640132066 
ProductATP-dependent Clp protease, ATP-binding subunit ClpB 
Protein accessionYP_001063123 
Protein GI126443370 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR03345] type VI secretion ATPase, ClpV1 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCTGG TTGACCTGAA ACCGTTGATC GAACAGCTCA ACCCCTATTG CCGGAACGCG 
CTCGAAAGCG CCGTCGGCGC GTGCGTCGCG CGCCGGCACG ACGACGTCGC CGTCGAGCAC
CTGCTCGCGC GGCTGTGCGA CGAGCCGTCG GCCGACGTCG CGCTGTTGCT GCGCGCGTGC
GGCGCGGACG CGGCGCGCCT GCGCCGGCAG GCCGACGCCG CGCTCGACGC GCGCCCGGCG
GGCGACGGCG GCCGCCCCGC GTTCGCGCCG TCGCTGCTCG CGCTGTTGCA GGACGCGTGG
CTGATCGCGT CGCTCGAGCT CGGCGACACG CACATCCGCT CGGCGGCCGT GCTCGCCGCC
GCGGTCGCCC GCGCGGCGCG GCAGCCGTCG CCCGGCGGCG ACGACGTGCT GCAGTCGCTG
CGCAAGGACG CGCTCGTCGC GCGCTTTGCG AGCGGCGCGT GCGCGCGATC GATCGAATCG
CGCGCGTCCG GCGCGCTCGA CGCCGCGCCG GGCGAGGCGA GCGACACGCG CGACGCATCG
AGCGCGATCG CCCGCTACTG CGAGGACTTC ACCGCGAAAG CGCGCGCCGG CGGGATCGAT
CCGGTCTTCG GTCGCGACGC GGAAATCCGC CAGATCGTCG ACATCCTCGC GCGGCGGCGC
AAGAACAATC CGGTGTGCGT CGGCGAGCCG GGCGTGGGCA AGACGGCCGT CGTCGAAGGG
CTCGCGCTGC GGATCGCCGA AGGCGACGTG CCGGCGACGC TGCGCGGCGC GACGCTGCTC
GGCCTCGACC TCGGCATGCT GCAGGCGGGC GCGAGCGTGA AGGGCGAATT CGAGCAGCGG
CTCAAGCGCG TGATCGCCGA GATCCGCGCG TCGCAAACGC CCGTCGTGCT GTTCATCGAC
GAAGCGCACA CGCTGATCGG CGCGGGCGGC GCGGCGGGCG CGTCCGACGC GGCGAACCTG
CTGAAGCCCG CGCTCGCGCG CGGCGAGCTG CGCACGATCG CGGCCACCAC GTGGAGCGAA
TACAAGAAGT ATTTCGAGAA GGACGCGGCG CTCGCGCGGC GCTTCCAGCC GGTGAAGCTC
GACAGCCCGG ACGTCGCGAC GTCGGTGATG ATCCTGCGCG GGCTGAAGGA GCGCTATCAG
GACGCGCACG GCGTGACGAT CCGCGACGAC GCGCTCGTGG CCGCGGCCGA GCTGTCGGCG
CGCTACATCA CCGGCCGGCA GTTGCCGGAC AAGGCGATCG ACCTGCTCGA CACCGCGTGC
GCGCGCGTGA AGGTGCGCCA GCAGACCAAG CCCGCCGCGC TGGAGGACGC GCAGCGCGCG
ATCCAGGCGC TCGAGCGCGA GCGGCGCGCG CTGCGCGACG AATTGGCCGA GCGCTGCGCG
CCGGATACGC CGCGCGTCGC CGACATCGAT CGCGAGCTCG CCGCGCTGTC CGCGCGCGCC
GGCGCGCTGC GCGACGCGTG GGCGGCGCAG CGCGAAGCCG CGCAGGCGCT CGTCGACGCA
CGCCGCGCGT GCCGGGCGGC GGCGGATGCA ACGGATGCGG CTGGAACGAC CGGAACGGCC
CATGCCGGCG AGCTCGCCGA TAGTGCCGAC GTTTCCGATA TCGCCGATGT CGCCAAGATG
GCCGATGCGG CCGACGCCGT CCCTGCCGAT GCCGAAACCG GCTCGAACGC CCGCGACGCC
GACGCACGCA TGCCCGCCGG GACGTGCGCG CGCGTCGCCC CGAACCCGGA TGCGCGGGCG
CGGCCCCCTC TGCGCGCGCC GCACGCCGAC GCGCGCGAGC GCGCGGCTCG TGCGCTCGCC
GAAGCGACGC GCCGCTTCGA GCACGCGCAG CGCGACACGC CGCTCGTGCG AATCGACGTC
GATCCGGACG CGATCGCCGA CGTCGTGGCC GACTGGACCG GCATCGCGGC CGGCAAGCTG
CGCCGCGATC GCGCGAACGT GATGCTGCGG CTCGCCGATA CGCTGCGCCG CCGCATCCGC
GGCCAGGATC ACGCGATCGA ACAGATCTCC GAAGCCGTGA AGGCGGGCGC GGCCGGCGTC
CACGATCCGC GCCGCCCGCT CGGCGTGTTC CTGCTCGCGG GGCCGTCGGG CACCGGCAAG
ACCGAGACCG CGCTCGCGGT CGCCGATGCG CTGTTCGGCG ACGAGCGCTC GATCGTCGTC
GTCAACATGA GCGAATTCCA GGAGCGGCAC GACGTGAGCC GCCTGATCGG CTCGCCGCCG
GGCTACGTCG GCTACGGCGA GGGCGGGATG CTGACCGAGG CGGTGCGCCA GCGGCCCTAT
TCGGTCGTGC TGCTAGACGA AGTCGAGAAG GCGCACCCCG ACGTGCTGAA CCTGTTCTAC
CAGGTGTTCG ACAAGGGTTC GCTGTCCGAC GGCGAAGGCA AGGAGGTCGA CTTCGCGAAC
ACGGTGATCT TCCTCACGTC GAATCTCGGC GCGGACATCA TCGCCGACAT CGCCGCGCGC
GGCGCGCGGC CCGATCCGGA CGCGATGCGC GCGGCGGTGC GCCCGGCGCT GTCGCGCCAT
TTCAAGCCGG CGCTGCTCGC GCGGATGACC GAGATTCCGT ATGCGCCGCT CGCGCCCGAT
ACGCTCGCCG ACATCGCGCG GCTGAAGCTC GAGCGCATCG CGGCGCGCGT CGCCGCGCAG
CACGCGACGC GCATCGTCTA CGACGACGCC GTCGTCGCGC ACGTCGCCGC GCGCTGCACC
GAAGTCGAAT CGGGCGCGCG CAACGTCGAT TTCATCCTGC AGCGCCACGT GCTGCCCGCG
CTCGCGCAGC ACGTGCTCGT GTGCGCGGGC AACGCGGCGC CGATGCCTGC GATTCGTGTC
GCCATCGACG CCGGCGGCCG CTTCGTCGTC TCGAACGACG CGCCGGCCGA CTCCGATGCA
CGGGACCGGA GCGATGCGTC CGATGCGATC GATGCGATCG ACGCGGTCGA CGCGGTCGAT
GCGGTCGATG CACCCAACGC ACCCAACGCA CCCAACGCAC CCAACGCACC CAACGCACCC
AACGCACCCA ACGCACGCTG A
 
Protein sequence
MLLVDLKPLI EQLNPYCRNA LESAVGACVA RRHDDVAVEH LLARLCDEPS ADVALLLRAC 
GADAARLRRQ ADAALDARPA GDGGRPAFAP SLLALLQDAW LIASLELGDT HIRSAAVLAA
AVARAARQPS PGGDDVLQSL RKDALVARFA SGACARSIES RASGALDAAP GEASDTRDAS
SAIARYCEDF TAKARAGGID PVFGRDAEIR QIVDILARRR KNNPVCVGEP GVGKTAVVEG
LALRIAEGDV PATLRGATLL GLDLGMLQAG ASVKGEFEQR LKRVIAEIRA SQTPVVLFID
EAHTLIGAGG AAGASDAANL LKPALARGEL RTIAATTWSE YKKYFEKDAA LARRFQPVKL
DSPDVATSVM ILRGLKERYQ DAHGVTIRDD ALVAAAELSA RYITGRQLPD KAIDLLDTAC
ARVKVRQQTK PAALEDAQRA IQALERERRA LRDELAERCA PDTPRVADID RELAALSARA
GALRDAWAAQ REAAQALVDA RRACRAAADA TDAAGTTGTA HAGELADSAD VSDIADVAKM
ADAADAVPAD AETGSNARDA DARMPAGTCA RVAPNPDARA RPPLRAPHAD ARERAARALA
EATRRFEHAQ RDTPLVRIDV DPDAIADVVA DWTGIAAGKL RRDRANVMLR LADTLRRRIR
GQDHAIEQIS EAVKAGAAGV HDPRRPLGVF LLAGPSGTGK TETALAVADA LFGDERSIVV
VNMSEFQERH DVSRLIGSPP GYVGYGEGGM LTEAVRQRPY SVVLLDEVEK AHPDVLNLFY
QVFDKGSLSD GEGKEVDFAN TVIFLTSNLG ADIIADIAAR GARPDPDAMR AAVRPALSRH
FKPALLARMT EIPYAPLAPD TLADIARLKL ERIAARVAAQ HATRIVYDDA VVAHVAARCT
EVESGARNVD FILQRHVLPA LAQHVLVCAG NAAPMPAIRV AIDAGGRFVV SNDAPADSDA
RDRSDASDAI DAIDAVDAVD AVDAPNAPNA PNAPNAPNAP NAPNAR