Gene BURPS1106A_A2116 details

Gene Information

Locus tag: BURPS1106A_A2116
Symbol:
ID: 4904352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2072783
End bp: 2074372
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 72%
IMG OID: 640145221
Product: kumamolisin
Protein accession: YP_001076149
Protein GI: 126457317
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
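
The Gene Length and Protein Length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, although both are consistent with its values. The function names are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2072783, 2074372)
    print(length)                  # 1590
    print(protein_length(length))  # 529
```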


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.229792
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAGGC ATCTTCACGC CGGCAACGAA TCGCATCTCG TCGCCGAATC CACGTGCATC 
GGGCCGTGCG ATCCGGCCGA GACGATTCAT GTAGTGGTGA TGTTGCGGCG ACAGCAAGAG
CAGCACCTCG ATTCATTGTT GCAGGGCCTC GCGAGCGGCG ATCCGGGCGT GAAGCCTGTC
TCGCGCGAGG CGTTCGCCCA GCGTTTCGGC GCGCATCCCG ACGACGTCAT GAAAGTCGAG
GCATTCGCGC AGCAGCGCGG CCTCGCGGTC GCGCGCGTCG ATCCGGTCGA GAGCCTCGTC
GTGCTGTCGG GCACGATCGC GCAGTTCGAG GCGGCCTTCG GCGTGAAGCT CGAGCGCTTC
GAGCATCGGT CGATCGGCCA GTATCGCGGC CGCACGGGCG ATATCACGCT GCCCGACGAG
TTGCACGGCA TCGTCACCGC GGTGCTCGGG CTCGACGACC GCCCGCAGGC CCGGCCGCAT
TTCCGGCTGC GGCCGACTTT CCTGCCCGCG CGCGCGCCGG CCGTCACCTA CACGCCGCCG
CAGCTCGCGG CCCTCTACGA TTTCCCGCCC GGCGACGGCG CGGGCCAGTG CATCGCGATC
GTCGAGCTCG GCGGCGGCTA TCGGCCGGCC GAGATCCAGC AGTATTTCGG CGGCCTCGGG
CTCGCGCGGC AGCCGAAGCT CGTCGACGTG AGCGTCGGCG CGGGGCGCAA CGCGCCGACG
GGCGATCCGA GCGGGCCGGA CGGCGAAGTC GCGCTCGATA TCGAGATCGC GGGCGCGATC
GCGCCCGGCG CGACGATTGC CGTCTATTTC GCGCAGAACA GCGACGCCGG CTTCATCCAG
GCGGTCAATC AGGCGGTGCA CGACACGACG AACCGGCCCT CCGTCGTGTC GATCAGTTGG
GGCGCGGCGG AGGCGAACTG GACGTCGCAA TCGATCCAGG CCTTCGATAG CGTGCTGCAG
TCGGCCGCGG CGCTCGGCGT GACCGTGTGC GCGGCGTCCG GCGATGACGG CTCGAACGAC
GGCCTGCAGG ACGGCACGAA TCACGTCGAT TTCCCGGCAT CGAGCCCGTA CGTGCTCGCG
TGCGGCGGCA CGCGGCTCGA CGCACTGCCG GGGCAGGGCA TCCGCAGCGA AGTCGTGTGG
AACGACGAGG CGGCGGGCGG CGGCGCGACG GGCGGCGGCG TCAGCGCCGT GTTCGACGTG
CCGCAGTGGC AGAGCGGCCT GAGCGCGACG CTCGCGCAGG GCGGCGGCGC GTCGCCGCTC
GCGAAGCGCG GCGTGCCGGA CGTCGCGGGC GATGCGTCGC CCGCGACGGG CTACGAGGTG
TTCGTCGCGG GCACGTCGAC GGTGATGGGC GGCACGAGCG CCGTCGCACC GCTGTGGGCC
GCGCTCGTCG CGCGGATCAA TGCGGCGGCG GGCAGCCCCG CGGGCTGGAT CAACCCGAAG
CTGTACCGGA ACGCGGGCGC GCTGCACGAC ATCTCGGTGG GCGATAACGG CGCATATGCG
GCGACGCCGG GCTGGGACGC GTGCACGGGG CTCGGCAGCC CGAACGGCGC GAAGGTCGCG
GCGGCGCTGA AGGGCGGCGC GGCGGGCTGA
 
Protein sequence
MARHLHAGNE SHLVAESTCI GPCDPAETIH VVVMLRRQQE QHLDSLLQGL ASGDPGVKPV 
SREAFAQRFG AHPDDVMKVE AFAQQRGLAV ARVDPVESLV VLSGTIAQFE AAFGVKLERF
EHRSIGQYRG RTGDITLPDE LHGIVTAVLG LDDRPQARPH FRLRPTFLPA RAPAVTYTPP
QLAALYDFPP GDGAGQCIAI VELGGGYRPA EIQQYFGGLG LARQPKLVDV SVGAGRNAPT
GDPSGPDGEV ALDIEIAGAI APGATIAVYF AQNSDAGFIQ AVNQAVHDTT NRPSVVSISW
GAAEANWTSQ SIQAFDSVLQ SAAALGVTVC AASGDDGSND GLQDGTNHVD FPASSPYVLA
CGGTRLDALP GQGIRSEVVW NDEAAGGGAT GGGVSAVFDV PQWQSGLSAT LAQGGGASPL
AKRGVPDVAG DASPATGYEV FVAGTSTVMG GTSAVAPLWA ALVARINAAA GSPAGWINPK
LYRNAGALHD ISVGDNGAYA ATPGWDACTG LGSPNGAKVA AALKGGAAG
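
The gene and protein sequences above are related by translation table 11, and the GC content field is a base-composition percentage. The following is a minimal sketch of how both could be checked, assuming the sequences are pasted in with their 10-character spacing intact. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The amino-acid string is the standard NCBI codon ordering over the bases T, C, A, G, which table 11 shares with the standard code for sense and stop codons. Alternative start codons are not handled here.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGCAAGGC ATCTTCACGC CGGCAACGAA"))  # MARHLHAGNE
```

Running `translate` on the full gene sequence and comparing the result with the listed protein sequence (after removing its spaces), or rounding `gc_percent` for comparison with the GC content field, would extend the same check to the whole record.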