Gene BURPS668_A2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2202 
Symbol 
ID4886414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2134032 
End bp2135621 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content72% 
IMG OID640132139 
Productserine protease, kumamolysin 
Protein accessionYP_001063196 
Protein GI126445010 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.566013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGGC ATCTTCACGC CGGCAACGAA TCGCATCTCG TCGCCGAATC CACGTGCATC 
GGGCCGTGCG ATCCGGCCGA GACGATTCAT GTAGTGGTGA TGTTGCGGCG ACAGCAAGAG
CAGCACCTCG ATTCATTGTT GCAGGGCCTC GCGAGCGGCG ATCCGGGCGT GAAGCCTGTC
TCGCGCGAGG CGTTCGCCCA GCGTTTCGGC GCGCATCCCG ACGACGTCAT GAAAGTCGAG
GCATTCGCGC AGCAGCGCGG CCTCGCGGTC GCGCGCGTCG ATCCGGTCGA GAGCCTCGTC
GTGCTGTCGG GCACGATCGC GCAGTTCGAG GCGGCCTTCG GCGTGAAGCT CGAGCGCTTC
GAGCATCGGT CGATCGGCCA GTATCGCGGC CGCACGGGCG ATATCACGCT GCCCGACGAG
TTGCACGGCA TCGTCACCGC GGTGCTCGGG CTCGACGATC GCCCGCAGGC CCGGCCGCAT
TTCCGGCTGC GGCCGACTTT CCTGCCCGCG CGCGCGCCGG CCGTCACCTA CACGCCGCCG
CAGCTCGCGG CCCTCTACGA TTTCCCGCCC GGCGACGGCG CGGGCCAGTG CATCGCGATC
GTCGAGCTCG GCGGCGGCTA TCGGCCGGCC GAGATCCAGC AGTATTTCGG CGGCCTCGGG
CTCGCGCGGC AGCCGAAGCT CGTCGACGTG AGCGTCGGCG CGGGGCGCAA CGCGCCGACG
GGCGATCCGA GCGGGCCGGA CGGCGAAGTC GCGCTCGATA TCGAGATCGC GGGCGCGATC
GCGCCCGGCG CGACGATTGC CGTCTATTTC GCGCAGAACA GCGACGCCGG CTTCATCCAG
GCGGTCAATC AGGCGGTGCA CGACACGACG AACCGGCCCT CCGTCGTGTC GATCAGTTGG
GGCGCGGCGG AGGCGAACTG GACGTCGCAA TCGATCCAGG CCTTCGATCG CGTGCTGCAG
TCGGCCGCGG CGCTCGGCGT GACCGTGTGC GCGGCGTCCG GCGATGACGG CTCGAACGAC
GGCCTGCAGG ACGGCACGAA TCACGTCGAT TTCCCGGCAT CGAGCCCGTA CGTGCTCGCG
TGCGGCGGCA CGCGGCTCGA CGCGCTGCCG GGGCAGGGCA TCCGCAGCGA AGTGGTGTGG
AACGACGAGG CGGCGGGCGG CGGCGCGACG GGCGGCGGCG TCAGCGCCGT GTTCGACGTG
CCGCAGTGGC AGAGCGGCCT GAGCGCGACG CTCGCGCAGG GCGGCGGCGC GTCGCCGCTC
GTGAAGCGCG GCGTGCCGGA CGTCGCGGGC GATGCGTCGC CCGCGACGGG CTACGAGGTG
TTCGTCGCGG GCACGTCGAC GGTGATGGGC GGCACGAGCG CCGTCGCACC GCTGTGGGCC
GCGCTCGTCG CGCGGATCAA TGCGGCGGCG GGCAGCCCCG CGGGCTGGAT CAACCCGAAG
CTGTACCGGA ACGCGGGCGC GCTGCACGAC ATCTCGGTGG GCGATAACGG CGCGTATGCG
GCGACGCCGG GCTGGGACGC GTGCACGGGG CTCGGCAGCC CGGACGGCGC GAAGGTCGCG
GCGGCGCTGA AGGGCGGCGC GGCGGGCTGA
 
Protein sequence
MARHLHAGNE SHLVAESTCI GPCDPAETIH VVVMLRRQQE QHLDSLLQGL ASGDPGVKPV 
SREAFAQRFG AHPDDVMKVE AFAQQRGLAV ARVDPVESLV VLSGTIAQFE AAFGVKLERF
EHRSIGQYRG RTGDITLPDE LHGIVTAVLG LDDRPQARPH FRLRPTFLPA RAPAVTYTPP
QLAALYDFPP GDGAGQCIAI VELGGGYRPA EIQQYFGGLG LARQPKLVDV SVGAGRNAPT
GDPSGPDGEV ALDIEIAGAI APGATIAVYF AQNSDAGFIQ AVNQAVHDTT NRPSVVSISW
GAAEANWTSQ SIQAFDRVLQ SAAALGVTVC AASGDDGSND GLQDGTNHVD FPASSPYVLA
CGGTRLDALP GQGIRSEVVW NDEAAGGGAT GGGVSAVFDV PQWQSGLSAT LAQGGGASPL
VKRGVPDVAG DASPATGYEV FVAGTSTVMG GTSAVAPLWA ALVARINAAA GSPAGWINPK
LYRNAGALHD ISVGDNGAYA ATPGWDACTG LGSPDGAKVA AALKGGAAG