Gene BMA10229_A1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A1410 
Symbol 
ID4792807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp1439537 
End bp1441111 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content66% 
IMG OID 
Productcarboxy-terminal protease 
Protein accessionYP_001027393 
Protein GI124383488 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATGA AATTGAAGAA CATCGGCCTG ATTGCCGCGG GCCTCGCGAC TGGCGTCTTC 
GCGACGCTGC AAATCTCCGC GTCGGCCCAG CAGGCCGTCA CGACGGCCGC CGCGCCGCTG
CCGCTCGACC AGTTGCGGCT CTTCGCTGAA GTGTTCGGGC AGATCAAGCG CGAATACGTC
GAGCCCGTCG ACGACAAGAA GCTGCTGACC GCGGCGATCA AGGGCATGGT GTCGAGCCTC
GATCCGCACT CGTCGTACCT CGACAAGACC GATTACCAGG AACTGCAGGA GCAGACGAAG
GGCCGCTTCG CAGGCCTCGG CATCGAGATT TCGCAGGAAG ACGGCCTCGT CAAGGTGATC
TCGCCGATCG AGGACACGCC CGCGTTCCGC GCCGGCATCC GTCCGGGCGA CCTGATCACC
CGCATCAACG ATCGCCCGGT GCGCGGCATG ACGCTCGACA AGGCGGTCAA GCAGATGCGC
GGCGAGCCCG GCACGAAGGT CACGCTGACG ATCTTCCGCA AGAGCGACGA CCGCACGTTC
CCCGTCACGG TCACGCGCGC GGTGATCCGC GTGCAGAGCG TGAAGATGAA GCTGCTCGAT
CCGGGCTACG CGTACATCCG CATCACGAGC TTCCAGGAGC GCACGACGCC CGATCTCGCC
GCGAAGCTGC AGGACATCGC GCGCCAGCAG CCGAACCTGA AGGGCCTGAT CCTCGATCTG
CGCAACAACG GCGGCGGCCT GCTGCAAAGC GCCGTCGGCG TCGCGGGCGC GTTCCTGCCG
CCGGATTCCG TCGTCGTGTC GACGAACGGC CAGATCCCCG ATTCGAAGCA GATCTACCGC
GACAACTACG AGAACTACCG CCTGCCGTCG TTCGACTCCG ATCCGCTGAA GAACCTGCCC
GCCGTCTTCA AGACGGTGCC GATGATCGTG CTGACGAACG CGTATTCGGC GTCGGCCTCG
GAAATCGTCG CGGGCGCGCT GCAGGATTCG CACCGCGCGG TGATCATGGG CAAGGCGACG
TTCGGCAAGG GCTCGGTGCA GACGGTGCGG CCGATGACGG CCGATTCCGC GCTGCGCCTG
ACGACCGCGT ACTACTACAC GCCGAGCGGC CGCTCGATCC AGAACAAGGG CATCCTGCCC
GACATTCCGG TCGATCAGTA CGCGGACGGC GATCCGGACG ACGTGCTCGT CACGCGCGAG
GTCGATTACA CGAACCACCT CGCGAACACG CAGGATCCGA ACGAGAAGAA GGAGCTCGAG
GAACGCGAGC AGCGCCGGAT GGAGCAGTTG CGCATCCTCG AGGAGCAGAA CGACAAGAAG
ACGCCCGAGC AGCGTCAGAA GGATCGCGAG CGCAAGCCGA TCGAATTCGG CAGCGCCGAC
GATTTCATGA TGCAGCAGGC GCTCAACAAG CTCGAAGGCA AGCCGGTCGA GCAGTCGAAG
ATGATCGCCG CCGACAGCAC CGCGAAGAGC GCCGCCGCCA AGGCGGGCGC CGCCTCGGCG
GCGAAGGGCG CGTCGGGCGC GGCGGCCAAG CCCGCGTCGG CTGCAAAGCC CGCGTCGGCG
CCGCAACCGC AGTAA
 
Protein sequence
MRMKLKNIGL IAAGLATGVF ATLQISASAQ QAVTTAAAPL PLDQLRLFAE VFGQIKREYV 
EPVDDKKLLT AAIKGMVSSL DPHSSYLDKT DYQELQEQTK GRFAGLGIEI SQEDGLVKVI
SPIEDTPAFR AGIRPGDLIT RINDRPVRGM TLDKAVKQMR GEPGTKVTLT IFRKSDDRTF
PVTVTRAVIR VQSVKMKLLD PGYAYIRITS FQERTTPDLA AKLQDIARQQ PNLKGLILDL
RNNGGGLLQS AVGVAGAFLP PDSVVVSTNG QIPDSKQIYR DNYENYRLPS FDSDPLKNLP
AVFKTVPMIV LTNAYSASAS EIVAGALQDS HRAVIMGKAT FGKGSVQTVR PMTADSALRL
TTAYYYTPSG RSIQNKGILP DIPVDQYADG DPDDVLVTRE VDYTNHLANT QDPNEKKELE
EREQRRMEQL RILEEQNDKK TPEQRQKDRE RKPIEFGSAD DFMMQQALNK LEGKPVEQSK
MIAADSTAKS AAAKAGAASA AKGASGAAAK PASAAKPASA PQPQ