Gene BURPS1106A_3181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3181 
Symbol 
ID4903176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3096758 
End bp3098605 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content66% 
IMG OID640136407 
Productcarboxypeptidase C 
Protein accessionYP_001067419 
Protein GI126452334 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.961281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATAC AGAAGTCCTT GAAAGACGGT TTCACGCTCG GATGGTGCAG GGCGGCACGG 
CCGGTTGCCG CTGCCGCGCT GGCCGCGCTG CTCGTCGCCG CGTGCGGCGG CGACGACGGC
GGCGGCGGGA GCCCGTCGCT CGCGGCCGCG AACGTCGCGA ACACGAGCAC GCCGACGAAC
GCGACGACGG CCGCCGATGC GACGACCAAT GCCGCGCTGC CGCCGGATCA GCCGTATATC
GACAACGACG TCTATGGCAC CGGGCCGAAC GATTCGGTCA GCGACGCGAC GGAGGGCACC
GCGGTCGTGC ACCGGCAGGT GAAGATCGGC GATCAGATCC TCACCTACAC GGCGACGGCC
GGCCACCTCG TGACGATCGA TCCGATCACG TCGAAGCCGA ACGCGAAGAT GTTCTACGTC
GCGTACACGC TCGACAATCC GAACCCGGGC AAGCCGCGCC CCGTCACGTT CTTCTACAAC
GGCGGCCCGG GCTCGTCGTC GGTGTACCTG CTGCTGGGCT CGTTCGGGCC GAAGCGCCTG
CAGTCGTCGT TCCCGAACTT CACGCCGCCC GCGCCGTACC GGCTGCGCGA CAACCCCGAG
AGCCTGCTCG ACCGCTCCGA TCTCGTGTTC ATCAATCCGG TCGGCACCGG CTACTCGGCC
GCGATCGCGC CGGCGAAGAA CAAGGATTTC TGGGGCGTCG ACCAGGACGC GCACTCGATC
GACCGCTTCA TCCAGCGCTA CCTGACGAAG TACGCGCGCT GGAACTCGCC GAAGTTCCTG
TTCGGCGAAT CGTACGGCAC GGCGCGCAGC GCGGTGACCG CGTGGGTGCT GCATGAGGAC
GGCATCGAGC TGAACGGGAT CACGCTGCAG TCGTCGATTC TCGACTATGC GAACGCGGTG
AGCGCGATCG GCATCTTCCC GACGCTCGCG GCCGATGCGT TCTACTGGAA CAAGACGACC
ATCAGCCCGA AACCGGCCGA TCTGGACGCA TACATGGCGC AGGCGCGCAG CTATGCGGAC
AACGTGCTCG CGCCGCTCGC GCAGGCGCCG AATCCGCAGG ACGGCGGCTT CGTCAACGTG
CGGCTGAACC TGAACGTCGC GACCGCGCAG CAGATGGGCG CGTACATCGG CACCGATCCG
ATCTCGCTGG TCCAGACGTT CGGCAATCCG GCCGCGCTCG GCAACGTGCC GTCGTCCAAC
GACAACCCGC CGTACACGTT CTTCCTGACG CTCGTGCCGG GCATCCAGAT CGGCCAGTAC
GACGGACGCG CGAACTACAC GGGCAAGGGC ATCGCGCCGT ACATCCTGCC GAACTCGGGC
AGCAACGATC CGTCGATCAG CAACGTCGGC GGCGCGTACA CGGTGCTGTG GAACGACTAC
ATCAACAACG ACCTGAAGTA TGTGTCGACG TCGTCGTTCG TCGATCTGAA CGACCAGGTG
TTCAACAACT GGGACTTCAG CCACACGGAC CCGACGGGCG CGAACCGCGG CGGCGGCAAC
ACGCTGTACA CGGCGGGCGA TCTCGCCGCG ACGATGAGCC TGAACCCGGA CCTGAAGGTG
CTGTCGGCGA ACGGCTATTT CGACGCGGTG ACGCCGTTCC ACCAGACCGA GCTCACGCTC
GCGCAGATGC CGCTCGATCC GTCGCTGAAG TCGGCGAACC TGACGATGAA ATACTATCCG
TCGGGCCACA TGATCTATCT GAACGATCAC TCGCGGATCG CGATGAAGGC GGATCTGGCG
ACGTTCTACG ACGGCATCCT CGCGGACCGC ACGGCGATGC GGCGCGTGCT GCTGCGCCAG
CAGAAGGCGC TGCAGTTGAA GCAGCAGAAG CAACAGCAAG GGCAGTGA
 
Protein sequence
MKIQKSLKDG FTLGWCRAAR PVAAAALAAL LVAACGGDDG GGGSPSLAAA NVANTSTPTN 
ATTAADATTN AALPPDQPYI DNDVYGTGPN DSVSDATEGT AVVHRQVKIG DQILTYTATA
GHLVTIDPIT SKPNAKMFYV AYTLDNPNPG KPRPVTFFYN GGPGSSSVYL LLGSFGPKRL
QSSFPNFTPP APYRLRDNPE SLLDRSDLVF INPVGTGYSA AIAPAKNKDF WGVDQDAHSI
DRFIQRYLTK YARWNSPKFL FGESYGTARS AVTAWVLHED GIELNGITLQ SSILDYANAV
SAIGIFPTLA ADAFYWNKTT ISPKPADLDA YMAQARSYAD NVLAPLAQAP NPQDGGFVNV
RLNLNVATAQ QMGAYIGTDP ISLVQTFGNP AALGNVPSSN DNPPYTFFLT LVPGIQIGQY
DGRANYTGKG IAPYILPNSG SNDPSISNVG GAYTVLWNDY INNDLKYVST SSFVDLNDQV
FNNWDFSHTD PTGANRGGGN TLYTAGDLAA TMSLNPDLKV LSANGYFDAV TPFHQTELTL
AQMPLDPSLK SANLTMKYYP SGHMIYLNDH SRIAMKADLA TFYDGILADR TAMRRVLLRQ
QKALQLKQQK QQQGQ