Gene BURPS1710b_A0605 details

Gene Information

Locus tag: BURPS1710b_A0605
Symbol:
ID: 3692255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 797827
End bp: 799491
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 66%
IMG OID: 637730859
Product: serine-type carboxypeptidase family protein
Protein accession: YP_335764
Protein GI: 76819704
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
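
The coordinate and length fields in this record are related by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(797827, 799491)
print(length_bp)                  # 1665
print(protein_length(length_bp))  # 554
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed in the record.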


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATCG ATTCGACTTC CTCCGGCGGC GCGCAGCCGC TCCATCACGG CGCGAACGGC 
TCGGTTCACG CGCCGCCGCC GGTCATCGTC GCGCCGAAGG ACGACGGCGA CCAGCCGTTC
TTCGATCCGG TCGCCTACGG CAACGGCCCC GACGATTCGG TGACGGACAC CACCGAGGCC
GCCGCGATCA CGCACCACAC GGTCCGGATC GACGGCCGCA CGATCGCGTA CACGGCCGCG
GCGGGCCATC TCGTGACCGT CGATCCGAGC AGCTCGCAGC CGGATGCGAA GATCTTCTAC
GTCGCGTTCA CGCAGGACGG CCAGCAGGAG CAAACGCGCC CCGTCACGTT CTTCTACAAC
GGCGGGCCGG GCTCGTCGGC CGTGTTCGTG CTGCTCGGCT CGTTCGCGCC GCGGCGCATC
CGCACGTCGA TGCCGAGCTT CACGCCGCCC GCGCCGTACC GGATGGAAGA CAACCCGGAC
AGCCTGCTCG ACAAGAGCGA TCTCGTGTTC ATCAACCCGG TCGGCACCGG CTATTCGGCG
GCGATCGCGC CGCGCAAGAA CCGCGATTTC TGGGGCGTCG ATCAGGACGC GAACTCGATC
AAGCAGTTCA TCAAGCGCTA TCTGACGAAG CACAACCGGT GGAATTCGCC GAAGTACCTG
TTCGGCGAAT CGTACGGCAC CGCGCGCAGC TGCGTGCTCG CGTACAAGCT GCACGAGGAC
GGCGTCGACC TGAACGGGAT CACGCTGCAG TCGTCGATTC TCGACTACCG GCAGGCGGGC
AATCCGGTGG GCGCGCTGCC CACCGCGGCG GCCGACGCGT GGTATCACAA GCGGCTCGGC
GTCGCGCCGA CGCCGACCGA TCTCGGCGCG TTCGTGGAGG AGGTCGCGCA GTTCGCGCGC
ACCGACTATC TCGGCGCGCT GCGCAAGTTC CCGCAGGCCG ATGCGGCCGT CGTCAAGAAG
CTGTCCGACT ACACCGGCAT CGACACGACG ACGTTGCTGT CGTGGAGCCT CGACATCGCG
GGCTACGACG CGCGCGGCAA CGCGCTGTTC CTCACGACGC TGCTGAAGGC ACAAGGCCTC
GCGCTCGGCG CGTACGACGG CCGCGTGACG GGAATCGAAT CGGGGATCGC GGGCCGGATC
GATCCGAACT CGGGCGGCAA CGATCCGACG ATGACGGCTG TGTCGGGCGT CTACACGGCG
ATGTGGAATA CGTACCTGAA CGAGCAGTTG AAGTACACGT CGAACTCGTC GTTCACCGAC
CTGAACGACC AGGCATTCAA GTACTGGGAC TTCGGCCACA TCGATCCGAC GGGCGAACAG
CAGGGCGTCG ACGCGAAGGG CAACGTGATC CTGTACACGG CGGGCGATCT CGCCGCGACG
ATGGCGCTCA ACGTCGATCT GAAGGTGCTC TCGGCGAACG GGCTCTACGA TTTCGTCACG
CCGTTCTACC AGACGGTGCT CGATCTGCAG CAGATGCCGC TAGAGGACCC GAAGGTGCGG
CAGAACCTGT CCGCGCGCTT CTATCCGTCC GGACACATGG TGTACCTCGA CGGCGGCTCG
CGCACCACGC TCAAGCACGA CCTCGCGCAG ATGTACGAAT CGACGGTGCG CGACACCGCG
GCGGTGATGC GCATTCGCGC GTTGCAGGAG AAAAAGCGCG CGTAG
 
Protein sequence
MSIDSTSSGG AQPLHHGANG SVHAPPPVIV APKDDGDQPF FDPVAYGNGP DDSVTDTTEA 
AAITHHTVRI DGRTIAYTAA AGHLVTVDPS SSQPDAKIFY VAFTQDGQQE QTRPVTFFYN
GGPGSSAVFV LLGSFAPRRI RTSMPSFTPP APYRMEDNPD SLLDKSDLVF INPVGTGYSA
AIAPRKNRDF WGVDQDANSI KQFIKRYLTK HNRWNSPKYL FGESYGTARS CVLAYKLHED
GVDLNGITLQ SSILDYRQAG NPVGALPTAA ADAWYHKRLG VAPTPTDLGA FVEEVAQFAR
TDYLGALRKF PQADAAVVKK LSDYTGIDTT TLLSWSLDIA GYDARGNALF LTTLLKAQGL
ALGAYDGRVT GIESGIAGRI DPNSGGNDPT MTAVSGVYTA MWNTYLNEQL KYTSNSSFTD
LNDQAFKYWD FGHIDPTGEQ QGVDAKGNVI LYTAGDLAAT MALNVDLKVL SANGLYDFVT
PFYQTVLDLQ QMPLEDPKVR QNLSARFYPS GHMVYLDGGS RTTLKHDLAQ MYESTVRDTA
AVMRIRALQE KKRA
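
As a rough consistency check between the gene sequence and the protein sequence, the nucleotide blocks can be translated codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns sense codons the same amino acids as the standard code (the two differ in which start codons are permitted), and it assumes the sequence is given in the coding orientation with the space-separated 10-base blocks used in this record. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; sense-codon assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence from this record.
first_row = "ATGAGCATCG ATTCGACTTC CTCCGGCGGC GCGCAGCCGC TCCATCACGG CGCGAACGGC"
print(translate(first_row))  # MSIDSTSSGGAQPLHHGANG
```

The translated first row matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.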