Gene BURPS668_A2200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2200 
Symbol 
ID4886438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2131721 
End bp2133385 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content66% 
IMG OID640132137 
Productserine carboxypeptidase family protein 
Protein accessionYP_001063194 
Protein GI126443582 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.555603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCG ATTCGACTTC CTCCGGCGGC GCGCAGCCGC TCCATCACGG CGCGAACGGC 
TCGGTTCACG CGCCGCCGCC GATCATCGTC GCGCCGAAGG ACGACGGCGA CCAGCCGTTC
TTCGATCCGG TCGCCTACGG CAACGGCCCC GACGATTCGG TGACGGACAC CACCGAGGCC
GCCGCGATCA CGCACCACAC GGTCCGGATC GACGGCCGCA CGATCGCGTA CACGGCCGCG
GCGGGCCATC TCGTGACTGT CGATCCGAGC AGCTCGCAGC CGGATGCGAA GATCTTCTAC
GTCGCGTTCA CGCAGGACGG CCAGCAGGAG CAAACGCGCC CCGTCACGTT CTTCTACAAC
GGCGGGCCGG GCTCGTCGGC CGTGTTCGTG CTGCTCGGCT CGTTCGCGCC GCGGCGCATC
CGCACGTCGA TGCCGAGCTT CACGCCGCCC GCGCCGTACC GGATGGAAGA CAACCCGGAC
AGCCTGCTCG ACAAGAGCGA TCTCGTGTTC ATCAACCCGG TCGGCACCGG CTATTCGGCG
GCGATCGCGC CGCGCAAGAA CCGCGATTTC TGGGGCGTCG ATCAGGACGC GAACTCGATC
AAGCAGTTCA TCAAGCGCTA TCTGACGAAG CACAACCGGT GGAATTCGCC GAAGTACCTG
TTCGGCGAAT CGTACGGCAC CGCGCGCAGC TGCGTGCTCG CGTACAAGCT GCACGAGGAC
GGCGTCGACC TGAACGGGAT CACGCTGCAG TCGTCGATTC TCGATTACCG GCAGGCGGGC
AATCCGGTGG GCGCGCTGCC CACCGCGGCG GCCGACGCGT GGTATCACAA GCGGCTCGGC
GTCGCGCCGA CGCCGACCGA TCTCGGCGCG TTCGTGGAGG AGGTCGCGCA GTTCGCGCGC
ACCGACTATC TCGGCGCGCT GCGCAAGTTC CCGCAGGCCG ATGCGGCCGT CGTCAAGAAG
CTGTCCGACT ACACCGGCAT CGACACGACG ACGTTGCTGT CGTGGAGCCT CGACATCGCG
GGCTACGACG CGCGCGGCAA CGCGCTGTTC CTCACGACGC TGCTGAAGGC ACAAGGCCTC
GCGCTCGGCG CGTACGACGG CCGCGTGACG GGAATCGAAT CGGGGATCGC GGGCCGGATC
GATCCGAACT CGGGCGGCAA CGATCCGACG ATGACGGCGG TGTCGGGCGT CTACACGGCG
ATGTGGAATA CGTACCTGAA CGAGCAGTTG AAATACACGT CGAACTCGTC GTTCACCGAC
CTGAACGACC AGGCATTCAA GTACTGGGAC TTCGGCCACA TCGATCCGAC GGGCGAACAG
CAGGGCGTCG ACGCGAAGGG CAACGTGATC CTGTACACGG CGGGCGATCT CGCCGCGACG
ATGGCGCTCA ACGTCGATCT GAAGGTGCTG TCGGCGAACG GGCTCTACGA TTTCGTCACG
CCGTTCTACC AGACGGTGCT CGATCTGCAG CAGATGCCGC TCGAGGACCC GAAGGTGCGG
CAGAACCTGT CCGCGCGCTT CTATCCGTCC GGGCACATGG TGTACCTCGA CGGCGGCTCG
CGCACCACGC TCAAGCACGA CCTCGCGCAG ATGTACGAAT CGACGGTGCG CGACACCGCG
GCGGTGATGC GCATTCGCGC GTTGCAGGAG AAAAAGCGCG CGTAG
 
Protein sequence
MSIDSTSSGG AQPLHHGANG SVHAPPPIIV APKDDGDQPF FDPVAYGNGP DDSVTDTTEA 
AAITHHTVRI DGRTIAYTAA AGHLVTVDPS SSQPDAKIFY VAFTQDGQQE QTRPVTFFYN
GGPGSSAVFV LLGSFAPRRI RTSMPSFTPP APYRMEDNPD SLLDKSDLVF INPVGTGYSA
AIAPRKNRDF WGVDQDANSI KQFIKRYLTK HNRWNSPKYL FGESYGTARS CVLAYKLHED
GVDLNGITLQ SSILDYRQAG NPVGALPTAA ADAWYHKRLG VAPTPTDLGA FVEEVAQFAR
TDYLGALRKF PQADAAVVKK LSDYTGIDTT TLLSWSLDIA GYDARGNALF LTTLLKAQGL
ALGAYDGRVT GIESGIAGRI DPNSGGNDPT MTAVSGVYTA MWNTYLNEQL KYTSNSSFTD
LNDQAFKYWD FGHIDPTGEQ QGVDAKGNVI LYTAGDLAAT MALNVDLKVL SANGLYDFVT
PFYQTVLDLQ QMPLEDPKVR QNLSARFYPS GHMVYLDGGS RTTLKHDLAQ MYESTVRDTA
AVMRIRALQE KKRA