Gene BURPS1710b_A2223 details


Gene Information

Locus tag: BURPS1710b_A2223
Symbol:
ID: 3692057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2708460
End bp: 2710217
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 59%
IMG OID: 637732477
Product: serine-type carboxypeptidase family protein
Protein accession: YP_337374
Protein GI: 76819158
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
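
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2708460, 2710217)
print(length)                     # 1758
print(protein_length_aa(length))  # 585
```

Under these assumptions the computed values match the Gene Length (1758 bp) and Protein Length (585 aa) fields.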


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00580188
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAAA GAATACCGAT TGGGACTACT GTGGCAGCGC TTGCGACCGT CACGCTGGTT 
GCGTGCGGCG GCGACGATCC ATCGTCTGGT GCCGTGAGCG CCGGCAACTC GCCGGCCATA
GTGCCGTCAG TACCCGCCGC AACGCCGGCG ACGCCAACCG CGACGGCCGC CGAGTCTGAA
CAGGACGTGT TCGCAAGAAC TGCGCCGGCT GACAAAGTAT TGGATGACTC CAACGTCTAC
GACACGACAA AAGACGGTTC CATCGCTTTG TCCAAGGTCA ACGAGGATTC GTCGGTCAAG
CGACATACGA TCACCATCAA CGGGAAAGTA CTGCCGTATG TCGCGCGTGC CGGGCACCTG
GTTGCTTACC GGCAAAACGG GGCGGGCAAG AAAGCCGAGG CGGCGATCTT CTATACGGCG
TACACGCGTG ACGCACTGCC CAAGGAGCAT CGCCCGGTCA CGTTCTTATG GAACGGCGGC
CCCGGATCGG CGTCGATCTG GCTGCACATG GGATCGTGGG GCCCGAAACG GCTCAAGTCC
GACGCCCCGA ACATGGCCGA TCCCACGAAG CAGCCCGACA GCTTCCCGTT CGAGGACAAT
GCGATTTCGC TACTCGACCA GTCGGACATC GTCTTCGTCG ATCCGCCGGG GACAGGTCTT
TCCACAGCCA TTGCGCCGCT CAAGAACGGC GATCTATGGG GAACCGACGA TGATGCCCAA
GTGGTTGCCG ACTTCATTAC GAGCTATACG AACAAGTACA ACCGGCAGAG CTCTCCGAAG
TACCTGTACG GCGAATCCTA CGGCGGTATC CGCACGCCGA TCGTCGCCAA CCTGCTGGAG
CAGGCAGGTA CCAGCGGCTA CGTTCCCGAT CCGTCCGGCA AGCCGGCCAG GGTGCTGGAC
GGATTCATCC TCAATTCGCC GCTCGTCGAT TACAACTCCA ATTGCGACAT GATGGGCGGA
AGAGTGACGT GCGAGGGTTA CATCCCGTCC TACGCCATGA CGGCAGACTA CTTCAAGAAA
TCCGTCAAGC GCGGCACGCG AACGCAAGAG CAGTATCTCG GCGAGTTGCG CACGCTGGCG
AGAACGACAT TCCAGACGAC CTACGGAACC TACTTCACCA ACGGCAAACC GAACAGCCAA
TGGAATGCGT ATGCGGCGAG CACGGCGGGA CAGGCTCTCC TGAACCGGAT CGCCGACTAT
ACCGGTATTC CCGCGTCGAC CTGGAACGGC TCGTTCAACT ACACGCCGAG GCCGTTTCGA
AACGCGCTGG TTCCAGGCTA CGAACTCGGG CGCTACGATG CGCGCATGAA GGTACCGAAC
GGGAACAGCT TCGCGGCGGA CAGCTACATC GACGTGGCCT TCCTTAACCA GCTCAAGACG
TATTTTCCCG ATTTCGTCAA CTACAAGACC CAGTCGATCT ACGAGCCGCT CAATAATGCG
ACGATCAGAA ACTGGAAATG GAAGAGGGCC GGGAGCAAAT ATGATTACCC ACAGAGCATT
ACCGACATTC AAGCCGTTCT GACTGCCAAC CCCGATGCCA AGCTGCTGAT TCTTCACGGT
TATGAAGATA TCGCGACGCC GGGTTTCCAG ACCGAACTGG ATCTCGAGGG GGTGAACCTG
AGCGATCGCA TCCCGGTCAA ATGGTTCGAA GGCGGGCATA TGATCTACAA CACCGAGGCA
TCGCGCGTGC CGCTGAAGCA GGCGATCGAC AGCTATTACA AGTCGCCGAC GCGTATCGCG
GACGGTTCCT TACTATAA
 
Protein sequence
MAKRIPIGTT VAALATVTLV ACGGDDPSSG AVSAGNSPAI VPSVPAATPA TPTATAAESE 
QDVFARTAPA DKVLDDSNVY DTTKDGSIAL SKVNEDSSVK RHTITINGKV LPYVARAGHL
VAYRQNGAGK KAEAAIFYTA YTRDALPKEH RPVTFLWNGG PGSASIWLHM GSWGPKRLKS
DAPNMADPTK QPDSFPFEDN AISLLDQSDI VFVDPPGTGL STAIAPLKNG DLWGTDDDAQ
VVADFITSYT NKYNRQSSPK YLYGESYGGI RTPIVANLLE QAGTSGYVPD PSGKPARVLD
GFILNSPLVD YNSNCDMMGG RVTCEGYIPS YAMTADYFKK SVKRGTRTQE QYLGELRTLA
RTTFQTTYGT YFTNGKPNSQ WNAYAASTAG QALLNRIADY TGIPASTWNG SFNYTPRPFR
NALVPGYELG RYDARMKVPN GNSFAADSYI DVAFLNQLKT YFPDFVNYKT QSIYEPLNNA
TIRNWKWKRA GSKYDYPQSI TDIQAVLTAN PDAKLLILHG YEDIATPGFQ TELDLEGVNL
SDRIPVKWFE GGHMIYNTEA SRVPLKQAID SYYKSPTRIA DGSLL
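
The sequences above are laid out in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. As a minimal sketch, the helpers below join the blocks and compute a GC fraction of the kind summarized in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGGCTAAAA GAATACCGAT TGGGACTACT GTGGCAGCGC TTGCGACCGT CACGCTGGTT"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the listed GC content.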