Gene BURPS668_0373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0373 
Symbol 
ID4885051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp341575 
End bp343266 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content68% 
IMG OID640126301 
Productserine carboxypeptidase family protein 
Protein accessionYP_001057426 
Protein GI126438747 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.250878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGGAG GGTATGGCGG CCTCGCGCGC CCGCGCGGCC GGATGCCGGC GACGGCCGCC 
GTTGCCGCCG CGTTGCTGCT CGCGCTCGGC GGGTGCGGCG ACGATCTGCA GAGCACCACG
ACGCCCGCGC AGCTCAATCA ACCGTACACG GACACGACCG CGTATTCGCC GAAGGCGGGC
GATGGGCTGC CGGCGTCGCA GGTATCCGAA CGCGCCGCGG TGATGAGCCA CCAGTGGACG
GCGAACGGCG CGAGCGTCGA TTACCTGACG ACGACCGGCC ACCTGACGGC CACCGATCCG
AACGGCAACG CGGAGGCGAC GATGTCGTAC GTCGCGTATA CGGCGCCGAG CCGCGACGGC
TCGCCGCGGC CCGTCACGTT CTTCTACAAC GGGGGGCCGG GCTCGTCGTC GGTGTGGCTG
CGGCTCGGCT CGTTCGCGCC GACGCGGGTC GCGACGCCCG ATCCGCTGAT GACGAACTGG
CCGAATTTCC CGCTCGTCGA CAACCCGGAG AGCCTGATCG CGACCACCGA CATGGTGTTC
ATCGATCCGC CGGGCACGGG CCTGTCGGAG GCGATCCAGC CGAACACGAA CCAGACGTTC
TGGGGAGCGG ACGCCGACGT GAAGGTGATG CGCGATTTCA TTCGCCGCTA CCTGTCGGTC
AACGGCCGCG GCGGCTCGCC GATCTATCTG TACGGCGAAT CGTACGGCAC GCCGCGCACG
GACATGCTCG CGCTCGCGCT CGAATCGGCG GGCGTGCCGC TCACGGGCAT CGTGCTGCAG
TCGTCGATCC TGAACTACAT GGCGGCCGCG GGCGACCAGG CGGTGGGCAC CTTCCCGTCG
TACGCGCAGG TGGCCGCATA CTTCAACCAG GTGTCGCCGT CGCCGACGAA TCTGGGCGCG
TACGCGCAGC GCATCGAGAA TTTCGTGACC GCGCAGTACG CGCCGATCGT GCATTACGCG
ACGGCTTCGT CGCCGATCTC GCCGGACGCC GGCACGCTCG CCGCGTGGTC GTCGCAAACG
GGCATGGCCA CCGCGTCGAT CGGCGCGTAC TTCCAGTATT TCTACGATAC GGAGCCGTCG
CCCGGCCAGA CGACGCTCGT GCCCGGCTAC ACGATCGGCC GCTACGACGG CCGCGTGTCG
CTGCCGAACG GCGACGCGCG CCTCGCGAGC GACGACGATC CGTCCGATAT CCTGATCTCG
AAGCCGTTCA CGAGCGCGCT CGCGTCGCAG ATGCCGAACT ACCTCGGCTA CACCGCGCCG
AACGCGACGT ATCAGACGCT CAATCCCGAC ATCATCGGCG TGTGGAACTT CAGCCACGCG
GGCCAGCCGT ATCCGGACAC GATCCCGGAT CTGCTTGCCG CGCTGCAACT GAATCCGAAG
CTGAAGGTGC TCGCGTCGAA CGGCTATCAT GATCTCGCGA CGCCGTACTT CGAAACGGAG
AAGGAGCTCG CGCGGCTGCA GACGGTGTCC GGCCTCGCGC CGAATCTGCA GGTGACGTTC
TACCAGGGCG GCCACATGAT CTATCTCGAC GACGTCGCGA GGCCGCAGAT GCAGGCGGAT
CTCGTCGCGT TCTACCAGAA CCGGCCGGTG GCAAACGCGT TGACGCTCGC GGCGCTGCCG
TCGCCGTGGC CCGACGAAAG CCCGGCGAAC ACGCCGACGG CGAAGATCGC GCGGGCCGCC
GCGGCCCGCT GA
 
Protein sequence
MHGGYGGLAR PRGRMPATAA VAAALLLALG GCGDDLQSTT TPAQLNQPYT DTTAYSPKAG 
DGLPASQVSE RAAVMSHQWT ANGASVDYLT TTGHLTATDP NGNAEATMSY VAYTAPSRDG
SPRPVTFFYN GGPGSSSVWL RLGSFAPTRV ATPDPLMTNW PNFPLVDNPE SLIATTDMVF
IDPPGTGLSE AIQPNTNQTF WGADADVKVM RDFIRRYLSV NGRGGSPIYL YGESYGTPRT
DMLALALESA GVPLTGIVLQ SSILNYMAAA GDQAVGTFPS YAQVAAYFNQ VSPSPTNLGA
YAQRIENFVT AQYAPIVHYA TASSPISPDA GTLAAWSSQT GMATASIGAY FQYFYDTEPS
PGQTTLVPGY TIGRYDGRVS LPNGDARLAS DDDPSDILIS KPFTSALASQ MPNYLGYTAP
NATYQTLNPD IIGVWNFSHA GQPYPDTIPD LLAALQLNPK LKVLASNGYH DLATPYFETE
KELARLQTVS GLAPNLQVTF YQGGHMIYLD DVARPQMQAD LVAFYQNRPV ANALTLAALP
SPWPDESPAN TPTAKIARAA AAR