Gene BURPS1106A_0393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0393 
Symbol 
ID4901575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp356014 
End bp357705 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content68% 
IMG OID640133623 
Productserine carboxypeptidase family protein 
Protein accessionYP_001064676 
Protein GI126451557 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGGAG GGTATGGCGG CCTCGCGCGC CCGCGCGGCC GGATGCCGGC GACGGCCGCC 
GTTGCCGCCG CGTTGCTGCT CGCGCTCGGC GGGTGCGGCG ACGATCTGCA GAGCACCACG
ACGCCCGCGC AGCTCAATCA ACCGTACACG GACACGACCG CGTATTCGCC GAAGGCGGGC
GATGGGCTGC CGGCGTCGCA GGTATCCGAA CGCGCCGCGG TGATGAGCCA CCAGTGGACG
GCGAACGGCG CGAGCGTCGA TTACCTGACG ACGACCGGCC ACCTGACGGC CACCGATCCG
AACGGCAACG CGGAGGCGAC GATGTCGTAC GTCGCGTATA CGGCGCCGAG CCGCGACGGC
TCGCCGCGGC CCGTCACGTT CTTCTACAAC GGGGGGCCGG GCTCGTCGTC GGTGTGGCTG
CGGCTCGGCT CGTTCGCGCC GACGCGGGTC GCGACGCCCG ATCCGCTGAT GACGAACTGG
CCGAATTTCC CGCTCGTCGA CAACCCGGAG AGCCTGATCG CGACCACCGA CATGGTGTTC
ATCGATCCGC CGGGCACGGG CCTGTCGGAG GCGATCCAGC CGAACACGAA CCAGACGTTC
TGGGGAGCGG ACGCCGACGT GAAGGTGATG CGCGATTTCA TTCGCCGCTA CCTGTCGGTC
AACGGCCGCG GCGGCTCGCC GATCTATCTG TACGGCGAAT CGTACGGCAC GCCGCGCACG
GATATGCTCG CGCTCGCGCT CGAATCGGCG GGCGTGCCGC TCACGGGCAT CGTGCTGCAG
TCGTCGATCC TGAACTACAT GGCGGCCGCG GGCGACCAGG CGGTGGGCAC CTTTCCGTCG
TACGCGCAGG TGGCCGCATA CTTCAACCAG GTGTCGCCGT CGCCGACGAA TCTGGGCGCG
TATGCGCAGC GCATCGAGAA TTTCGTGACC GCGCAGTACG CGCCGATCGT GCATTACGCG
ACGGCTTCGT CGCCGATCTC GCCGGACGCC GGCACGCTCG CCGCGTGGTC GTCGCAAACG
GGCATGGCCA CCGCGTCGAT CGGCGCGTAC TTCCAGTATT TCTACGATAC GGAGCCGTCG
CCCGGCCAGA CGACGCTCGT GCCCGGCTAC ACGATCGGCC GCTACGACGG CCGCGTGTCG
CTGCCGAACG GCGACGCGCG CCTCGCGAGC GACGACGATC CGTCCGACAT CCTGATCTCG
AAGCCGTTCA CGAGCGCGCT CGCGTCGCAG ATGCCGAACT ACCTCGGCTA CACCGCGCCG
AACGCGACGT ATCAGACGCT CAATCCCGAC ATCATCGGCG TGTGGAACTT CAGCCACGCG
GGCCAGCCGT ATCCGGACAC GATCCCGGAT CTGCTTGCCG CGCTGCAACT GAATCCGAAG
CTGAAGGTGC TCGCGTCGAA CGGCTATCAT GATCTCGCGA CGCCGTACTT CGAAACGGAG
AAGGAGCTCG CGCGGCTGCA GACGGTGTCC GGCCTCGCGC CGAATCTGCA GGTGACGTTC
TACCAGGGCG GCCACATGAT CTATCTCGAC GACGTCGCGA GGCCGCAGAT GCAGGCGGAT
CTCGTCGCGT TCTACCAGAA CCGGCCGGTG GCAAACGCGT TGACGCTCGC GGCGCTGCCG
TCGCCGTGGC CCGACGAAAG CCCGGCGAAC ACGCCGACGG CGAAGATCGC GCGGGCCGCC
GCGGCCCGCT GA
 
Protein sequence
MHGGYGGLAR PRGRMPATAA VAAALLLALG GCGDDLQSTT TPAQLNQPYT DTTAYSPKAG 
DGLPASQVSE RAAVMSHQWT ANGASVDYLT TTGHLTATDP NGNAEATMSY VAYTAPSRDG
SPRPVTFFYN GGPGSSSVWL RLGSFAPTRV ATPDPLMTNW PNFPLVDNPE SLIATTDMVF
IDPPGTGLSE AIQPNTNQTF WGADADVKVM RDFIRRYLSV NGRGGSPIYL YGESYGTPRT
DMLALALESA GVPLTGIVLQ SSILNYMAAA GDQAVGTFPS YAQVAAYFNQ VSPSPTNLGA
YAQRIENFVT AQYAPIVHYA TASSPISPDA GTLAAWSSQT GMATASIGAY FQYFYDTEPS
PGQTTLVPGY TIGRYDGRVS LPNGDARLAS DDDPSDILIS KPFTSALASQ MPNYLGYTAP
NATYQTLNPD IIGVWNFSHA GQPYPDTIPD LLAALQLNPK LKVLASNGYH DLATPYFETE
KELARLQTVS GLAPNLQVTF YQGGHMIYLD DVARPQMQAD LVAFYQNRPV ANALTLAALP
SPWPDESPAN TPTAKIARAA AAR