Gene BURPS1106A_3492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3492 
Symbol 
ID4902222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3396846 
End bp3398480 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content64% 
IMG OID640136718 
Productserine carboxypeptidase family protein 
Protein accessionYP_001067729 
Protein GI126454227 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.547458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCAC GTAAAGCGCA GCTGTTATTA GCCGCGGTAT TCTCTTCGCT GCTGATCGCG 
GCGTGTAACG GAGACGATGC CGGTTCGCCC GCGAGCGCCG CGGCGAGCGA GAACAGTTCG
GGCAACATCG ATGCCTCGTC CGGCGCGGAC AAGCCCTATC GGGATCCCAA CCGTTATTCG
TCGAGCGGCA GCGCTTCGCT GTCGCCTTCC GCGGTGACCG AGAACGCCGC GATCACGCAT
CACCAGATCA CGGTGAACGG CAAGACGATC AACTACACCG CGACCGCCGG CCACCTGAGC
GCGCGCAACC CGCAAAGCGG CGCCGCCGAA GCGTCGTTCT TCTATGTCGC GTACACGGCC
GACAATCAGC CGCTCGGCAA GCGGCCCGTC ACGTTTTTCT ACAACGGCGG GCCGGGCTCG
TCGACGGTCT GGCTGCATCT CGGCTCGTTC GGGCCCAAGC GCCTCGTGAC GGGCGATCCG
AACGCGAACA GCCCGACGCC GTTCCCGTTC GTCGACAACC AGGAAACGCT CCTCGACACG
ACCGATCTCG TGTTCGTCGA CGCGATCGGC ACCGGTTTCT CGGAGGCGAT CGCGCCGAAC
ACGAACCAGA CGTTCTGGGG CGTCGATCAG GATGCCGGAG CATTTCGCGA TTTCGTGATC
CGCTATCTGC AGGTGAATCA GCGCAACGAT TCGCCGAAGT ACCTGTTCGG GGAATCGTAC
GGCACGCCGC GCACCGACGT GCTCGCGAAC CTGCTCGAGA CGGCGGGCGT GAAGCTCGAC
GGCATCGTGC TGCAATCGTC GATCCTCAAC TACAACAGCA ACTGCGACAT GGCGAGCAAC
TCCGTCGGCG GCTCGAACAG CGGCTCGAGC GCGGTGAGCT GCGCAGGCTT CGTGCCGAGC
TACGGCGCGA TCGGCGCGTA TTACCAGCTC GACAACCCGA ATCCGACGAA CCTGCCGCAG
TATGCGGACC AGATGCGTTT GCTCACGGCG GGCACCTATT CGCCGGCCGT GGACGCGTAC
CTCGCGAACC ATACGCCGCC TTCGTCGAGC CTCGTCACGA CGATGTCGAA CGCAACGGGC
GCGACCGTGC CGCTCTGGCG CGCGAACTTC AACCTGCTGC CGACTTCGTA TGACAACAGC
TACCAGCTCG CGCTGATTCC GGGTACGCTG ATCGGCCGCT ACGACGCGCG CGTGAACGTG
CCGACGAACA GCCCGCTCGC GTCGGACGGC GATCCGTCGA GCAGCTTCAT CACGAAGCCG
TTCACCGATA CGATCGGCAC CTATTTGCCG AACGTGCTCA AGTACACGGC GAAATCGGCC
TATGCTGTGC AGAGCAACGC GATTGCCTCG TGGGACTGGA GCCACGACGG CCTCGATCTG
CCCGACACGA TCCCCGATCT GGCCGCGGCG CTCACGCTCA ATCCCGCGCT GAAGGTGCTG
TCGCTGAACG GCTATCACGA CATCGCGACG CCGTTCTATC AGACGGAACT CGATCTTGCC
CGTCTCGGCG CGCAGCCGAA CCTGACGATC AAGGATTACC AGGGCGGGCA CATGGTCTAC
CTGGACGATA CGTCCCGGCC GCAGGAAAAA GCCGATCTGG CGACGTTCTA CGCGGCGGCG
CCTGCCGCGC GATGA
 
Protein sequence
MPARKAQLLL AAVFSSLLIA ACNGDDAGSP ASAAASENSS GNIDASSGAD KPYRDPNRYS 
SSGSASLSPS AVTENAAITH HQITVNGKTI NYTATAGHLS ARNPQSGAAE ASFFYVAYTA
DNQPLGKRPV TFFYNGGPGS STVWLHLGSF GPKRLVTGDP NANSPTPFPF VDNQETLLDT
TDLVFVDAIG TGFSEAIAPN TNQTFWGVDQ DAGAFRDFVI RYLQVNQRND SPKYLFGESY
GTPRTDVLAN LLETAGVKLD GIVLQSSILN YNSNCDMASN SVGGSNSGSS AVSCAGFVPS
YGAIGAYYQL DNPNPTNLPQ YADQMRLLTA GTYSPAVDAY LANHTPPSSS LVTTMSNATG
ATVPLWRANF NLLPTSYDNS YQLALIPGTL IGRYDARVNV PTNSPLASDG DPSSSFITKP
FTDTIGTYLP NVLKYTAKSA YAVQSNAIAS WDWSHDGLDL PDTIPDLAAA LTLNPALKVL
SLNGYHDIAT PFYQTELDLA RLGAQPNLTI KDYQGGHMVY LDDTSRPQEK ADLATFYAAA
PAAR