Gene BURPS668_3144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3144 
Symbol 
ID4884322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3080763 
End bp3082622 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content66% 
IMG OID640129072 
Productserine carboxypeptidase family protein 
Protein accessionYP_001060156 
Protein GI126441836 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.875872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATAC AGAAGTCCTT GAAAGACGGT TTCATGCTCG GATGGTGCAG GGCGGCACGG 
CCGGTTGCCG CTGCCGCGCT GGCCGCGCTG CTCGTCGCCG CGTGCGGCGG CGACGACGGC
GGCGGCAGCA GCCCGTCGCT CGCGGCCGCG AACGTCGCGA ACACGAGCAC GTCGACGAAC
GCGACGACGA ACGCGACGAC GGCCGCCGAT GCGACGACCA ACGCCGCGCT GCCGCCGGAC
CAGCCGTATA TCGACAACGA CGTCTATGGC ACCGGGCCGA ACGATTCGGT CAGCGACGCG
ACGGAGGGCA CCGCGGTCGT GCACCGGCAG GTGAAGATCG GCGATCAGAT CCTCACCTAC
ACGGCGACGG CCGGCCACCT CGTGACGATC GATCCGATCA CGTCGAAGCC GAACGCGAAG
ATGTTCTACG TCGCGTACAC GCTCGACAAT CCGAACCCGG GCAAGCCGCG CCCCGTCACG
TTCTTCTACA ACGGCGGCCC GGGCTCGTCG TCGGTGTACC TGCTGCTGGG CTCGTTCGGG
CCGAAGCGCC TGCAGTCGTC GTTCCCGAAC TTCACGCCGC CCGCGCCGTA CCGGCTGCGC
GACAACCCCG AGAGCCTGCT CGACCGCTCC GATCTCGTGT TCATCAATCC GGTCGGCACC
GGCTACTCGG CCGCGATCGC GCCGGCGAAG AACAAGGATT TCTGGGGCGT CGACCAGGAC
GCGCACTCGA TCGACCGCTT CATCCAGCGC TACCTGACGA AGTACGCGCG CTGGAACTCG
CCGAAGTTCC TGTTCGGCGA ATCGTACGGC ACGGCGCGCA GCGCGGTGAC CGCGTGGGTG
CTGCATGAGG ACGGCATCGA GCTGAACGGG ATCACGCTGC AGTCGTCGAT TCTCGACTAT
GCGAACGCGG TGAGCGCGAT CGGCATCTTC CCGACGCTCG CGGCCGATGC GTTCTACTGG
AACAAGACGA CCATCAGCCC GAAACCGGCC GATCTGGACG CATACATGGC GCAGGCGCGC
AGCTATGCGG ACAACGTGCT CGCGCCGCTC GCGCAGGCGC CGAATCCGCA GGACGGCGGC
TTCGTCAACG TGCGGCTGAA CCTGAACGTC GCGACCGCGC AGCAGATGGG CGCGTACATC
GGCACCGATC CGATCTCGCT GGTCCAGACG TTCGGCAATC CGGCCGCGCT CGGCAACGTG
CCGTCGTCCA ACGACAACCC GCCGTACACG TTCTTCCTGA CGCTCGTGCC GGGCATCCAG
ATCGGCCAGT ACGACGGACG CGCGAACTAC ACGGGCAAGG GCATCGCGCC GTATATCCTG
CCGAACTCGG GCAGCAACGA TCCGTCGATC AGCAACGTCG GCGGCGCGTA CACGGTGCTG
TGGAACGACT ACATCAACAA CGACCTGAAG TATGTGTCGA CGTCGTCGTT CGTCGATCTG
AACGACCAGG TGTTCAACAA CTGGGACTTC AGCCACACGG ACCCGACGGG CGCGAACCGC
GGCGGCGGCA ACACGCTGTA CACGGCGGGC GATCTCGCCG CGACGATGAG CCTGAACCCG
GACCTGAAGG TGCTGTCGGC GAACGGCTAT TTCGACGCGG TGACGCCGTT CCACCAGACC
GAGCTCACGC TCGCGCAGAT GCCGCTCGAT CCGTCGCTGA AGTCGGCGAA CCTGACGATG
AAATACTATC CGTCGGGCCA CATGATCTAT CTGAACGATC ACTCGCGGAT CGCGATGAAG
GCGGATCTGG CGACGTTCTA CGACGGCATC CTCGCGGACC GCACGGCGAT GCGGCGCGTG
CTGCTGCGCC AGCAGAAGGC GCTGCAGTTG AAGCAGCAGA AGCAACAGCA AGGGCAGTGA
 
Protein sequence
MKIQKSLKDG FMLGWCRAAR PVAAAALAAL LVAACGGDDG GGSSPSLAAA NVANTSTSTN 
ATTNATTAAD ATTNAALPPD QPYIDNDVYG TGPNDSVSDA TEGTAVVHRQ VKIGDQILTY
TATAGHLVTI DPITSKPNAK MFYVAYTLDN PNPGKPRPVT FFYNGGPGSS SVYLLLGSFG
PKRLQSSFPN FTPPAPYRLR DNPESLLDRS DLVFINPVGT GYSAAIAPAK NKDFWGVDQD
AHSIDRFIQR YLTKYARWNS PKFLFGESYG TARSAVTAWV LHEDGIELNG ITLQSSILDY
ANAVSAIGIF PTLAADAFYW NKTTISPKPA DLDAYMAQAR SYADNVLAPL AQAPNPQDGG
FVNVRLNLNV ATAQQMGAYI GTDPISLVQT FGNPAALGNV PSSNDNPPYT FFLTLVPGIQ
IGQYDGRANY TGKGIAPYIL PNSGSNDPSI SNVGGAYTVL WNDYINNDLK YVSTSSFVDL
NDQVFNNWDF SHTDPTGANR GGGNTLYTAG DLAATMSLNP DLKVLSANGY FDAVTPFHQT
ELTLAQMPLD PSLKSANLTM KYYPSGHMIY LNDHSRIAMK ADLATFYDGI LADRTAMRRV
LLRQQKALQL KQQKQQQGQ