Gene BURPS668_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2008 
Symbol 
ID4884297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2005370 
End bp2007742 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content74% 
IMG OID640127936 
Productputative penicillin amidase 
Protein accessionYP_001059043 
Protein GI126438402 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.276522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCTC GCACGAACCG CTTGCCGCGG TGGCTCAAGA TCCTGCCCGG CGTCATTCTT 
CTCGGCGCGC TGCTCGTCGC GGCCGGCGCG GCGCTGTTCC TGCGCGCGAG CCTGCCGCGG
CTCGACGGCG ACGTGCGCGC GCCGACGCTC GGCGGCCCGA TGACGATCGA ACGCGACGCC
GCGGGCGTGC CGACCGTCGC CGCGCGCGAC CGCTTCGACG CCGCCTACGG CATCGGCTAC
CTGCATGCGC AGGACCGCTT CTTCCAGATG GATTATCTGC GCCGGACCGG AGCAGGGGAG
TTGGCGGAGC TGCTCGGGCC CGCCGCGCTG GATTTCGATC GCGAGCACCG GCTGTTCCGG
TTTCGCGCGC GCGCCGCGGC GGCGTTCGCG CAGTTGCCGC CCGACGAGCG GCGCCTGCTC
GAACGCTACA CGCAAGGCGT GAACGACGGG CTCGCCGCGC TGCGCGCCCG GCCGTTCGAA
TACGCGCTGC TCGGCGAGCC GCCGGCGCGG TGGCGGCCCG AAGATTCGCT GCTCGTGATC
TGGGCGATGT ACTTTCAGGT GCAGGGCACG CTCGCGTCGC GCGACATCGC GCGCAACTGG
CTGACGGCGC ACGCGACGCA GCAGCAACGC GCCTTCCTGC TGCCGTCGTC GAGCGGATTC
GACGCGCCGC TCGATGCGCC GCGCATCGAC GAAGCGCCCG CGCCGCTGCC CGACGCCGCG
CCCGACTGGT TCCGCGCCGC AGGCGACGGC GCGGCCAAGC GCGCATCGCT CGATTTCCGC
TCGTCGGTCG GCAGCAACAA CTGGGCCATT GCCGGCAGCC GCAGCGCACG CGGCGCGGCC
ATCGTCGGCG ACGACATGCA CCTCGTGCTC GGCCTGCCGA ACACCTGGTA TCGCGCGGCC
TTCACCTATC CGGGCGGCGC GGCGCCCGTG CGGCGGGCCG TCGGCGTGAC GCTCGCCGGG
CTGCCGGCGA TCGTGGCCGG CAGCAACGGG CATGTCGCAT GGGGCTTGAC GGTCGGTTAC
GCGGATTGCC TCGACCTCGT GCCGCTCGAG CGCGACGGCG ACGATTCGCG GGCGTTCCGG
ATGAGCGGCG CGCGCCAGGT CGCGCGCCGG TACGTCGAAT CGATCCGGGT GCGCGGCGGC
GCGTCCGTTT CGCTGACCGT GCTGGAAACG ACGGTCGGGC CGGTGCGGGA AATCGACGGC
CGGCCCTATG CGGTCCACTG GGTCGCGCAG TCGCCGGGCG CGGTGAACCT GGGGCTCGCG
CGCCTCGCGG ACGCCGTCGA CGTCGACGGC GCGATGCGCG TGGCGAATAC GCTCGGCATT
CCGGCCGAGA ACATCGTGGT CGGCGACCGC GCCGGGCGAA TCGGCTGGAC TATCGCCGGC
GCGCTGCCGG ACCGGCGCGC GCCGCGCGGC GGCGAGGGCG CGGCGTGGCG GTCGCTGCTG
CCGCCCGACG CGTATCCGCG CGTCGTCGAT CCGTCCGGCG GCCAGCTCTG GACCGCGAAC
AGCCGCCAGT TGGCGGGCGA CGCATACCGG TTGATCGGCG ATGGCGGCAC GGATCTCGGC
GCGCGGGCGA CCCAGCTGCG CGACGGACTG ACGGCGCTCG GCCGCACCGA CGAACAAGCG
GCGTATCGGA TCGACCTCGA CGATCGCGCG CTGTTCATCG CGCAGTGGCG CGACCGCGCG
CTGCGCGTGC TCGACGACGC GGCGCTCGCG GGCCACCCGT CGCGCGCGGA ATTCCGCCGG
CTGCTCGAGC ACGGCTGGAC GGGCCGGGCG AGCGTCGACT CGGTCGGGTA CACGCTCGCG
CGCGGCTTTC TGTATCGGCT GTACGACGTC ACGTTCGACG GGCTGAACGC CCGCCTGAAG
CAAGTCGATG CGGGCGCGGA CTACGAACTG GCGAATCTGC GCTGGCCGGC CGTCGTCGCG
CGGCTGCTCG ACGCGCAGCC GCCGGGCTGG CTGCCGGCCG GCGCGTCGAG CTGGCGCGAC
GTGCAACTGA TCGCGATCGA CCGGACCATC GCCGCGCTCA CGGCCGACGG CGCGCCGCTC
GCGCGGGCGA GCTGGGGCGC GCGCAACACG CTGCGGATCG CGCATCCGTT CGCCGGCAGC
CTGCCGCTGC TCGGCGGATG GATGACGGCG CCGGCCGCGC AGATGCCGGG GGATTCGCAC
ATGCCGCGCG TCGCCGCGCC GGATTTCGGG CAATCCGAGC GGATGGTCGT GTCGCCGGGG
CACGAGGAAT TCGGGATCTT CAACATGCCG GGCGGGCAGA GCGGGCATCC GCTGAGCCCG
TTCTTCCTCG CGGGCCACGA TGCGTGGGTG CGCGCGGAGC CGACGCCGTT CTTGCCCGGC
GTCGCGCGGC ATACGTTGAG ATTCGCGCCG TAG
 
Protein sequence
MASRTNRLPR WLKILPGVIL LGALLVAAGA ALFLRASLPR LDGDVRAPTL GGPMTIERDA 
AGVPTVAARD RFDAAYGIGY LHAQDRFFQM DYLRRTGAGE LAELLGPAAL DFDREHRLFR
FRARAAAAFA QLPPDERRLL ERYTQGVNDG LAALRARPFE YALLGEPPAR WRPEDSLLVI
WAMYFQVQGT LASRDIARNW LTAHATQQQR AFLLPSSSGF DAPLDAPRID EAPAPLPDAA
PDWFRAAGDG AAKRASLDFR SSVGSNNWAI AGSRSARGAA IVGDDMHLVL GLPNTWYRAA
FTYPGGAAPV RRAVGVTLAG LPAIVAGSNG HVAWGLTVGY ADCLDLVPLE RDGDDSRAFR
MSGARQVARR YVESIRVRGG ASVSLTVLET TVGPVREIDG RPYAVHWVAQ SPGAVNLGLA
RLADAVDVDG AMRVANTLGI PAENIVVGDR AGRIGWTIAG ALPDRRAPRG GEGAAWRSLL
PPDAYPRVVD PSGGQLWTAN SRQLAGDAYR LIGDGGTDLG ARATQLRDGL TALGRTDEQA
AYRIDLDDRA LFIAQWRDRA LRVLDDAALA GHPSRAEFRR LLEHGWTGRA SVDSVGYTLA
RGFLYRLYDV TFDGLNARLK QVDAGADYEL ANLRWPAVVA RLLDAQPPGW LPAGASSWRD
VQLIAIDRTI AALTADGAPL ARASWGARNT LRIAHPFAGS LPLLGGWMTA PAAQMPGDSH
MPRVAAPDFG QSERMVVSPG HEEFGIFNMP GGQSGHPLSP FFLAGHDAWV RAEPTPFLPG
VARHTLRFAP