Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2008 |
Symbol | |
ID | 4884297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2005370 |
End bp | 2007742 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640127936 |
Product | putative penicillin amidase |
Protein accession | YP_001059043 |
Protein GI | 126438402 |
COG category | [R] General function prediction only |
COG ID | [COG2366] Protein related to penicillin acylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.276522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCTC GCACGAACCG CTTGCCGCGG TGGCTCAAGA TCCTGCCCGG CGTCATTCTT CTCGGCGCGC TGCTCGTCGC GGCCGGCGCG GCGCTGTTCC TGCGCGCGAG CCTGCCGCGG CTCGACGGCG ACGTGCGCGC GCCGACGCTC GGCGGCCCGA TGACGATCGA ACGCGACGCC GCGGGCGTGC CGACCGTCGC CGCGCGCGAC CGCTTCGACG CCGCCTACGG CATCGGCTAC CTGCATGCGC AGGACCGCTT CTTCCAGATG GATTATCTGC GCCGGACCGG AGCAGGGGAG TTGGCGGAGC TGCTCGGGCC CGCCGCGCTG GATTTCGATC GCGAGCACCG GCTGTTCCGG TTTCGCGCGC GCGCCGCGGC GGCGTTCGCG CAGTTGCCGC CCGACGAGCG GCGCCTGCTC GAACGCTACA CGCAAGGCGT GAACGACGGG CTCGCCGCGC TGCGCGCCCG GCCGTTCGAA TACGCGCTGC TCGGCGAGCC GCCGGCGCGG TGGCGGCCCG AAGATTCGCT GCTCGTGATC TGGGCGATGT ACTTTCAGGT GCAGGGCACG CTCGCGTCGC GCGACATCGC GCGCAACTGG CTGACGGCGC ACGCGACGCA GCAGCAACGC GCCTTCCTGC TGCCGTCGTC GAGCGGATTC GACGCGCCGC TCGATGCGCC GCGCATCGAC GAAGCGCCCG CGCCGCTGCC CGACGCCGCG CCCGACTGGT TCCGCGCCGC AGGCGACGGC GCGGCCAAGC GCGCATCGCT CGATTTCCGC TCGTCGGTCG GCAGCAACAA CTGGGCCATT GCCGGCAGCC GCAGCGCACG CGGCGCGGCC ATCGTCGGCG ACGACATGCA CCTCGTGCTC GGCCTGCCGA ACACCTGGTA TCGCGCGGCC TTCACCTATC CGGGCGGCGC GGCGCCCGTG CGGCGGGCCG TCGGCGTGAC GCTCGCCGGG CTGCCGGCGA TCGTGGCCGG CAGCAACGGG CATGTCGCAT GGGGCTTGAC GGTCGGTTAC GCGGATTGCC TCGACCTCGT GCCGCTCGAG CGCGACGGCG ACGATTCGCG GGCGTTCCGG ATGAGCGGCG CGCGCCAGGT CGCGCGCCGG TACGTCGAAT CGATCCGGGT GCGCGGCGGC GCGTCCGTTT CGCTGACCGT GCTGGAAACG ACGGTCGGGC CGGTGCGGGA AATCGACGGC CGGCCCTATG CGGTCCACTG GGTCGCGCAG TCGCCGGGCG CGGTGAACCT GGGGCTCGCG CGCCTCGCGG ACGCCGTCGA CGTCGACGGC GCGATGCGCG TGGCGAATAC GCTCGGCATT CCGGCCGAGA ACATCGTGGT CGGCGACCGC GCCGGGCGAA TCGGCTGGAC TATCGCCGGC GCGCTGCCGG ACCGGCGCGC GCCGCGCGGC GGCGAGGGCG CGGCGTGGCG GTCGCTGCTG CCGCCCGACG CGTATCCGCG CGTCGTCGAT CCGTCCGGCG GCCAGCTCTG GACCGCGAAC AGCCGCCAGT TGGCGGGCGA CGCATACCGG TTGATCGGCG ATGGCGGCAC GGATCTCGGC GCGCGGGCGA CCCAGCTGCG CGACGGACTG ACGGCGCTCG GCCGCACCGA CGAACAAGCG GCGTATCGGA TCGACCTCGA CGATCGCGCG CTGTTCATCG CGCAGTGGCG CGACCGCGCG CTGCGCGTGC TCGACGACGC GGCGCTCGCG GGCCACCCGT CGCGCGCGGA ATTCCGCCGG CTGCTCGAGC ACGGCTGGAC GGGCCGGGCG AGCGTCGACT CGGTCGGGTA CACGCTCGCG CGCGGCTTTC TGTATCGGCT GTACGACGTC ACGTTCGACG GGCTGAACGC CCGCCTGAAG CAAGTCGATG CGGGCGCGGA CTACGAACTG GCGAATCTGC GCTGGCCGGC CGTCGTCGCG CGGCTGCTCG ACGCGCAGCC GCCGGGCTGG CTGCCGGCCG GCGCGTCGAG CTGGCGCGAC GTGCAACTGA TCGCGATCGA CCGGACCATC GCCGCGCTCA CGGCCGACGG CGCGCCGCTC GCGCGGGCGA GCTGGGGCGC GCGCAACACG CTGCGGATCG CGCATCCGTT CGCCGGCAGC CTGCCGCTGC TCGGCGGATG GATGACGGCG CCGGCCGCGC AGATGCCGGG GGATTCGCAC ATGCCGCGCG TCGCCGCGCC GGATTTCGGG CAATCCGAGC GGATGGTCGT GTCGCCGGGG CACGAGGAAT TCGGGATCTT CAACATGCCG GGCGGGCAGA GCGGGCATCC GCTGAGCCCG TTCTTCCTCG CGGGCCACGA TGCGTGGGTG CGCGCGGAGC CGACGCCGTT CTTGCCCGGC GTCGCGCGGC ATACGTTGAG ATTCGCGCCG TAG
|
Protein sequence | MASRTNRLPR WLKILPGVIL LGALLVAAGA ALFLRASLPR LDGDVRAPTL GGPMTIERDA AGVPTVAARD RFDAAYGIGY LHAQDRFFQM DYLRRTGAGE LAELLGPAAL DFDREHRLFR FRARAAAAFA QLPPDERRLL ERYTQGVNDG LAALRARPFE YALLGEPPAR WRPEDSLLVI WAMYFQVQGT LASRDIARNW LTAHATQQQR AFLLPSSSGF DAPLDAPRID EAPAPLPDAA PDWFRAAGDG AAKRASLDFR SSVGSNNWAI AGSRSARGAA IVGDDMHLVL GLPNTWYRAA FTYPGGAAPV RRAVGVTLAG LPAIVAGSNG HVAWGLTVGY ADCLDLVPLE RDGDDSRAFR MSGARQVARR YVESIRVRGG ASVSLTVLET TVGPVREIDG RPYAVHWVAQ SPGAVNLGLA RLADAVDVDG AMRVANTLGI PAENIVVGDR AGRIGWTIAG ALPDRRAPRG GEGAAWRSLL PPDAYPRVVD PSGGQLWTAN SRQLAGDAYR LIGDGGTDLG ARATQLRDGL TALGRTDEQA AYRIDLDDRA LFIAQWRDRA LRVLDDAALA GHPSRAEFRR LLEHGWTGRA SVDSVGYTLA RGFLYRLYDV TFDGLNARLK QVDAGADYEL ANLRWPAVVA RLLDAQPPGW LPAGASSWRD VQLIAIDRTI AALTADGAPL ARASWGARNT LRIAHPFAGS LPLLGGWMTA PAAQMPGDSH MPRVAAPDFG QSERMVVSPG HEEFGIFNMP GGQSGHPLSP FFLAGHDAWV RAEPTPFLPG VARHTLRFAP
|
| |