Gene Information |
Locus tag | BURPS668_1887 |
Symbol | |
ID | 4882270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 1847412 |
End bp | 1848830 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640127815 |
Product | M24/M37 family peptidase |
Protein accession | YP_001058922 |
Protein GI | 126441274 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
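The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for BURPS668_1887.
length = gene_length_bp(1847412, 1848830)
print(length)                     # 1419
print(protein_length_aa(length))  # 472
```

Both results match the Gene Length (1419 bp) and Protein Length (472 aa) rows of the record.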
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0768301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCCAAGC CAATCGCGTA CCCCGTTTCC GTTTCTTCCG CCCGCCGCGC CGAGCCGGCG CCCGCCGCCG AAGCGCCGGC CCGCGGCGGC GCCCCTCGTG CGTTCACGCG CAGGCTGCCC GCGGCGAGCG CGCTCGGCGC GCTGATCGCC TGCGGTCTCG CCGCGCTGCC GGCCGTGTCG AAGTCGCCCG ACGCGTCGAG CGCGAATGCC GCGCGCGCCC CGCAGCGCGT GTTCGCCGAC GCGCCGTTTC CGAGCGCGCA GCGCTTCGTC ACGGGCGAGC TCGAGCGCAT GCTCGCCGAG CAGAACCCGC CCGCCGCGCC GTTGCGCACG CTCGGCGCGC ACGCCGAGGG CGCGACGCTC GGCGACGCGC CCTCGGCCAT CGCGCAGCAC GGCCCGGCGC GGCCGGCGCC CGCACGCCCC GCCGCGCCGC CGCCGCCGGG GCTTTTCGCG CCGCTCGCCG AGCACCGCAA CGCACTGCTC GGCTACGACG GCCGCGCGTT CTCGCTCGCC GATTCAGTGC TCGCGCTCAT CGATAGCGGC GTGCGCGCCG GGCCGATCGA CGACACGCTC GCCGACACGC TGAACCGGCT CGACATCCCG CCCGAAGTGC GCATCCAGAT CGGCGACCTG ATCGCCGAGC GCGTGCGCGC GCATGCGCAC GCGCAGCGAG GCGACCGCTA CCGGATCGCG TTCGACGCCG CGTCGGGCAA GCCGCGCGTG ACCGCGCTCG AGCTGCGCGT CGCGGGCCGC CGGTTCGGCG CGATCTGGTT CAAGCCGCCG GGCGCGTCGA GCGGCGCGTA CTACGCGTTC GACGGCGCGC CGCTCGACGC GCCGGCGCTC GCGATGCCCG TCGTCAGCAC GCGCATCAGC TCGTACTTCG GCGAGCGCGT GCATCCGCTG TCGCACATCC TGCAGATGCA TACGGGCGTC GATCTCGCCG CGCCCACCGG CACGCGCGTG AACGCGGCGG CGGCGGGCGT CGTGTCGTTC GTCGGCTACG ATCCGGGCGG CTACGGCAAG TATGTCGTCA TCGACCATCC GGACCGCTCG TCGACCTACT ACGCGCATCT GTCGGCGTTC GCGCCGAAGC TCGAGGTCGG GATGGCGGTC GCGCAGGGCC AGCGGATCGG CGCGGTCGGC TCGACAGGCG CGGCGACGGG CCCGCATCTG CATTTCGAGG TGCGCGTCGA CGATCAGCCG GTGGATCCGC TCGTCGCGCT CGCGAACGCG CAGAACACGC TGTCGGCGAT GCAGCTCGAC GCGTTCCGGC GCGCCGCGAG CGAGGCGCGC TTCCGGCTCG CGTCGGGCGC CACGCCGCCG CTCGGCTTCG CGCAGATTAA CGCGCCGCTG TGGGCCGAGT TCGCCACCGA TACGTCGACG CTGCGCGCGA TCTTCAACAC GCATTACGCG GCGTCGTGA
|
Protein sequence | MAKPIAYPVS VSSARRAEPA PAAEAPARGG APRAFTRRLP AASALGALIA CGLAALPAVS KSPDASSANA ARAPQRVFAD APFPSAQRFV TGELERMLAE QNPPAAPLRT LGAHAEGATL GDAPSAIAQH GPARPAPARP AAPPPPGLFA PLAEHRNALL GYDGRAFSLA DSVLALIDSG VRAGPIDDTL ADTLNRLDIP PEVRIQIGDL IAERVRAHAH AQRGDRYRIA FDAASGKPRV TALELRVAGR RFGAIWFKPP GASSGAYYAF DGAPLDAPAL AMPVVSTRIS SYFGERVHPL SHILQMHTGV DLAAPTGTRV NAAAAGVVSF VGYDPGGYGK YVVIDHPDRS STYYAHLSAF APKLEVGMAV AQGQRIGAVG STGAATGPHL HFEVRVDDQP VDPLVALANA QNTLSAMQLD AFRRAASEAR FRLASGATPP LGFAQINAPL WAEFATDTST LRAIFNTHYA AS
|
| |
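The gene sequence begins with TTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the alternative start codons and is read as methionine when it initiates a CDS. The sketch below illustrates this on the first 30 bases of the record. It is a hedged illustration, not the database's own pipeline: the helper names are hypothetical, the codon table is built from the standard NCBI amino-acid string for table 11, and the input is assumed to be the coding strand as displayed, with spaces between 10-base blocks.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest), table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, reading a valid start codon as M."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"   # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("TTGGCCAAGC CAATCGCGTA CCCCGTTTCC"))  # MAKPIAYPVS
print(gc_percent("TTGGCCAAGC"))                           # 60.0
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same helpers would be one way to check the record's Protein Length and GC content fields.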