Gene Information |
Locus tag | BURPS1106A_A0741 |
Symbol | |
ID | 4905538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 730367 |
End bp | 731338 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640143847 |
Product | dipeptidase family protein |
Protein accession | YP_001074777 |
Protein GI | 126457459 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
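The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 730367, 731338
GENE_LENGTH_BP, PROTEIN_LENGTH_AA = 972, 323

print(cds_span(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks print True: 731338 - 730367 + 1 = 972 bp, and 972 / 3 = 324 codons, one of which is the stop codon, leaving 323 residues.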
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGC TGCACCAGGA CAGCATCATC ATCGATGGCC TGAACATTTC GAAGTTCGAA CGCTCGGTGT TCGAAGACAT GCAAAAGGGC GGCGTGACGG CCGCGAACTG CACGGTGTCC GTGTGGGAGA ACTTCACGAA GACGGTCGAC AACATCGCGC TGATGAAAAA GCAGATTCGC GAGAACGGCG AACTGCTGAC GCTCGTGCGC ACGACGGACG ACATCCTCCG CGCGAAGCGG GAAGGCCGCA CGGGCGTGAT CCTCGGCTTC CAGAACGCGC ACGCGTTCGA GGACAACCTG GGCTATGTCG AGGCGTTCGC CGACATGGGC GTGCGCGTCG TGCAGCTTTG CTACAACACG CAGAACCTCG TCGGCACCGG CTGCTACGAG CGCGACGGCG GGCTGTCGGA TTTCGGCCGC GAGGTGATCA CCGAGATGAA CCGCGTCGGG ATCATGGTCG ACTTGTCGCA CGTCGGCGGC AACACGTCGT CGGAGGCGAT CGCGTTCTCG AAGAAACCCG TGTGCTACTC GCACTGCCTG CCGTCGGGTC TCAAGGCGCA TCCGCGCAAC AAGAGCGACG CGCAACTGAA GGAGATCGCG GACGCGGGCG GCTTCGTCGG GGTGACGATG TTCGCGCCGT TCCTGAAGCG CGGGATCGAC GCGACGATCG ACGATTACAT CGAGGCGATC GGCTACGTCG TGAACCTGAT CGGCGAGGAC GCGGTCGGCA TCGGCACCGA TTTCACGCAG GGCTACAGCG TCGATTTCTT CGATTGGCTC ACGCACGACA AGGGCCGCTA CCGCCGGCTC ACGAATTTCG GCAAGGTCGT GAATCCTGAA GGCATCCGAA CGATCGGCGA ATTCCCGAAC CTGACGGCCG CGATGGAGCG CGCGGGATGG AAGGCGTCGC GCATCCGCAA GATCATGGGC GAAAACTGGG TGCGCGTGTT CAAGGAGGTC TGGGGCGCGT AA
|
Protein sequence | MSTLHQDSII IDGLNISKFE RSVFEDMQKG GVTAANCTVS VWENFTKTVD NIALMKKQIR ENGELLTLVR TTDDILRAKR EGRTGVILGF QNAHAFEDNL GYVEAFADMG VRVVQLCYNT QNLVGTGCYE RDGGLSDFGR EVITEMNRVG IMVDLSHVGG NTSSEAIAFS KKPVCYSHCL PSGLKAHPRN KSDAQLKEIA DAGGFVGVTM FAPFLKRGID ATIDDYIEAI GYVVNLIGED AVGIGTDFTQ GYSVDFFDWL THDKGRYRRL TNFGKVVNPE GIRTIGEFPN LTAAMERAGW KASRIRKIMG ENWVRVFKEV WGA
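The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal sketch, not the method used by the database: the helper names are hypothetical, the codon table is written out in the NCBI ordering (bases T, C, A, G), and alternative start codons permitted by translation table 11 are not handled, which is assumed to be sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-residue mapping in NCBI order; elongation codons in translation
# table 11 are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAGCACGC TGCACCAGGA CAGCATCATC"))  # MSTLHQDSII
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to translate and gc_content would allow the Protein sequence and GC content fields to be compared in the same way.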