Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1022 |
Symbol | |
ID | 4906165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 987966 |
End bp | 989096 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640144127 |
Product | putative D-aminopeptidase |
Protein accession | YP_001075057 |
Protein GI | 126455541 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCACAC GAGATCTGGG CATTCGCATC GGCCGCGGCA AGCCGGGGCG CCTGAACGCC ATCACCGACG TCGCCGGCGT GCGGGTCGGG CACCACACGG TGCACGTCGA GGCGGGCGAC GCGTCGGCGC ACACCGGCGT GACGGTGATC GAGCCGCGCG CCGCGCGCGC GCGCGACGAG CCGTGCTTCG CGGGCGTTCA CGTGCTCAAC GGCAACGGCG ACGCGACCGG GCTCGAATGG ATTCGCGAGG CGGGGCTGCT GACGACGCCG ATCGCCTATA CGAACACGCA CAGCGTCGGC ATCGTGCGCG ATGCGCTCGT CGCCGCCGAG CGCGCGCAGG GCGGCGCGCG CGAGCGCGAG CACGTGTACT GGTGCATGCC GGTCGTGATG GAGACGTTCG ACGGACTCCT GAACGACATC TGGGGGCAGC ACGTGTGCGT CGGGCACGTC GCGCAGGCGC TCGCCGCCGC GCGTTCGGGC CCGGTCGCGG AAGGCTGCGT CGGCGGCGGC ACCGGCATGA TCTGCCACGA GTTCAAGGGC GGCATCGGCA CCGCGTCGCG CGTCGTCGCC GAAGCGGCGG GCGGCTGGAC GGTCGGCGCG CTCGTGCAGG CGAACTACGG GCAGCGCGCG GCGCTGCGCG TCGCGGGCTA CCCGGTCGGC GAAGTGCTGC GCGACGCGCA CTCGCCGTTC GACGAGGCGG GCGGGGCGGG CGAGCCCGGC ATGGGCTCGA TCGTCGTGAC GCTCGCGACC GACGCGCCGC TGCTGCCGCA TCAATGCACG CGGCTCGCGC AGCGCGCGAG CGTCGGGCTC GCGCGCGTCG GCGGCGGCAC CGACAATTCG AGCGGCGACA TTTTCGTGGC GTTCGCAACC GGCAATACCG GGCTGCCGAT CGCGAGCTAC GGCCGGCCGG GCCCGACGAC GGTCGGCGTG CGGATGGTCG CCGACGCGCA CATCTCCGCC CTGTTCGACG CGGCGGCGGA AGCGGTCGAG GAGGCGATCG TCAACGCGCT CGTCGCGGCG ACCGATCTCG CGGCACGCGG CGTGCGCGTC GAGGCGCTCG GCGCCGCGCG GCTCGTCGAT GCGTTGCGCG AGACCGGCTG GCGCCCGCGC GCGGGCGACG CTCAGCTATA G
|
Protein sequence | MRTRDLGIRI GRGKPGRLNA ITDVAGVRVG HHTVHVEAGD ASAHTGVTVI EPRAARARDE PCFAGVHVLN GNGDATGLEW IREAGLLTTP IAYTNTHSVG IVRDALVAAE RAQGGARERE HVYWCMPVVM ETFDGLLNDI WGQHVCVGHV AQALAAARSG PVAEGCVGGG TGMICHEFKG GIGTASRVVA EAAGGWTVGA LVQANYGQRA ALRVAGYPVG EVLRDAHSPF DEAGGAGEPG MGSIVVTLAT DAPLLPHQCT RLAQRASVGL ARVGGGTDNS SGDIFVAFAT GNTGLPIASY GRPGPTTVGV RMVADAHISA LFDAAAEAVE EAIVNALVAA TDLAARGVRV EALGAARLVD ALRETGWRPR AGDAQL
|
| |