Gene BMA10247_A1842 details


Gene Information

Locus tag: BMA10247_A1842
Symbol:
ID: 4889137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10247
Kingdom: Bacteria
Replicon accession: NC_009079
Strand:
Start bp: 1774711
End bp: 1775841
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 74%
IMG OID: 640148108
Product: putative D-aminopeptidase
Protein accession: YP_001079025
Protein GI: 126447514
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3191] L-aminopeptidase/D-esterase
TIGRFAM ID:
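
The coordinate and length fields in this record are consistent with each other, and the arithmetic can be sketched as below. This is a minimal illustration rather than the database's own method: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record; the helper names are illustrative only.
START_BP = 1774711
END_BP = 1775841


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1131 376
```

Both results agree with the Gene Length and Protein Length fields of the record.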


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCACAC GAGATCTGGG CATTCGCATC GGCCGCGGCA AGCCGGGGCG CCTGAACGCC 
ATCACCGACG TCGCCGGCGT GCGGGTCGGG CACCACACGG TGCACGTCGA GGCGGGCGAC
GCGTCGGCGC ACACCGGCGT GACGGTGATC GAGCCGCGCG CCGCGCGCGC GCGCGACGAG
CCGTGCTTCG CGGGCGTTCA CGTGCTCAAC GGCAACGGCG ACGCGACCGG GCTCGAATGG
ATTCGCGAGG CGGGGCTGCT GACGACGCCG ATCGCCTATA CGAACACGCA CAGCGTCGGC
ATCGTGCGCG ATGCGCTCGT CGCCGCCGAG CGCGCGCAGG GCGGCGCGCG CGAGCGCGAG
CACGTGTACT GGTGCATGCC GGTCGTGATG GAGACGTTCG ACGGACTCCT GAACGACATC
TGGGGGCAGC ACGTGTGCGT CGGGCACGTC GCGCAGGCGC TCGCCGCCGC GCGTTCGGAC
CCGGTCGCGG AAGGCTGCGT CGGCGGCGGC ACCGGCATGA TCTGCCACGA GTTCAAGGGC
GGCATCGGCA CCGCGTCGCG CGTCGTCGCC GAAGCGGCGG GCGGCTGGAC GGTCGGCGCG
CTCGTGCAGG CGAACTACGG GCAGCGCGCG GCGCTGCGCG TCGCGGGCTA CCCGGTCGGC
GAAGTGCTGC GCGACGCGCA CTCGCCGTTC GACGAGGCGG GCGGGGCGGG CGAGCCCGGC
ATGGGCTCGA TCGTCGTGAC GCTCGCGACC GACGCGCCGC TGCTGCCGCA TCAATGCACG
CGGCTCGCGC AGCGCGCGAG CGTCGGGCTC GCGCGCGTCG GCGGCGGCAC CGACAATTCG
AGCGGCGACA TTTTCGTGGC GTTCGCAACC GGCAATACCG GGCTGCCGAT CGTGAGCTAT
GGCCGGCCGG GCCCGACGAC GGTCGGCGTG CGGATGGTCG CCGACGCGCA CATCTCCGCC
CTGTTCGACG CGGCGGTGGA AGCGGTCGAG GAGGCGATCG TCAACGCGCT CGTCGCGGCG
ACCGATCTCG CGGCGCGCGG CGTGCGCGTC GAGGCGCTCG GCGCCGCGCG GCTCGTCGAT
GCGTTGCGCG AGACCGGCTG GCGCCCGCGC GCGGGCGACG CTCAGCTATA G
 
Protein sequence
MRTRDLGIRI GRGKPGRLNA ITDVAGVRVG HHTVHVEAGD ASAHTGVTVI EPRAARARDE 
PCFAGVHVLN GNGDATGLEW IREAGLLTTP IAYTNTHSVG IVRDALVAAE RAQGGARERE
HVYWCMPVVM ETFDGLLNDI WGQHVCVGHV AQALAAARSD PVAEGCVGGG TGMICHEFKG
GIGTASRVVA EAAGGWTVGA LVQANYGQRA ALRVAGYPVG EVLRDAHSPF DEAGGAGEPG
MGSIVVTLAT DAPLLPHQCT RLAQRASVGL ARVGGGTDNS SGDIFVAFAT GNTGLPIVSY
GRPGPTTVGV RMVADAHISA LFDAAVEAVE EAIVNALVAA TDLAARGVRV EALGAARLVD
ALRETGWRPR AGDAQL
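
The sequences in this record are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to run two simple checks on a coding sequence: GC fraction, and whether the sequence ends in a stop codon. It assumes the standard stop codons of translation table 11 (TAA, TAG, TGA); the function names are hypothetical, and only the first and last rows of the gene sequence are used, so the GC value it computes is for a fragment and not for the whole gene.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is in frame and its last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCGCACAC GAGATCTGGG CATTCGCATC GGCCGCGGCA AGCCGGGGCG CCTGAACGCC"
last_row = "GCGTTGCGCG AGACCGGCTG GCGCCCGCGC GCGGGCGACG CTCAGCTATA G"

head = clean_sequence(first_row)
tail = clean_sequence(last_row)

print(len(head), head.startswith("ATG"))   # row length and start codon check
print(len(tail), ends_with_stop(tail))     # the last row ends in TAG
print(round(gc_fraction(head), 2))         # GC fraction of the first row only
```

To check the whole gene, pass the full gene sequence text to `clean_sequence` and compare the resulting length and GC fraction with the Gene Length and GC content fields.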