Gene BURPS668_A1108 details


Gene Information

Locus tag: BURPS668_A1108
Symbol:
ID: 4887501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1063073
End bp: 1064203
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 75%
IMG OID: 640131047
Product: putative D-aminopeptidase
Protein accession: YP_001062106
Protein GI: 126444290
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3191] L-aminopeptidase/D-esterase
TIGRFAM ID:
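
The coordinate and length fields in this record are mutually consistent, which can be checked with a short calculation. The Python sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for BURPS668_A1108.
length_bp = gene_length(1063073, 1064203)
print(length_bp)                  # 1131
print(protein_length(length_bp))  # 376
```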


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCACAC GAGATCTGGG CATTCGCATC GGCCGCGGCA AGCCGGGGCG CCTGAACGCC 
ATCACCGACG TCGCCGGCGT GCGGGTCGGG CACCACACGG TGCACGTCGA GGCGGGCGAC
GCGTCGGCGC ACACCGGCGT GACGGTGATC GAGCCGCGCG CCGCGCGCGC GCGCGACGAG
CCGTGCTTCG CGGGCGTTCA CGTGCTCAAC GGCAACGGCG ACGCGACCGG GCTCGAATGG
ATTCGCGAGG CGGGGCTGCT GACGACGCCG ATCGCCTATA CGAACACGCA CAGCGTCGGC
ATCGTGCGCG ATGCGCTCGT CGCCGCCGAG CGCGCGCAGG GCGGCGCGCG CGAGCGCGAG
CACGTGTACT GGTGCATGCC GGTCGTGATG GAGACGTTCG ACGGACTCCT GAACGACATC
TGGGGGCAGC ACGTGTGCGT CGGGCACGTC GCGCAGGCGC TCGCCGCCGC GCGTTCGGGC
CCGGTCGCGG AAGGCTGCGT CGGCGGCGGC ACCGGCATGA TCTGCCACGA GTTCAAGGGC
GGCATCGGCA CCGCGTCGCG CGTCGTCGCC GAAGCGGCGG GCGGCTGGAC GGTCGGCGCG
CTCGTGCAGG CGAACTACGG GCAGCGCGCG GCGCTGCGCG TCGCGGGCTA CCCGGTCGGC
GAAGTGCTGC GCGACGCGCA CTCGCCGTTC GACGAGGCGG GCGGGGCGGG CGAGCCCGGC
ATGGGCTCGA TCGTCGTGAC GCTCGCGACC GACGCGCCGC TGCTGCCGCA TCAATGCACG
CGGCTCGCGC AGCGCGCGAG CGTCGGGCTC GCGCGCGTCG GCGGCGGCAC CGACAATTCG
AGCGGCGACA TTTTCGTGGC GTTCGCAACC GGCAATACCG GGCTGCCGAT CGCGAGCTAC
GGCCGGCCGG GCCCGACGAC GGTCGGCGTG CGGATGGTCG CCGACGCGCA CATCTCCGCC
CTGTTCGACG CGGCGGCGGA AGCGGTCGAG GAGGCGATCG TCAACGCGCT CGTCGCGGCG
ACCGATCTCG CGGCGCGCGG CGTGCGCGTC GAGGCGCTCG GCGCCGCGCG GCTCGTCGAT
GCGTTGCGCG AGACCGGCTG GCGCCCGCGC GCGGGCGACG CTCAGCTATA G
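
The GC content field can be recomputed from the gene sequence listing. A minimal Python sketch follows, assuming the sequence is pasted as space-separated 10-base groups as shown above. Only the first line of the listing is used as sample input, and `clean_sequence` and `gc_content` are illustrative names, not part of any database tool.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line listing into one uppercase string."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, used as sample input.
first_line = "ATGCGCACAC GAGATCTGGG CATTCGCATC GGCCGCGGCA AGCCGGGGCG CCTGAACGCC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_content(seq), 1))  # GC percentage of this first line only
```

Passing the complete listing to `clean_sequence` gives a value that can be compared with the GC content field in the Gene Information section.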
 
Protein sequence
MRTRDLGIRI GRGKPGRLNA ITDVAGVRVG HHTVHVEAGD ASAHTGVTVI EPRAARARDE 
PCFAGVHVLN GNGDATGLEW IREAGLLTTP IAYTNTHSVG IVRDALVAAE RAQGGARERE
HVYWCMPVVM ETFDGLLNDI WGQHVCVGHV AQALAAARSG PVAEGCVGGG TGMICHEFKG
GIGTASRVVA EAAGGWTVGA LVQANYGQRA ALRVAGYPVG EVLRDAHSPF DEAGGAGEPG
MGSIVVTLAT DAPLLPHQCT RLAQRASVGL ARVGGGTDNS SGDIFVAFAT GNTGLPIASY
GRPGPTTVGV RMVADAHISA LFDAAAEAVE EAIVNALVAA TDLAARGVRV EALGAARLVD
ALRETGWRPR AGDAQL
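
The protein sequence appears to correspond to the gene sequence translated in frame 1 with translation table 11. The Python sketch below illustrates that translation under stated assumptions. It uses the fact that table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons), builds the codon table from the conventional TCAG ordering, and stops at the first stop codon. The name `translate` is illustrative, and alternative start codons are not handled.

```python
from itertools import product

# Codon table in TCAG order (third base varies fastest); amino-acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # "X" marks any codon containing a symbol other than A, C, G, T.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 21 bases of the gene sequence listing.
print(translate("ATGCGCACAC GAGATCTGGG C"))  # MRTRDLG
print(translate("GCGGGCGACG CTCAGCTATA G"))  # AGDAQL
```

The two outputs match the first seven and last six residues of the protein sequence listed above.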