Gene BURPS1710b_3598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3598 
Symbol 
ID3689036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3921166 
End bp3922980 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content72% 
IMG OID637730053 
Productpeptidase, M24 family protein 
Protein accessionYP_334963 
Protein GI76811283 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.979645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCCC GACTTCCCGA TCCGTCGCCC GTGCCGGCGC GTCTTGCCCT GTTGCGCGGC 
GCGATGACGC GCGAGGATCT GGCCGCCTAC GTGGTGCCGT CCGCCGATCC CCATTTGTCC
GAGTATTTGC CCGAGCGCTG GCAGGCGCGC CAATGGCTGT CGGGCTTCAC CGGCTCGGTC
GGCACGCTCG TCGTGACCGC CGATTTCGCC GGCCTCTGGG TCGACAGCCG CTATTGGATG
CAGGCCGAGG CGCAACTCGC GGGCACGGGC GTCGCGTTGA TGAAGATGGT GGGCGGCCAG
CAGACGCAGC CGCACGTCGA ATGGCTCGCC GAGCACGTGC CCGAGGGCAC GACGGTCGGC
GTGGACGGCG CGGTGCTCGG CGTCGCGGCG GCGCGCGCGC TCACGTCGGC GCTCACCCCG
CGCGGCATCG TGCTGCGCAC CGATCTCGAT CTGCTCGATG CGATCTGGCC GCAGCGCCCG
TCGCTGCCGG GCGACGCGGT GTTCGAGCAC GCGGCGCCGC AGGCCGACAC CGCGCGCGCG
GGCAAGCTCG CGCAGGTGCG CCGCGCGATG CACGAGCAAG GCGCGCAGTG GCACTTCGTG
TCGACGCTCG ACGATCTCGC GTGGCTCTTC AACCTGCGCG GCGCCGACGT CAACTACAAC
CCGGTGTTCG TCGCGCACGC GCTCGTCGGC CTCGAGCGCG CGACGCTGTT CGTCGCCGAC
GGCAAGGTGT CGGCCGAGCT GGCGACGTCG CTCGCGCGGG ACGGCGTCGA CGTGAAGCCG
TACGACGCCG CGGCCGCCGC GCTCGCCGCG CTGCCCGAGG GCGCGGGGCT GCTGATCGAT
CCGCGTCGCG TCACGTACGG GCTGCTGCAG GCGGTGCCGC AGCAGGTGCG CGTGATCGAG
GCGGTGAATC CGTCGACGTT CGCGAAATCG CGCAAGACGC CCGCCGAGAT CGAGCACGTG
CGCGCGACGA TGGAGCACGA CGGCGCGGCG CTCGCCGAAT TCTTCGCATG GTTCGAGCGC
GCGCTCGGCC GCGAGACGAT CACCGAGCTG ACCATCGACG AGCAGCTCAC GGCCGCGCGC
GCGCGACGGC CGGGCTATGT GTCGCCGAGC TTCGCGACGA TCGCGGGCTT CAACGCGAAC
GGCGCGATGC CGCATTACCG CGCGACGCGC GCCGCGCACG CGACGATCGA AGGCGACGGC
CTGCTGCTCG TCGATTCGGG CGGCCAGTAT CTGAGCGGGA CGACGGACAT CACGCGGGTC
GTGCCGGTCG GCGCGATCGG CGACGCGCAC CGGCGCGACT TCACGATCGT GCTGAAGGCG
ATGATGGCGC TGTCGCGTGC GCGCTTTCCG CGCGGCATCC GCTCGCCGAT GCTCGACGCG
ATCGCGCGCG CGCCGATGTG GGCGGCCGGG CTCGACTACG GGCACGGCAC GGGGCACGGC
GTCGGCTATT TCCTGAACGT ACACGAAGGG CCGCAGGTGA TCTCGCACTA CGCGCCCGCC
GAGCCGTACA CGGCGATGGA GGAGGGGATG ATCACGTCGA TCGAGCCCGG CGTGTACCGG
CCCGGCAACT GGGGCGTGCG CATCGAGAAT CTCGTCGTGA ACCGCGCGGC GGGCCAGACC
GAGTTCGGCG ATTTCCTCGA ATTCGAGACG CTCACGCTCT GCCCGATCGA TACGCGCTGC
GTGCTGCCCG CGCTCCTCGA CGACGTCGAG CGCGCGTGGC TGAACGCGTA TCACGCGACG
GTGCGCGAGC GGGTCGGCAA GCACGTGTCG GGCGACGCGA GGGCGTGGCT CGACGCGCGC
ACGCAACCGA TCTGA
 
Protein sequence
MNARLPDPSP VPARLALLRG AMTREDLAAY VVPSADPHLS EYLPERWQAR QWLSGFTGSV 
GTLVVTADFA GLWVDSRYWM QAEAQLAGTG VALMKMVGGQ QTQPHVEWLA EHVPEGTTVG
VDGAVLGVAA ARALTSALTP RGIVLRTDLD LLDAIWPQRP SLPGDAVFEH AAPQADTARA
GKLAQVRRAM HEQGAQWHFV STLDDLAWLF NLRGADVNYN PVFVAHALVG LERATLFVAD
GKVSAELATS LARDGVDVKP YDAAAAALAA LPEGAGLLID PRRVTYGLLQ AVPQQVRVIE
AVNPSTFAKS RKTPAEIEHV RATMEHDGAA LAEFFAWFER ALGRETITEL TIDEQLTAAR
ARRPGYVSPS FATIAGFNAN GAMPHYRATR AAHATIEGDG LLLVDSGGQY LSGTTDITRV
VPVGAIGDAH RRDFTIVLKA MMALSRARFP RGIRSPMLDA IARAPMWAAG LDYGHGTGHG
VGYFLNVHEG PQVISHYAPA EPYTAMEEGM ITSIEPGVYR PGNWGVRIEN LVVNRAAGQT
EFGDFLEFET LTLCPIDTRC VLPALLDDVE RAWLNAYHAT VRERVGKHVS GDARAWLDAR
TQPI