Gene BURPS668_2844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2844 
Symbol 
ID4883637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2804222 
End bp2805592 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content69% 
IMG OID640128772 
ProductTldD/PmbA family protein 
Protein accessionYP_001059863 
Protein GI126439239 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCAA ACCTCGACGC CCAGGCGCAC TATTTTCCGC ACACGCAAGA CGAACTGAAA 
GCGATCGCGA CGGACATCCT CCGGCACGCG AAGGCGCTCG GCGCGACCGA CGCCGCCACC
GAAATCTCCG AGGGCGACGG CCTGTCGGTG TCGGTGCGGC GCGGCGAGGT CGAGACGATC
GAGCACAACC GCGACAAGAC GGTCGGCGTG ACCGTCTTCA TCGGCAAGAA GCGGGGCAAC
GCGAGCACGT CGGATTTCTC GCCCACGGCG CTCAAGGATA CGGTCGCCGC CGCGTACAAC
ATCGCGCGCT TCACGGCCGA GGACGACGCG GCAGGGCTCG CCGAAGCCGA ACTGCTCGAA
ACCGAGCCGC GCGACCTGGA CCTTTATCAC CCGTGGCGCC TGTCGGCCGA CGAAGCGGTC
GATCTCGCGC GCCGCTCGGA GGACGCCGCT TTCGCGGTGA GCCCGCAGAT CCGCAATTCG
GAGGGCGCGA GCGTGTCCGC GCAGCATTCG CAGTTCGTGC TCGCGACGAC GCGCGGCTTT
CTCGCCGGCT ATCCGTATTC GCGCCACTAC ATCGCGTGCG CGCCGATCGC GGGCAGCGGC
CGCCACATGC AGCGCGACGA CTGGTACACG TCCAAGCGCC GCGCGGACGA GTTGGCGTCG
CCCGAATCGG TCGGCCGCTA CGCGGCGCAA CGCGCGCTCG CGCGGATGGG CGCGCGCCGC
CTCGACACGC GCAAGGTGCC GGTCCTCTTC GAGGCGCCGC TCGCGGCGGG CCTGCTCGGC
GCGTTCGTGC AGGCGGTGAG CGGCGGCGCG CTGTACCGCA AGACGTCGTT CCTCGTCGAC
AGCCTCGGCA AGGAGGTGTT CGCGCCGCAC GTGCAGATCG TCGAGGATCC GCACGTGCCG
CGCGCGATGG GCAGCGCGCC GTTCGACGAG GAGGGCGTGC GCACGCGCGC GCGCAGCGTC
GTCCGGGACG GCGTGGTCGA GGGCTATTTC CTGTCGACGT ATTCGGCGCG CAAGCTCGGC
GCGCAGACGA CCGGCAACGC GGGCGGCTCG CACAACATCG CGCTGCGCAG CGCGCTCACG
GCGCCCGGCG ACGATTTCGA CGCGATGCTC AAGAAGCTCG GCACGGGACT TTTGCTGACC
GAACTGATGG GGCAGGGCGT CAACTACGTG ACGGGCGACT ACTCGCGCGG CGCGGCGGGC
TTCTGGGTCG AGAACGGCGA GATCCAGTAT CCGGTCGAGG AGATCACGGT GGCGAGCACG
CTGCAGGAGA TGTTCCGCCA TATCGTCGCG ATCGGCGCGG ATTCGATCGT GCGCGGCACG
AAGGAAACGG GTTCGGTGCT GATCGAGCAG ATGACGATCG CGGGGCAGTA A
 
Protein sequence
MAANLDAQAH YFPHTQDELK AIATDILRHA KALGATDAAT EISEGDGLSV SVRRGEVETI 
EHNRDKTVGV TVFIGKKRGN ASTSDFSPTA LKDTVAAAYN IARFTAEDDA AGLAEAELLE
TEPRDLDLYH PWRLSADEAV DLARRSEDAA FAVSPQIRNS EGASVSAQHS QFVLATTRGF
LAGYPYSRHY IACAPIAGSG RHMQRDDWYT SKRRADELAS PESVGRYAAQ RALARMGARR
LDTRKVPVLF EAPLAAGLLG AFVQAVSGGA LYRKTSFLVD SLGKEVFAPH VQIVEDPHVP
RAMGSAPFDE EGVRTRARSV VRDGVVEGYF LSTYSARKLG AQTTGNAGGS HNIALRSALT
APGDDFDAML KKLGTGLLLT ELMGQGVNYV TGDYSRGAAG FWVENGEIQY PVEEITVAST
LQEMFRHIVA IGADSIVRGT KETGSVLIEQ MTIAGQ