Gene BURPS1106A_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1900 
Symbol 
ID4902033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1853216 
End bp1854631 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content74% 
IMG OID640135130 
ProductM24/M37 family peptidase 
Protein accessionYP_001066165 
Protein GI126451678 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0520216 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCCAAGC CAATCGCGTA CCCCGTTTCC GTTTCTTCCG CCCGCCGCGC CGAGCCGGCG 
CCCGCCGCCG AAGCGCCGGC CCGCGGCGGC GCCCCTCATT CGTTCACGCG CAGGCTGCCC
GCGGCGAGCG CGCTCGGCGC GCTGATCGCC TGCGGTCTCG CCGCGCTGCC GGCCGTGTCG
AAGTCGCCCG ACGCGTCGAG CGCGAATGCC GCGCGCGCCC CGCAGCGCGT GTTCGCCGAC
GCGCCGTTTC CGAGCGCGCA GCGCTTCGTC ACGGGCGAGC TCGAGCGCAT GCTCGCCGAG
CAGAACCCGC CCGCCGCGCC GTTGCGCACG CTCGGCGCGC ACGCCGAGGG CGCGACGCTC
GGCGACGCGC CCTCGGACAT CGCGCAGCAC GGCCCGGCGC GGCCGGCGCC CGCACGCCCC
GCCGCGCCGC CGCCGGGGCT TTTCGCGCCG CTCGCCGAGC ACCGCAACGC ACTGCTCGGC
TACGACGGCC GCGCGTTCTC GCTCGCCGAT TCGGTGCTCG CGCTCGTCGA TAGCGGCGTG
CGCGCCGGGC CGATCGACGA CACGCTCGCC GACACGCTGA ACCGGCTCGA CATCCCGCCC
GAAGTGCGCA TCCAGATCGG CGACCTGATC GCCGAGCGCG TGCGAGCGCA TGCGCACGCG
CAGCAAGGCG ACCGCTACCG GATCGCGTTC GACGCCGCGT CGGGCAAGCC GCGCGTGACC
GCGCTCGAGC TGCGCGTCGC GGGCCGCCGG TTCGGCGCGA TCTGGTTCAA GCCGCCGGGC
GCGTCGAGCG GCGCGTACTA CGCGTTCGAC GGCGCGCCGC TCGACGCGCC GGCGCTCGCG
ATGCCCGTCG TCAGCACGCG CATCAGCTCG TACTTCGGCG AGCGCGTGCA TCCGCTGTCG
CACATCCTGC AGATGCATAC GGGCGTCGAT CTCGCCGCGC CCACCGGCAC GCGCGTGAAC
GCGGCGGCGG CGGGCGTCGT GTCGTTCGTC GGCTACGATC CGGGCGGCTA CGGCAAGTAT
GTCGTCATCG ACCATCCGGA CCGCTCGTCG ACCTACTACG CGCATCTGTC GGCGTTCGCG
CCGAAGCTCG AGGTCGGGAT GGCGGTCGCG CAGGGCCAGC GGATCGGCGC GGTCGGCTCG
ACGGGCGCGG CGACGGGCCC GCATCTGCAT TTCGAGGTGC GCGTCGACGA TCAGCCGGTG
GATCCGCTCG TCGCGCTCGC GAACGCGCAG AACACGCTGT CGGCGATGCA GCTCGACGCG
TTCCGGCGCG CCGCGAGCGA GGCGCGCTTC CGGCTCGCGT CGGGCGCCAC GCCGCCGCTC
GGCTTCGCGC AGATCAACGC GCCGCTGTGG GCCGAGTTCG CCACCGATAC GTCGACGCTG
CGCGCGATCT TCAACACGCA TTACGCGACG TCGTGA
 
Protein sequence
MAKPIAYPVS VSSARRAEPA PAAEAPARGG APHSFTRRLP AASALGALIA CGLAALPAVS 
KSPDASSANA ARAPQRVFAD APFPSAQRFV TGELERMLAE QNPPAAPLRT LGAHAEGATL
GDAPSDIAQH GPARPAPARP AAPPPGLFAP LAEHRNALLG YDGRAFSLAD SVLALVDSGV
RAGPIDDTLA DTLNRLDIPP EVRIQIGDLI AERVRAHAHA QQGDRYRIAF DAASGKPRVT
ALELRVAGRR FGAIWFKPPG ASSGAYYAFD GAPLDAPALA MPVVSTRISS YFGERVHPLS
HILQMHTGVD LAAPTGTRVN AAAAGVVSFV GYDPGGYGKY VVIDHPDRSS TYYAHLSAFA
PKLEVGMAVA QGQRIGAVGS TGAATGPHLH FEVRVDDQPV DPLVALANAQ NTLSAMQLDA
FRRAASEARF RLASGATPPL GFAQINAPLW AEFATDTSTL RAIFNTHYAT S