Gene BURPS1106A_0827 details


Gene Information

Locus tag: BURPS1106A_0827
Symbol:
ID: 4899467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 810246
End bp: 812015
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 73%
IMG OID: 640134057
Product: hypothetical protein
Protein accession: YP_001065108
Protein GI: 126454072
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
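
The length fields above follow from the coordinates. The sketch below shows that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1770 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(810246, 812015)
print(length)                     # 1770
print(protein_length_aa(length))  # 589
```

Both values agree with the Gene Length and Protein Length fields of this record.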


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCAAG CTCCATCGAA CCATCACGCG GTGGCCGCCA GATCCTTGTC CATGCGCGTC 
AAATCGTCGT TTGCCGTCCT GCTGTGCGCG GCGCTCGCCT TGCCGCCCGG CGGCCACGCG
CAGTCGCGCG GCGATGCGCC GCCGCTCGAA TCCGCGCGCG CCGCCGGCGC CGAGGACGCC
GCGGCGCGCG CGCGCGATGC GCTGTCCACG GTGCCGTCCG GCATCGCGCC CGGCGTGTTC
GGCATGTACG GCGGCGCGCA GAGCCGGCTT GCCGATCCGG CGTCGGGCAC GCCCAGTTTG
CGCGCGCCGC TTCGCTCGTT GCAACTGCCC GATCTCGGCG ACGGCTCGGG CGGCTCGCTG
ACGCCGCAAG CGGAGCGCCG GCTCGGCGAG CGCGTGATGC GCGAGGTGCG GCGCGATCCC
GACTATCTCG ACGACTGGCT CGTGCGCGAC TACCTGAATT CCGTCGCGGC GAAGCTCTCC
GCGGCCGCCG CCGCGCAGTT CATCGGCGGC TACATGCCCG ATTTCGAGCT GTTCGCGATG
CGCGATCCGC AGATCAACGC GTTCTCGCTG CCGGGCGGTT TCATCGGCAT CAACAGCGGG
CTCGTCGCGG CGACGCAGAC GGAGTCCGAA CTCGCGTCGG TGATTGGCCA CGAGATGGGG
CATGTGCTGC AGCGGCACAT CGCGCGGATG ATCGGCGCGA GCGAGAAGAG CGGCTATGCG
GCGCTCGCGA CGATGCTGTT CGGCGTGCTC GCGGGCATTC TCGCGCGCAG CGGCGATCTC
GGCAGCGCGA TCGCGATGGG CGGCCAGGCG TTCGCGGTCG ACAGCCAGCT CAGGTTCTCG
CGCTCGGCCG AGCGCGAGGC GGACCGCGTC GGCTTCCAGT TGCTCGCGGG CGCCGGCTAC
GATCCGTACG GCATGCCGGG CTTCTTCGAG CGGCTCGAGC GTGCGTCGGT GGGCGACGCG
GGCGTGCCCG CGTACGCGCG CACGCACCCG CTGACGGGCG AGCGGATCGC CGACATGGAC
GACCGCGCGC GGCGCGCGCC GTACCGGCAG CCGCGGCAAT CGGCGGAATA CGGTTTCGTG
CGCGCGCGCC TGCGGATGCT GCAGAACCGC GCGCCGACCG ATTACGCGAA CGAGGCAAGA
CGAATGCGCG CGGAGCTCGA CGATCGCGTC GCGCCGAATG TCGCGGCGAA CTGGTATGGG
ATCGCGCTCG GCGAGATGCT GGGCGGCCGC TACGATGACG CGGACCGCGC GCTCGCCGCA
GCGCGCGATG CGTTCGCGCG CACGGCCGCG CGCGAGGGCG AGGCGGCGCG CACTTCGCCG
AGCCTCGACG TGCTCGCCGC GGAGATCGCG CGTCGCGCCG GCCGCGGGGA CGACGCGGTG
CGGCTCGCCG CCGCCGCGCA GGCGCGCTGG CCGGGTTCGC ACGCGGCTAT CGCCGCGCAT
TTGCAGGCGC TTCTCGCCGC GCGGCGTTAC GGGCAGGCGC AGGCGCTCGC ACAAGCGGAG
GCGAACGCGG CCCCCCGCCA GCCCGATTGG TGGAACTATC TCGCGCAGGC GAGCCTCGGC
CGGGGCGATG CGCTCACGCA GCGCCGCGCG CTCGCGGAGA AGTTCGCGCT CGAAGGCGCG
TGGCCGTCGG CGATCCGGCA ACTGCGCGAG GCGCGCGATC TCAAGTCGGC CGGTTTCTAC
GAGCAATCGA TCATCAGCGC GCGGCTGCAC GAATTCGAGG CACGCTACAA GGAAGAGCGG
GAAGAGGACA AGGACGATCG GCGCGGTTGA
 
Protein sequence
MTQAPSNHHA VAARSLSMRV KSSFAVLLCA ALALPPGGHA QSRGDAPPLE SARAAGAEDA 
AARARDALST VPSGIAPGVF GMYGGAQSRL ADPASGTPSL RAPLRSLQLP DLGDGSGGSL
TPQAERRLGE RVMREVRRDP DYLDDWLVRD YLNSVAAKLS AAAAAQFIGG YMPDFELFAM
RDPQINAFSL PGGFIGINSG LVAATQTESE LASVIGHEMG HVLQRHIARM IGASEKSGYA
ALATMLFGVL AGILARSGDL GSAIAMGGQA FAVDSQLRFS RSAEREADRV GFQLLAGAGY
DPYGMPGFFE RLERASVGDA GVPAYARTHP LTGERIADMD DRARRAPYRQ PRQSAEYGFV
RARLRMLQNR APTDYANEAR RMRAELDDRV APNVAANWYG IALGEMLGGR YDDADRALAA
ARDAFARTAA REGEAARTSP SLDVLAAEIA RRAGRGDDAV RLAAAAQARW PGSHAAIAAH
LQALLAARRY GQAQALAQAE ANAAPRQPDW WNYLAQASLG RGDALTQRRA LAEKFALEGA
WPSAIRQLRE ARDLKSAGFY EQSIISARLH EFEARYKEER EEDKDDRRG
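
The protein sequence corresponds to the gene sequence read codon by codon. As a minimal sketch, the code below builds a codon table from the compact amino-acid string used for the standard genetic code, which translation table 11 shares for sense and stop codons (table 11 differs only in its set of permitted start codons). It then translates the first and last 30 bases of the gene sequence above. The `translate` helper is illustrative, not the method used to generate this record.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; amino acids listed in the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGACGCAAG CTCCATCGAA CCATCACGCG"))  # MTQAPSNHHA
print(translate("GAAGAGGACA AGGACGATCG GCGCGGTTGA"))  # EEDKDDRRG
```

The two outputs match the first ten and last nine residues of the protein sequence listed above.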