Gene BURPS1106A_A1226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1226 
Symbol 
ID4906193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1162711 
End bp1163916 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content72% 
IMG OID640144332 
ProductSer/Thr protein phosphatase family protein 
Protein accessionYP_001075261 
Protein GI126456022 
COG category[R] General function prediction only 
COG ID[COG1408] Predicted phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.795007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGCG TTTCATCGTT TCTGCTGCGC CTGACGATCA TCGGCGTGCT GCTGCACGTG 
TACGTCGGCT TCCGTCTGCT GCCCGAGCTC GCCTCGCCCG CCGCGCGCTA CGCGGGCGCG
CTGTGGCTCG TCGGGTCGTG CCTGCTGATT CCGCTCGGCA TGCTGTCGCG CGTGTTCGAG
CGGCAGCCGC TCGGCGATCG CGTCGCCTGG GCCGGCCTCC TCGCGATGGG CTTCTTCTCG
TCGCTGCTCG TGCTCACGCT CGCGCGCGAC GTGCTGCTCG CCTCGCTCGT CACCGTCGAC
GCGCTCGCGC CCGGCGCGGT GTCGCTCGCG CAGTGGCGGA TACAGACGGC GGCCGGCGTG
CCGCTCGCGG CGCTCGCGGT GAGCGTCGTC GGCTTCGTCA ATGCGCGACG CCGCGCACGC
GTCGTCGACG TCGCGGTGCC GATCGACGAT CTGCCCGCCG CGCTCGACGG CTTCACGATC
GTGCAGATCA GCGACATCCA TGTCGGCCCG ACGATCAAGC GCGGCTACGT CGAGGCGATC
GTCGACGCGG TCAACCGGCT CGCGCCGGAT CTCGTCGCGG TGACGGGTGA CGTCGTCGAC
GGCACGGTCG CGCAACTGGC CGGCCATGCG GCGCCGCTCG GGCGGCTGCG CGCGCGCCAC
GGCGCATTCG TCGTGACGGG CAACCACGAG TACTATTCGG GCGCCGACGA GTGGATCGCC
GAGTTCCGCC GCCTCGGCCT CGACGTGCTG CTCAACGAGC ATCGGACGCT CGACCACGGC
GACGGCCGGC TCGTGATCGC GGGCGTCACC GATTACTCGG CGGGCCACTT CGATCCCGCG
CATCGGAGCG ACCCGAGCGC GGCGCTCGCC GGCGCGCCCG CCGACGTGCG CATCCGCGTG
CTGCTCGCGC ACCAGCCGCG CAGCGCAACC GCCGCGGCCG ATGCGGGCTT CACGCTGCAA
CTGTCCGGGC ACACGCACGG CGGCCAGTTT TTCCCGTGGA ATTTCTTCGT GCGATTGCAG
CAGCCGTTCA CCGCCGGGCT CGCGCGACTC GACGGCCTGT GGGTCTATAC GAGCCGCGGC
ACCGGTTACT GGGGGCCGCC GAAACGGCTC GGCGCGCCGT CGGAAATCAC GCGCGTGCGG
CTCGTGCGCG GCGAAGGGAA CCGAACGCGC GCGCCGGCGT CCGTCACGCT GAACGCTGAA
CGCTGA
 
Protein sequence
MRRVSSFLLR LTIIGVLLHV YVGFRLLPEL ASPAARYAGA LWLVGSCLLI PLGMLSRVFE 
RQPLGDRVAW AGLLAMGFFS SLLVLTLARD VLLASLVTVD ALAPGAVSLA QWRIQTAAGV
PLAALAVSVV GFVNARRRAR VVDVAVPIDD LPAALDGFTI VQISDIHVGP TIKRGYVEAI
VDAVNRLAPD LVAVTGDVVD GTVAQLAGHA APLGRLRARH GAFVVTGNHE YYSGADEWIA
EFRRLGLDVL LNEHRTLDHG DGRLVIAGVT DYSAGHFDPA HRSDPSAALA GAPADVRIRV
LLAHQPRSAT AAADAGFTLQ LSGHTHGGQF FPWNFFVRLQ QPFTAGLARL DGLWVYTSRG
TGYWGPPKRL GAPSEITRVR LVRGEGNRTR APASVTLNAE R