Gene BURPS1106A_A2152 details


Gene Information

Locus tag: BURPS1106A_A2152
Symbol:
ID: 4904318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2108094
End bp: 2109446
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 72%
IMG OID: 640145257
Product: putative regulatory protein HipA
Protein accession: YP_001076185
Protein GI: 126455544
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.748793
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGCCC GCCGCGCACG CGCGACGCGC CTGCACCTGT GGATGAACGG CCTGCCCGTC 
GGCTACTGGG AGCACGCGCG CGACGGCGAG CGCCTTGTCT ACTTCGACGA ATGGATCGGC
GATCCGCAAG GCCGGCCGCT GTCGCTGTCG CTGCCGTTCA CGCCGGGCAA CCAGCCGTAT
CGCGGTCGGC TCGTCAGCGA TTATTTCGAC AACCTGCTGC CCGACAGCGA GCCGATCCGC
CGGCGAATCG CGATGCGCTA CCGCACGGGC GGCACGTCCG CGTTCGCGCT GCTCGCGACG
CTCGGCCGCG ATTGCGTCGG CGCGCTGCAG ATGCTGCCGC CCGACGAAGC GCCGGACGAC
ATCGAACGCA TCCGCGGCCA CGCGCTCGCC GACGCGGACA TCGCGCGCCT GCTGCGCGAA
GTCACGTCCG CGCCGCAGGC CGGCCGGCAC GCGCCGCTCG ACGATCTGCG CCTGTCGATC
GCCGGCGCGC AGGAGAAGAC CGCGCTGCTG CGCCATCGCG GCCGCTGGCT GCTGCCCGAA
GGGAGCACGC CGACCACGCA CATCCTGAAG CTGCCGCTCG GGCTCGTCGG CAACCGGCGC
GCCGACATGC GCACGTCGGT CGAGAACGAA TGGCTGTGCG CGCGGATCGT CGCCGCGTAC
GGGTTGCCCG TCGCGCGCTG CGACATCGCT CAGTTCGACG ATCAGAAAGC GCTCGTCGTC
GAGCGCTTCG ACCGCCGGCC GTCGCGCGAC GCACGCTGGC TCCTGCGGCT GCCGCAGGAA
GACATGTGCC AGGCAACCGG CACGTCCGCG CTCGACAAAT ATCAGGCCGA CGGCGGCCCC
GGCATCGAGA CGATCATGGA AGTGCTCGCC GGCTCCGAGC ACGCGCGGGA CGACCGCCGC
GCGTTCTTCG CGGCGCAGAT CGTGTTCTGG CTGCTCGCCG CGACCGACGG CCACGCGAAG
AACTTCAGCA TCGCGCACCT GCCCGGCAAC CGCTACCGTT CGACGCCGCT TTACGACGTG
CTGTCCGCGC ATCCGGTCAT CGGCCGGGGC GCGAACCAGT TGCCCGCGCA GCGCGCGCGG
CTCGCGATGG GCGTGCGCGG CAAGCACATC CACTATCCGC TGCACCAGAT CCGGCGGCGG
CACTGGATCG CGCAGGGCCA GCGCGTCGGC TTCGCGCCCG CCGACGTCGA CGCGCTGATC
GACACGCTGA CCGCGCGCAC CGCGGGCGTC GTCGACGCGG TGTCGGCGCG GCTGCCGCGC
GATTTTCCGC GCGACGTCGC CGATGCGATC TTCAGCGGAA TGCTCGGCCT GAGCGCAAGG
CTCGCCGGCG ACGCGGCCGC GCGCGCGCCA TGA
 
Protein sequence
MSARRARATR LHLWMNGLPV GYWEHARDGE RLVYFDEWIG DPQGRPLSLS LPFTPGNQPY 
RGRLVSDYFD NLLPDSEPIR RRIAMRYRTG GTSAFALLAT LGRDCVGALQ MLPPDEAPDD
IERIRGHALA DADIARLLRE VTSAPQAGRH APLDDLRLSI AGAQEKTALL RHRGRWLLPE
GSTPTTHILK LPLGLVGNRR ADMRTSVENE WLCARIVAAY GLPVARCDIA QFDDQKALVV
ERFDRRPSRD ARWLLRLPQE DMCQATGTSA LDKYQADGGP GIETIMEVLA GSEHARDDRR
AFFAAQIVFW LLAATDGHAK NFSIAHLPGN RYRSTPLYDV LSAHPVIGRG ANQLPAQRAR
LAMGVRGKHI HYPLHQIRRR HWIAQGQRVG FAPADVDALI DTLTARTAGV VDAVSARLPR
DFPRDVADAI FSGMLGLSAR LAGDAAARAP
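
The sequences above are printed in space-separated groups of ten, so they need to be joined before any calculation. The sketch below is one possible way to do that and to cross-check the record's length fields; the helper names (`clean_sequence`, `gc_percent`, `expected_protein_length`, `span_length`) are hypothetical and not part of the database. It assumes the gene length counts one terminal stop codon and that the start and end coordinates are inclusive.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


# First printed line of the gene sequence, copied from the record.
first_line = "GTGAGCGCCC GCCGCGCACG CGCGACGCGC CTGCACCTGT GGATGAACGG CCTGCCCGTC"
dna = clean_sequence(first_line)

print(len(dna))                          # bases on one printed line
print(span_length(2108094, 2109446))     # compare with "Gene Length"
print(expected_protein_length(1353))     # compare with "Protein Length"
```

To check the reported GC content, pass the full gene sequence block to `clean_sequence` and then to `gc_percent`.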