Gene BURPS668_A2237 details


Gene Information

Locus tag: BURPS668_A2237
Symbol:
ID: 4888164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2169246
End bp: 2170598
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 72%
IMG OID: 640132174
Product: putative regulatory protein HipA
Protein accession: YP_001063231
Protein GI: 126442667
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
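
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2169246, 2170598)
print(length)                     # 1353
print(protein_length_aa(length))  # 450
```

Both values agree with the Gene Length and Protein Length fields of this record.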


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGCCC GCCGCGCACG CGCGACGCGC CTGCACCTGT GGATGAACGG CCTGCCCGTC 
GGCTACTGGG AGCACGCGCG CGACGGCGAG CGCCTTGTCT ACTTCGACGA ATGGATCGGC
GATCCGCAAG GCCGGCCGCT GTCGCTGTCG CTGCCGTTCA CGCCGGGCAA CCAGCCGTAT
CGCGGTCGGC TCGTCAGCGA TTATTTCGAC AACCTGCTGC CCGACAGCGA GCCGATCCGC
CGGCGAATCG CGATGCGCTA CCGCACGGGC GGCACGTCCG CGTTCGCGCT GCTCGCGACG
CTCGGCCGCG ATTGCGTCGG CGCGCTGCAG ATGCTGCCGC CCGACGAAGC GCCGGACGAC
ATCGAACGCA TCCGCGGCCA CGCGCTCGCC GACGCGGACA TCGCGCGCCT GCTGCGCGAA
GTCACGTCCG CGCCGCAGGC CGGCCGGCAC GCGCCGCTCG ACGATCTGCG CCTGTCGATC
GCCGGCGCGC AGGAGAAGAC CGCGCTGCTG CGCCATCGCG GCCGCTGGCT GCTGCCCGAA
GGGAGCACGC CGACCACGCA CATCCTGAAG CTGCCGCTCG GGCTCGTCGG CAACCGGCGC
GCCGACATGC GCACGTCGGT CGAGAACGAA TGGCTGTGCG CGCGGATCGT CGCCGCGTAC
GGGTTGCCCG TCGCGCGCTG CGACATCGCG CAGTTCGACG ATCAGAAAGC GCTCGTCGTC
GAGCGCTTCG ACCGCCGGCC GTCGCGCGAC GCACGCTGGC TCCTGCGGCT GCCGCAGGAA
GACATGTGCC AGGCAACCGG CACGTCCGCG CTCGACAAAT ATCAGGCCGA CGGCGGCCCC
GGCATCGAGA CGATCATGGA AGTGCTCGCC GGCTCCGAGC ACGCGCGGGA CGACCGCCGC
GCGTTCTTCG CGGCGCAGAT CGTGTTCTGG CTGCTCGCCG CGACCGACGG CCACGCGAAG
AACTTCAGCA TCGCGCACCT GCCCGGCAAC CGCTACCGTT CGACACCGCT TTACGACGTG
CTGTCCGCGC ATCCGGTCAT CGGCCGGGGC GCGAACCAGT TGCCCGCGCA GCGCGCGCGG
CTCGCGATGG GCGTGCGCGG CAAGCACATC CACTATCCGC TGCACCAGAT CCGGCGGCGG
CACTGGATCG CGCAGGGCCA GCGCGTCGGC TTCGCGCCCG CCGACGTCGA CGCGCTGATC
GACACGCTGA CCGCGCGCAC CGCGGACGTC GTCGACGCGG TGTCGGCGCG GCTGCCGCGC
GATTTTCCGC GCGACGTCGC CGATGCGATC TTCAGCGGAA TGCTCGGCCT GAGCGCAAGG
CTCGCCGGCG ACGCGGCCGC GCGCGCGCCA TGA
 
Protein sequence
MSARRARATR LHLWMNGLPV GYWEHARDGE RLVYFDEWIG DPQGRPLSLS LPFTPGNQPY 
RGRLVSDYFD NLLPDSEPIR RRIAMRYRTG GTSAFALLAT LGRDCVGALQ MLPPDEAPDD
IERIRGHALA DADIARLLRE VTSAPQAGRH APLDDLRLSI AGAQEKTALL RHRGRWLLPE
GSTPTTHILK LPLGLVGNRR ADMRTSVENE WLCARIVAAY GLPVARCDIA QFDDQKALVV
ERFDRRPSRD ARWLLRLPQE DMCQATGTSA LDKYQADGGP GIETIMEVLA GSEHARDDRR
AFFAAQIVFW LLAATDGHAK NFSIAHLPGN RYRSTPLYDV LSAHPVIGRG ANQLPAQRAR
LAMGVRGKHI HYPLHQIRRR HWIAQGQRVG FAPADVDALI DTLTARTADV VDAVSARLPR
DFPRDVADAI FSGMLGLSAR LAGDAAARAP