Gene BURPS1106A_2151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2151 
SymboltkrA 
ID4899438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2135624 
End bp2136601 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content73% 
IMG OID640135381 
Productgluconate 2-dehydrogenase 
Protein accessionYP_001066416 
Protein GI126455132 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1052] Lactate dehydrogenase and related dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00341832 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCATC GCATCGTCGT CTACAAGCCG CTCCCCGACG ACGTGCTCGC GGCCTTGCGC 
GCGCGCGCCG ACGTCGTGCT CGCCGAGGGC GCCGACGCGC TCGCGCGCGC GCTGCCCGAC
GCCGACGGCG CGCTCGGCGC GAGCCTGCGG ATCACGCCCG AGCTGCTTGA TCGCGCACCG
CGGCTGCGCG CGTGGTCGAC GATCTCGGTC GGCTTCGACA ACTTCGACGT CGCCGATCTG
ACGCGCCGCG GGATCGTGCT CGCGCACACG CCCGACGTGC TCACCGAGGC GACCGCCGAC
ACCGTGTTCG CGCTGATCCT CGCGAGCGCG CGGCGCGTCG TGGAGCTCGC CGAATACGTG
AAGGCGGGGC AGTGGCGCCA GAGCATCGGC GAGGCGCTGT ACGGCACCGA CGTGAACGGC
AAGACGCTCG GCATCGTCGG GCTCGGGCGC ATCGGCACGG CGCTCGCGCG GCGCGCGGCG
CTCGGCTTCC GGATGCCGGT GCTCTACACG AGCCGCAGCG CGCATCCGCA GGCCGAGGCG
CAGTTCGGCG CGCGCCGCGT CGAGCTCGAC GAGCTGCTCG CCACGGCCGA TTTCGTGTGC
CTGCAGGTGC CGCTTTCGCC GCAGACGCGG CACCTGATCG GCGCGCGCGA ACTCGCGAAG
ATGAAGCGCG ACGCGATACT CGTGAACGCG TCGCGCGGGC CCGTCGTCGA CGAGGCGGCG
TTAATCGACG CGCTGCGCGC GGGAGCGATC CGTGCGGCGG GGCTCGACGT GTTCGAGCAC
GAGCCGCTCG CCGCGGATTC GCCGTTGCTG TCGATGCGCA ACGTCGTCGC GCTGCCGCAC
ATCGGCTCGG CGACGCGCGA GACGCGCCAC GCGATGGCGC GCTGCGCGGC CGAGAACGTG
ATCGCGGCGC TCGACGGCAC GCTCGCGCGC AATATCGTCA ATCGCGACGT GCTGCAGCGC
ACGCCGTCGA CGCCGTGA
 
Protein sequence
MKHRIVVYKP LPDDVLAALR ARADVVLAEG ADALARALPD ADGALGASLR ITPELLDRAP 
RLRAWSTISV GFDNFDVADL TRRGIVLAHT PDVLTEATAD TVFALILASA RRVVELAEYV
KAGQWRQSIG EALYGTDVNG KTLGIVGLGR IGTALARRAA LGFRMPVLYT SRSAHPQAEA
QFGARRVELD ELLATADFVC LQVPLSPQTR HLIGARELAK MKRDAILVNA SRGPVVDEAA
LIDALRAGAI RAAGLDVFEH EPLAADSPLL SMRNVVALPH IGSATRETRH AMARCAAENV
IAALDGTLAR NIVNRDVLQR TPSTP