Gene BURPS668_A2451 details


Gene Information

Locus tag: BURPS668_A2451
Symbol: leuB
ID: 4887401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2370223
End bp: 2371290
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 68%
IMG OID: 640132388
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_001063445
Protein GI: 126443320
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00169] 3-isopropylmalate dehydrogenase
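
The coordinate and length fields in this record agree with one another, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2370223
end_bp = 2371290
gene_length_bp = 1068
protein_length_aa = 355

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
residues = codons - 1

print(span, residues)  # 1068 355
```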


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.172832
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATTG CAGTGCTGCC CGGCGACGGC ATCGGTCCGG AAATCGTCAA TGAAGCGGTG 
AAGGTGCTGA ACGCGCTCGA CGAAAAGTTC GAACTGGAGC AGGCGCCGGT CGGCGGCGCC
GGCTACGAGG CAAGCGGCCA TCCGTTGCCC GACGCGACGC TCGCGCTCGC GAAGGAAGCG
GACGCGATCC TGTTCGGCGC GGTCGGCGAC TGGAAGTACG ATTCGCTCGA GCGCGCGCTG
CGCCCCGAGC AGGCGATCCT CGGCCTGCGC AAGCATCTGG AGCTGTTCGC GAACTTCCGT
CCGGCGATCT GCTATCCGCA GCTCGTCGAC GCTTCGCCGC TCAAGCCCGA GCTCGTCGCG
GGCCTCGACA TCCTGATCGT GCGCGAACTG AACGGCGATA TCTACTTCGG CCAGCCGCGC
GGCGTGCGCG CCGCGCCGGA CGGCCCGTTC GCGGGCGCGC GCGAAGGCTT CGACACGATG
CGCTATTCGG AGCCGGAAGT GCGCCGCATC GCGCACGTCG CGTTCCAGGC CGCGCGAAAG
CGCGCGAAGA AGCTGCTGTC GGTCGACAAA TCGAACGTGC TCGAGACGTC GCAGTTCTGG
CGCGACGTGA TGATCGACGT GTCGAAGGAA TACGCGGACG TCGAGCTGTC GCACATGTAC
GTCGACAACG CGGCGATGCA GCTCGCGAAG GCGCCGAAGC AGTTCGACGT GATCGTGACG
GGCAACATGT TCGGCGACAT TTTGTCCGAC GAGGCGTCGA TGCTGACGGG CTCGATCGGC
ATGCTGCCGT CCGCGTCGCT CGACCAGCGC AACAAGGGCC TGTACGAGCC GTCGCACGGC
TCCGCGCCGG ACATCGCGGG CAAGGGCATC GCGAATCCGC TCGCGACGAT CCTGTCGGCC
GCGATGCTGC TGCGCTACTC GCTGAACCGC GCGGAGCAGG CCGACCGCAT CGAGCGCGCG
GTCAAGGCGG TGCTCGAGCA GGGCTACCGC ACGGGCGACA TCGCGACGCC GGGCTGCAAG
CAGGTGGGCA CGGCCGCGAT GGGCGACGCG GTGGTCGCGG CGCTGTAA
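
The sequence is printed in space-separated blocks of ten bases, so the grouping has to be removed before any length or GC calculation. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first line of the gene sequence, so its GC fraction is not expected to equal the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for 10-letter grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here only as a short sample.
sample = "ATGAAGATTG CAGTGCTGCC CGGCGACGGC ATCGGTCCGG AAATCGTCAA TGAAGCGGTG"
seq = clean_sequence(sample)
print(len(seq), f"{gc_fraction(seq):.2f}")
```

Pasting the full gene sequence into `sample` should give a length that can be compared with the Gene Length field and a GC fraction that can be compared with the GC content field.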
 
Protein sequence
MKIAVLPGDG IGPEIVNEAV KVLNALDEKF ELEQAPVGGA GYEASGHPLP DATLALAKEA 
DAILFGAVGD WKYDSLERAL RPEQAILGLR KHLELFANFR PAICYPQLVD ASPLKPELVA
GLDILIVREL NGDIYFGQPR GVRAAPDGPF AGAREGFDTM RYSEPEVRRI AHVAFQAARK
RAKKLLSVDK SNVLETSQFW RDVMIDVSKE YADVELSHMY VDNAAMQLAK APKQFDVIVT
GNMFGDILSD EASMLTGSIG MLPSASLDQR NKGLYEPSHG SAPDIAGKGI ANPLATILSA
AMLLRYSLNR AEQADRIERA VKAVLEQGYR TGDIATPGCK QVGTAAMGDA VVAAL
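
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand rather than relying on an external library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here. The function name `translate` and the table layout are choices made for illustration.

```python
# Codon table in TCAG order; amino acid assignments are shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAGATTG CAGTGCTGCC CGGCGACGGC ATCGGTCCGG AAATCGTCAA TGAAGCGGTG"
print(translate(first_line))  # MKIAVLPGDGIGPEIVNEAV
```

The output matches the first twenty residues of the protein sequence listed above. Passing the complete gene sequence should yield the full protein, ending at the terminal TAA codon.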