Gene BURPS668_1911 details

Gene Information

Locus tag: BURPS668_1911
Symbol:
ID: 4884010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1872733
End bp: 1873824
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 69%
IMG OID: 640127839
Product: zinc-binding dehydrogenase family oxidoreductase
Protein accession: YP_001058946
Protein GI: 126439608
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
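
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(1872733, 1873824)
    print(length)                  # 1092
    print(protein_length(length))  # 363
```

Both values match the Gene Length and Protein Length fields of the record.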


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.197112
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAATC GAAACGACGA AACGCAGCAG ATGACGGCGA TCGTCTGCCA CGCCCCCGAG 
GACTACCGCG TCGAGCGCGT CGCGAAGCCG CGCGCGAACG CGCGCGAGCT CGTGATCCGC
ATCGGCGCGT GCGGCATCTG CGCGAGCGAC TGCAAGTGCC ACGCCGGCGC GAAGATGTTC
TGGGGCGGGC CGAGCCCGTG GGTCAAGGCA CCCGTGATTC CCGGCCACGA GTTCTTCGGC
TACGTCGAGG CGCTGGGCGA GGGCGCGGCC GAGCACTTCG GCGTCGCGCT CGGCGATCGC
GTGATCGCCG AGCAGATCGT GCCGTGCGGC ACGTGCCGCT ATTGCAAGTC GGGCCAGTAC
TGGATGTGCG AGGTCCATCA CATCTTCGGC TTTCAGCGCG AGGTCGCCGA CGGCGGGATG
GCCGAGTACA TGCGCATACC GTCGGGCGCG ATCGTCCACC CGGTCCCGCT CGGCATCTCG
CTCGAGGACG CGGCGATCAT CGAGCCGCTC GCGTGCGCGA TCCACACGGT CAATCGCGGC
GACATCCAGC TCGACGACGT CGTCGTGATC GCGGGCGCGG GCCCGCTCGG CCTGATGATG
ACGCAGGTCG CGAAGCTGAA GACGCCCAGG CGGCTCGTCG TCGTCGATCC CGTCGAAGCG
CGGCGCGCGC TCGCGCGCGC ATACGGCGCC GACGTGACGA TCGATCCGGC CCGCGAGGAC
GCGCCCGCGA TCGTGCGCGC GCTGACGGGG GGCTACGGCT GCGACGTCTA CATCGAGACG
ACCGGCGTGC CGGCGGGCGT CACGCAGGGC ATGGCGCTGA TCCGCAAGCT CGGCCGCTTC
GTCGAGTTCT CGGTGTTCGG CAAGGATACG GCGCTCGACT GGTCGATCAT CGGCGATCGC
AAGGAGCTCG ATGTGCGCGG CGCGCATCTC GGCCCGTATT GCTATCCTGT CGCGATCGAT
CTGCTCGCGC GCGGGCTCGT CACGTCGAAG GGCATCGTCA CGCACGGCTT CTCGCTCGAC
GAATGGGACG AGGCGATCCG GGTCGCGAAC TCGCTCGACT CGATCAAGGT GCTGATGAAG
CCGCGCGGCT GA
 
Protein sequence
MTNRNDETQQ MTAIVCHAPE DYRVERVAKP RANARELVIR IGACGICASD CKCHAGAKMF 
WGGPSPWVKA PVIPGHEFFG YVEALGEGAA EHFGVALGDR VIAEQIVPCG TCRYCKSGQY
WMCEVHHIFG FQREVADGGM AEYMRIPSGA IVHPVPLGIS LEDAAIIEPL ACAIHTVNRG
DIQLDDVVVI AGAGPLGLMM TQVAKLKTPR RLVVVDPVEA RRALARAYGA DVTIDPARED
APAIVRALTG GYGCDVYIET TGVPAGVTQG MALIRKLGRF VEFSVFGKDT ALDWSIIGDR
KELDVRGAHL GPYCYPVAID LLARGLVTSK GIVTHGFSLD EWDEAIRVAN SLDSIKVLMK
PRG
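
The relationship between the gene sequence and the protein sequence can be checked with a minimal sketch in plain Python. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the 10-base grouping with spaces is assumed to be display formatting only.

```python
# Codon table built in the conventional TCAG order; amino acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence in this record.
    print(translate("ATGACGAATC GAAACGACGA AACGCAGCAG"))  # MTNRNDETQQ
    print(translate("CCGCGCGGCT GA"))                      # PRG*
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.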