Gene BURPS668_A2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2097 
Symbol 
ID4887514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2036147 
End bp2037178 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content70% 
IMG OID640132035 
Productputative dehydrogenase 
Protein accessionYP_001063092 
Protein GI126445481 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAGCG TCGTCGTCGA CCAGCCGCAT AGCATGGGCG TGCGCGAAAT GCCCACGCCC 
GAGCCCGCCG CGGGCGAAGT GCGCGTGCGC GTGCGCTATG CGGGCATCTG CGGATCGGAT
CTGCACATCT TCCACGGCAA GAACCCGTTC GTCTCGTATC CGCGCGTCAT CGGGCACGAG
TTCGTCGGGC GAATCGAATC GGTCGGCGCG GGCGTCGACG CGTCGCGCAT CGGCGAGATC
GTCGCAGTCG ACCCGGTCAT CAGTTGCGGG CGCTGCCACG CATGCGCGAT CGGCCGGCGC
AACGTGTGCC GCAGCCTGAC CGTGCTCGGC GTGCATCGCG ACGGCGGCTT CAGCGAGTAC
GCCTGCGTGC CCGCCGCGAA CGCCCACCGG ATTGCGCCCG AGATCGCCGA CACGTGCGCT
GCGATCGTCG AGCCGTTCGC GGTCGCCGCG AACGCGACCG CGCGCACCGG CGTGCTGCCG
TCCGACGTCG CGCTGATCTA CGGCGCGGGC ACCGTCGGCC TCACGCTGCT GCAAGTGCTC
AAGCACGTCT ACGGCATTCG CGCGTTCATC GCCGATCGCC TCGACGAGCG TCTCGCGCTC
GCGCGCAAGT GCGGCGCGGC GGCCGACGAA GTCATCCACG CGGCAACGGA AACGGTGCCG
GACGCGCTCG AGCGACGCGG CGTCGACGGC GGCCCGACGC TGATCTTCGA CGCGGTGTGC
CATCCGTCGA TCCTCGAGGA GGCGGTGCGG CTCGCGGCGC CCGCCGCGCG CATCGGTGTG
CTCGGCTTCT CGTCGGAGCC GTCGTCGATC GTGCAGGCCG AGCTGACGAA GAAGGAATTG
ACGCTGTGCG CGTCGCGCCT GAACTGCGCG ATGTTCCCGC AGGTCATCGA ATGGATCGCC
GACGGGCGCG TGCATCCGGA GCACATCGTC ACGCACACGC TCGATTTTCG CGATGTCGCG
CGCGCGTTCG AGCTCGCCGA GCGCAACCCG CGCGAAAGCT GCAAGATCCT GCTGGATTTC
GCCGCGCATT GA
 
Protein sequence
MLSVVVDQPH SMGVREMPTP EPAAGEVRVR VRYAGICGSD LHIFHGKNPF VSYPRVIGHE 
FVGRIESVGA GVDASRIGEI VAVDPVISCG RCHACAIGRR NVCRSLTVLG VHRDGGFSEY
ACVPAANAHR IAPEIADTCA AIVEPFAVAA NATARTGVLP SDVALIYGAG TVGLTLLQVL
KHVYGIRAFI ADRLDERLAL ARKCGAAADE VIHAATETVP DALERRGVDG GPTLIFDAVC
HPSILEEAVR LAAPAARIGV LGFSSEPSSI VQAELTKKEL TLCASRLNCA MFPQVIEWIA
DGRVHPEHIV THTLDFRDVA RAFELAERNP RESCKILLDF AAH