Gene BURPS668_A1031 details


Gene Information

Locus tag: BURPS668_A1031
Symbol:
ID: 4887539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 995649
End bp: 996767
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 74%
IMG OID: 640130971
Product: zinc-binding dehydrogenase family oxidoreductase
Protein accession: YP_001062030
Protein GI: 126443569
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
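
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included in the gene length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(995649, 996767)
print(length, protein_length_aa(length))
```

With the values from this record, the gene length is 1119 bp and the protein length is 372 aa, matching the fields above.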


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATCGAG CGATACCGCG CGCGGCGGGA TGGTCCGCGC CGGCGGCGCA GCGCTTGCCG 
TGCGCATGGG CGCGCGCCGT GGCCGGCGCA TCCGAGATCC GCGAGGCGCG GCGATCGCGC
GAGCCCGCCG CGCCCGATGA GCAAGGAGCG TCGATGAAAG CTGCTGTCGT ACACCGCGCG
GGCGAGCGCC CGACTTATGC CGAATTCGAG CCGCCGCGCG CGTTGCCCGG CCATCGCCTG
ATCGACGTGA GCGCGTCCGC GTTGAGCCGG CTCGCGCAGG CGCGCGCGTC GGGCGCGCAT
TATTCGTCGA CGGGCGGCTT TCCGTTCGTC GCGGGCGTCG ACGGCGTGGG GCGTCTGGAC
GACGGACGGC GCGTGTACTT CTTCGGCCCG CCGGCGCCGT TCGGCGCGCT GGCCGAGCGT
ACCCTCGTGC CAGCAGCGCA GTGCATCCCG TTGCCCGATT CGATCGACGA TGCGACGGCG
GCGGCCATCG CGATTCCGGG CATGTCGTCG TGGGCGGCGT TGACCGAGCG CGCGCGGCTC
GCCGCGGGCG AGACGGTGCT CGTGAACGGC GCGACGGGCG CGTCGGGGCG GCTCGCGGTG
CGCATCGCGA AGCATCTCGG CGCCGCGAGC GTGATCGCGA CGGGGCGCAA CGCACACGCG
CTCGATGCGC TGAGCTCCGC GGGCGCCGAC GCGACGATCT CGCTTGCGCA GGATGACGAA
CAGGTGGCGC GCGCGTTCGA GGCGCACTTT CGCGCGGGCG TGGATGTCGT GCTCGATTAT
CTGTGGGGCG CGAGTGCGCG CGCGGCCCTG CTCGCCGCGG CGAAGGCGCC GCAGCAGGCG
CGCCCGGTGC GCTTCGTGCA GATCGGCACG ATCGGCGGTG CCGAACTGCC GTTGCCGGGC
GCGGTGCTGC GCGCGAGCGC GATCACGCTG ATGGGCAGCG GGCTCGGCAG CATCGCGCTG
CCGCGCCTGC TGAACGCGGC GAGGGCGGTG CTCGGCGCGG CGTGCGAGGC CCGGCTGCGG
ATCGACACGC GAACCGTGCC GCTCGCGGAC GTCGACGCGC ATTGGGGCGA CACGGGCAGC
ACGCTACGCC CGGTGTTCAC GATGCGCGCG CCGGGATGA
 
Protein sequence
MHRAIPRAAG WSAPAAQRLP CAWARAVAGA SEIREARRSR EPAAPDEQGA SMKAAVVHRA 
GERPTYAEFE PPRALPGHRL IDVSASALSR LAQARASGAH YSSTGGFPFV AGVDGVGRLD
DGRRVYFFGP PAPFGALAER TLVPAAQCIP LPDSIDDATA AAIAIPGMSS WAALTERARL
AAGETVLVNG ATGASGRLAV RIAKHLGAAS VIATGRNAHA LDALSSAGAD ATISLAQDDE
QVARAFEAHF RAGVDVVLDY LWGASARAAL LAAAKAPQQA RPVRFVQIGT IGGAELPLPG
AVLRASAITL MGSGLGSIAL PRLLNAARAV LGAACEARLR IDTRTVPLAD VDAHWGDTGS
TLRPVFTMRA PG
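
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequence is given in coding orientation and uses the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code; the alternative start codons of table 11 are not handled because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    Codons containing characters other than T, C, A, G are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGCATCGAG CGATACCGCG CGCGGCGGGA"))
print(translate("CCGGGATGA"))
```

The two calls reproduce the first ten residues (MHRAIPRAAG) and the last two residues (PG) of the protein sequence listed above.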