Gene BURPS1710b_A1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1366 
Symbol 
ID3694385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1673423 
End bp1674781 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content74% 
IMG OID637731620 
Productglutathione-dependent formaldehyde dehydrogenase 
Protein accessionYP_336523 
Protein GI76818029 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.987055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCATG TCGCATGCCG GGTTCGCGCG CACGTCGCGC GTACGTCGGA CGGCGCGCGT 
CGCACTTCAT ACATCGCACG TCGCGGCCCG CACGCGGTAC GCGGCCCCCG GCCGGGCCCG
ATCCGGTTCG CCCCCACCGC CACGCATGCG GCGCGCGCGC CGTCCCACCC GGAGGAACTC
GCGATGAAGG CACTTTGCTG GCACGGCAAG CATACGGTCC GATACGAGAC TGTCGCGGAC
CCGGCCGTCG AGGACCCGCG CGACGCGATC GTCGAGGTTC GCGCGTGCTC GATCGGCGGC
GCGGACCTGC ATCTGTACGA CGCGGTGATC CCCGGCCTGA AGGATGGCGA CGTGCTCGGC
CGCGAATGCG TCGGCGAGGT CGTCGAGATC GGCGCGGGCG TATCGCGGCG GCGCGTCGGC
GAGCGCGTCG TCGTGCCGTT CGCGATCGCG TGCGGCGAGT GCGACCAGTG CCGGCGCGGC
AACTGGTCGG TATGCGAGAC GACGAACCCG CACCGCGCGC TGGCCGACAA GGTGTTCGGC
CACGCCACCG GCGGCCTGCT CGGCTGCGGC CATCTGGCGG GCGGCTATCC GGGCGGACAG
GCGCAATACG TGCGCGTGCC GCACGCGGAC GTCGCGCCCG CGACGATACC GGACGGCGTC
GACGACGAGC GCGCGCTGTT CGTCGGCGAC AGCCTCGCGA CCGGCTGGCA GGCGGCCGCG
CAGTGCGAGA TCGAGCCGGG CGACGTCGTC GCGGTGTGGG GCGCGGGGGC GGTGGGGCTG
TTCGCGGCGA TGAGCGCGCG CCTGCTCGGC GCGGCCGAGG TGATCGCGAT CGACCGCGTG
CCGGAGCGGC TCGCGCTCGC GCAGAAGCTC GGCGCGACGC CGCTCGATTT CGAGCGGCTC
GGCGTGGCCG ATACGCTCGC CGAGCGCACG CGCGGCAAGG GTCCGGACAA ATGCATCGAC
GCGGTCGGGC TGGAGGCGCA GGCGGGCGAG GCGCTCGACG CGGTGACCGA CCGCTTCAGG
CCGACCGTGG CCGCGGGCGC CGATCAGCCG CACGTGCTGC GCGAGATGAT CTACGCGTGC
CGGCCGGGCG GCGTGCTATC GGTGCCGGGC GTCTACGGCG GGCTCGTCGA CAAGCTGCCG
ATGGGCGCGC TGATGCACAA GGGCCTCACG CTGCGCGCGG GGCAGACGCA CGTCTGCCGC
TGGACGGCCG GGCTGCTCGA GCGGATCGCC GACGGGCGGC TCGATCCGTC GTGCGTCATC
ACGCATCGCG CGACGCTCGA AGAGGGGCCT GCGATGTATG AAACCTTCGG CGCGCGGCGC
GACGGCTGCA TTCGGCCGGT GCTGCGGCCC CGGCATTGA
 
Protein sequence
MSHVACRVRA HVARTSDGAR RTSYIARRGP HAVRGPRPGP IRFAPTATHA ARAPSHPEEL 
AMKALCWHGK HTVRYETVAD PAVEDPRDAI VEVRACSIGG ADLHLYDAVI PGLKDGDVLG
RECVGEVVEI GAGVSRRRVG ERVVVPFAIA CGECDQCRRG NWSVCETTNP HRALADKVFG
HATGGLLGCG HLAGGYPGGQ AQYVRVPHAD VAPATIPDGV DDERALFVGD SLATGWQAAA
QCEIEPGDVV AVWGAGAVGL FAAMSARLLG AAEVIAIDRV PERLALAQKL GATPLDFERL
GVADTLAERT RGKGPDKCID AVGLEAQAGE ALDAVTDRFR PTVAAGADQP HVLREMIYAC
RPGGVLSVPG VYGGLVDKLP MGALMHKGLT LRAGQTHVCR WTAGLLERIA DGRLDPSCVI
THRATLEEGP AMYETFGARR DGCIRPVLRP RH