Gene BURPS1710b_A1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1541 
Symbol 
ID3693130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1871734 
End bp1873713 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content71% 
IMG OID637731795 
Productaldehyde dehydrogenase (NADP) family protein 
Protein accessionYP_336698 
Protein GI76818785 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.211116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGCTGT TCGGCTTCAA GTCGATCGTT GCGCGAGCCC GCGCCGAAGC CGATATCCCG 
TCATGGGCCC TCAACCGGCG GCTCGCGTTC GCGACGATTG CAATCCCGGC CGGGAATGCC
GCGTGTTCGC ATTCGTTTGC CGGACGGCGC GCATGCGCGC AGGACATGGC GACCGGTGCG
GGCACGCGTG CGCCGGTCGG GATGCGCGGC GCACGCGTTG CCGTCTGCCA CGATAATGGA
ATGCATGCGC GCCGGGGCGG TGCGATCCGT CGAACCGGCC GGTCGCCTCG ACTCGTTCAA
TTCGACCCAT TCGACCCATT CGACCCATTC GTCCGCAGGC GGCGCACGCC GCATCGCGAA
GCGCACCGCC CGCGGCCAGG TTTCCAATTC GGAGGAAGCA TGCAGATCAC CGGCGAGATG
TTGATTGGCG CGGCCGCGGT GCGCGGTAGC GAAGGCACGA TGCGCGCTTA CGCGCCGGCG
CAGGGCGTCG AGCTCGAGCC GACGTTCGGC GCGGGCGGTG CGGCCGACGT CGATCGCGCG
TGCCGCCTCG CGAACGCCGC TTTCGATCCC TTTCGTCAGG CGCCGCTCGA GACGCGCGCA
CGCTTTCTCG AGGCGATCGC CGAGCGCATC GTCGGGCTCG GCGATCCATT GATCGAACGC
GCGCACGCGG AATCGGCGCT GCCCGTCGCG CGGCTCGAAG GCGAGCGCGC GCGCACGGTC
GGTCAGCTCA GGCTCTTCGC GGCGATCGTG CGCGACGGCC GCTGGCTGAG CGCGACGCTC
GATTCCGCGC AGCCCGAGCG CAAGCCGCTG CCGCGCGCCG ATCTGCGCTT GCAGAAGATT
CCCGTCGGCC CGGTCGCGGT GTTCGGCGCG AGCAATTTCC CGCTCGCGTT CTCGGTCGCG
GGCGGCGACA CCGCTTCGGC GTTCGCGGCC GGCTGCCCCG TCGTCGCGAA GGCGCACCCC
GCGCATCTCG GCACGTCGGA GCTCGTCGGG CGCGCGATCC GGCAGGCTGT CGCCGATTGC
GGTTTGCACG AGGGCGTGTT CTCGCTCGTC GTCGGCGCGG GCAACGCGAT CGGCGAGGCG
CTCGTCGCGC ATCCCGCGAT CAGGGCGGTC GGCTTCACCG GCTCGCGCGC GGGCGGCCTT
GCGCTGATGG GCGTTGCCGC GCGGCGGCAC GAGCCGATTC CGGTCTTCGC GGAAATGAGC
AGCATCAATC CGTTCTTCGT GTTGCCCGGC GCGTTGCGCG CACGCGGTGC GCAAATCGCG
CAAGGCTTCG TCGAATCGCT GACGCTCGGC GTCGGGCAGT TCTGCACGAA CCCGGGGCTC
GTCGTCGCGC TCGAAGGGCC CGACCTGAAG GCGTTCGTCG ACGCGGCCGC GCAGGCGCTC
TCGCAAAAGG GCGCGCAGAC GATGCTGACC TCGGGCATCG CGTCGTCTTA CGAGAGCGCG
GTCGCGGCGC GCCGCGCGGC CGCGGGCGTC AGCGAGGTCG CGCGCGGCGT GCGCAGCGAC
GCGCGGAACG CCGCGTTGCC CGCGCTCTTC ACGACGACGC ACACGCAGTT CGTCCAGAAC
CCGCAGCTCG AAGCCGAGAT CTTCGGGCCG ACGTCGCTCG TCGTCGCGTG CCGCGACATC
GACGAGATGA TCGCGCTTGC CGAGCATGTC GAGGGGCAAC TGAGCGCGAC GCTGCATCTC
GAAGACGACG ATGTCGATCT GGCGCGCAAA CTGCTGCCGA CGCTCGAGCG CCGCGCCGGC
CGCATCGTCG CGAACGGCTA TCCGACGGGC GTCGAGGTCG CGTACGCGAT GGTGCACGGC
GGGCCGTTTC CGGCGACGTC GGACCCGCGC AGCACATCGG TGGGCGCGCT TGCGATCGAG
CGCTTCCTGC GGCCCGTCTG CTATCAGGAT TTGCCGGCGG CGTTGTTGCC CGAGGCGCTC
GCCGACGCGA ATCCGCTCGG CCTCTGGCGC CTGCGCGACG GCCAACTCGG CAAGGCGTGA
 
Protein sequence
MWLFGFKSIV ARARAEADIP SWALNRRLAF ATIAIPAGNA ACSHSFAGRR ACAQDMATGA 
GTRAPVGMRG ARVAVCHDNG MHARRGGAIR RTGRSPRLVQ FDPFDPFDPF VRRRRTPHRE
AHRPRPGFQF GGSMQITGEM LIGAAAVRGS EGTMRAYAPA QGVELEPTFG AGGAADVDRA
CRLANAAFDP FRQAPLETRA RFLEAIAERI VGLGDPLIER AHAESALPVA RLEGERARTV
GQLRLFAAIV RDGRWLSATL DSAQPERKPL PRADLRLQKI PVGPVAVFGA SNFPLAFSVA
GGDTASAFAA GCPVVAKAHP AHLGTSELVG RAIRQAVADC GLHEGVFSLV VGAGNAIGEA
LVAHPAIRAV GFTGSRAGGL ALMGVAARRH EPIPVFAEMS SINPFFVLPG ALRARGAQIA
QGFVESLTLG VGQFCTNPGL VVALEGPDLK AFVDAAAQAL SQKGAQTMLT SGIASSYESA
VAARRAAAGV SEVARGVRSD ARNAALPALF TTTHTQFVQN PQLEAEIFGP TSLVVACRDI
DEMIALAEHV EGQLSATLHL EDDDVDLARK LLPTLERRAG RIVANGYPTG VEVAYAMVHG
GPFPATSDPR STSVGALAIE RFLRPVCYQD LPAALLPEAL ADANPLGLWR LRDGQLGKA