Gene BURPS1710b_0918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_0918 
SymboldgoA 
ID3689640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp954733 
End bp955881 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content67% 
IMG OID637727374 
Productgalactonate dehydratase 
Protein accessionYP_332331 
Protein GI76809443 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.462959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCA CCCGCCTCGA AACCTTCGTC GTGCCGCCTC GATGGCTGTT CCTGAAGATC 
GAGACCGACG CGGGCATCGT CGGCTGGGGC GAGCCGATCG TCGAAGGCCG CGCGCATACG
GTCGAGGCGG CCGTGCACGA GCTCGCCGAC TACCTCGTCG GCCAGGATCC GCTTCGCATC
GAGGACCACT GGCAGGTGAT GTACCGCGCG GGCTTCTACC GCGGCGGGCC GATCACGATG
AGCGCGATCG CGGGCATCGA CCAGGCGCTC TGGGACATCA AGGGCAAGCA TCACGGCGCG
CCCGTGCATG CGCTCCTCGG CGGCCCGGTG CGCGAGCGGA TCAAGGTGTA TTCGTGGATC
GGCGGCGATC GGCCGAGCGA CGTCGCGAAC AACGCGCGCG CGGTCGTCGA ACGCGGCTTC
CAGGCGGTGA AGATGAACGG CTCGGAAGAG CTGCAGATCG TCGACACCTT CGACAAGGTC
GACAAGGTGA TCGCGAACGT CGCGGCGGTG CGCGACGCGG TGGGCCCGTA CGTCGGCATC
GGCGTCGATT TCCACGGCCG CGTGCACAAG CCGATGGCGA AGGTACTCGC CAGGGAGCTC
GATCCGTACA AGCTGATGTT CATCGAGGAG CCCGTGCTGT CGGAGAACGC CGAGGCGCTG
CGCGACATCG CGAACCAGAC GAGCACGCCG ATCGCGCTCG GCGAGCGGCT CTACTCGCGC
TGGGATTTCA AGCGCATTCT CGAAGGCGGC TACGTCGACA TCGTGCAGCC CGACGCGTCG
CACGCGGGCG GGATCACCGA GTGCCGGAAG ATCGCGACGC TCGCGGAAAG CTACGACGTC
GCGCTCGCGC TGCACTGCCC GCTCGGGCCG ATCGCGCTCG CCGCGTGCCT GCAGCTCGAC
GCGGTCAGCT ACAACGCGTT CATTCAGGAG CAGAGCCTCG GCATTCACTA CAACCAGGGC
AGCGATCTGC TCGACTATCT GCGCAACCCG GACGTGTTCC GCTACGCGGA CGGCTTCGTC
GCGATTCCGC AGGGGCCCGG GCTCGGCATC GACGTCGACG AGGACAAGGT GTGCGAGATG
GCGAAAACCG GGCACCGCTG GCGTAATCCG GTATGGCGGC ACGCGGACGG CAGCGTCGCC
GAGTGGTGA
 
Protein sequence
MKITRLETFV VPPRWLFLKI ETDAGIVGWG EPIVEGRAHT VEAAVHELAD YLVGQDPLRI 
EDHWQVMYRA GFYRGGPITM SAIAGIDQAL WDIKGKHHGA PVHALLGGPV RERIKVYSWI
GGDRPSDVAN NARAVVERGF QAVKMNGSEE LQIVDTFDKV DKVIANVAAV RDAVGPYVGI
GVDFHGRVHK PMAKVLAREL DPYKLMFIEE PVLSENAEAL RDIANQTSTP IALGERLYSR
WDFKRILEGG YVDIVQPDAS HAGGITECRK IATLAESYDV ALALHCPLGP IALAACLQLD
AVSYNAFIQE QSLGIHYNQG SDLLDYLRNP DVFRYADGFV AIPQGPGLGI DVDEDKVCEM
AKTGHRWRNP VWRHADGSVA EW