Gene BURPS668_0740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0740 
Symbol 
ID4883990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp717250 
End bp718398 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content67% 
IMG OID640126668 
Productgalactonate dehydratase 
Protein accessionYP_001057792 
Protein GI126442264 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCA CCCGCCTCGA AACCTTCGTC GTGCCGCCTC GATGGCTGTT CCTGAAGATC 
GAGACCGACG CGGGCATCGT CGGCTGGGGC GAGCCGATCG TCGAAGGCCG CGCGCATACG
GTCGAGGCGG CCGTGCACGA GCTCGCCGAC TACCTCGTCG GCCAGGATCC GCTTCGCATC
GAGGACCACT GGCAGGTGAT GTACCGCGCG GGCTTCTACC GCGGCGGGCC GATCACGATG
AGCGCGATCG CGGGCATCGA CCAGGCGCTC TGGGACATCA AGGGCAAGCA TCACGGCGCG
CCCGTGCATG CGCTCCTCGG CGGCCCGGTG CGCGAGCGGA TCAAGGTGTA TTCGTGGATC
GGCGGCGATC GGCCGAGCGA CGTCGCGAAC AACGCGCGCG CGGTCGTCGA ACGCGGCTTC
CAGGCGGTGA AGATGAACGG CTCGGAAGAG CTGCAGATCG TCGACACCTT CGACAAGGTC
GACAAGGTGA TCGCGAACGT CGCGGCGGTG CGCGACGCGG TGGGCCCGTA CGTCGGCATC
GGCGTCGATT TCCACGGCCG CGTGCACAAG CCGATGGCGA AGGTGCTCGC CAGGGAGCTT
GATCCGTACA AGCTGATGTT CATCGAGGAG CCCGTGCTGT CGGAGAACGC CGAGGCGCTG
CGCGACATCG CGAACCAGAC GAGCACGCCG ATCGCGCTCG GCGAGCGGCT CTACTCGCGC
TGGGATTTCA AGCGCATTCT CGAAGGCGGC TACGTCGACA TCGTGCAGCC CGACGCGTCG
CACGCGGGCG GGATCACCGA GTGCCGGAAG ATCGCGACGC TCGCGGAATG CTACGACGTC
GCGCTCGCGC TGCACTGCCC GCTCGGGCCG ATCGCGCTTG CCGCGTGCCT GCAGCTCGAC
GCGGTCAGCT ACAACGCGTT CATTCAGGAG CAGAGCCTCG GCATTCACTA CAACCAGGGC
AGCGATCTGC TCGACTATCT GCGCAACCCG GACGTGTTCC GCTACGCGGA CGGCTTCGTC
GCGATTCCGC AGGGGCCCGG GCTCGGCATC GACGTCGACG AGGACAAGGT GCGCGAGATG
GCGAAAACCG GGCACCGCTG GCGCAATCCG GTATGGCGGC ACGCGGACGG CAGCGTCGCC
GAGTGGTGA
 
Protein sequence
MKITRLETFV VPPRWLFLKI ETDAGIVGWG EPIVEGRAHT VEAAVHELAD YLVGQDPLRI 
EDHWQVMYRA GFYRGGPITM SAIAGIDQAL WDIKGKHHGA PVHALLGGPV RERIKVYSWI
GGDRPSDVAN NARAVVERGF QAVKMNGSEE LQIVDTFDKV DKVIANVAAV RDAVGPYVGI
GVDFHGRVHK PMAKVLAREL DPYKLMFIEE PVLSENAEAL RDIANQTSTP IALGERLYSR
WDFKRILEGG YVDIVQPDAS HAGGITECRK IATLAECYDV ALALHCPLGP IALAACLQLD
AVSYNAFIQE QSLGIHYNQG SDLLDYLRNP DVFRYADGFV AIPQGPGLGI DVDEDKVREM
AKTGHRWRNP VWRHADGSVA EW