Gene BURPS1106A_A1925 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1925 
Symbol 
ID4906251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1885311 
End bp1886282 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content72% 
IMG OID640145031 
ProductNAD-dependent epimerase/dehydratase family protein 
Protein accessionYP_001075959 
Protein GI126457337 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.281448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTC TCATCACCGG CGGCGCCGGC TTTCTCGGCC AGCGTCTCGC GAAACAGCTG 
CTCGCGCGCG GCAAGCTGAC CGGCCCGAAC GGCGCGCCGC GGCGCATCGA CGAGCTCGTG
CTGCTCGACG TCGTCCAGGC CCACGACTTC GATGATGCGC GCGTGACGGC GCGCGTCGGC
GACATCGCCG ATCGCGCCGT GCTCGAGGCC GCGATCGACG CGCGCACGCA CGCGGTCTTC
CATCTCGCGG CGATCGTGAG CGGCCAGGCG GAAGCCGATT TCGACCTCGG GATGCGGATC
AACCTCGATG CGTCGCGCCT GTTGCTCGAC GTGTGCCGCG CGCGCGGGCA CCGGCCGCGC
GTGGTGTTCA CGAGCTCGGT GGCGGTGTAC GGCGGCGCGC TGCCCGAACT CGTGCGGGAC
GACACCGCGC TCGAACCGCA GTCGTCGTAC GGCGCGCAGA AGGCAATCGC CGAGTTGCTG
CTGTCCGATT ACACGCGCCG CGGCTTCGTC GACGGGCGCG TGCTGCGGCT GCCGACGATC
AGCGTGCGGC CGGGCCGGCC GAACGCGGCG GCTTCGTCGT TCGCGAGCGG GATCGTCCGC
GAGCCGCTGA ACGGCGAGCA AGCCGTATGC CCGGTGCCGG GCGGCACGCG GCTGTGGCTG
CTGTCGCCGC GCCGCGCGAT CGACGCGCTC ATCGCCGGCT GCGAGCTCGA CGGCGCGGCG
CTCGGCAACC GGCGCACGAT CAACTTGCCG GGGCTCTCGG TGACGGTCGA CGACATGATC
GACGCGCTGC GCGAAGTCGC CGGCATCGAA GCGGTGAAGC TGATCCGGCG CGCCGAGGAC
GAGCGCGTCG TGAAGATCGT CGGCAGTTGG CCGGGACGCT GGGACACGTC GCGCGCCGAA
GCGCTCGGCC TCGCGGGCGA CGCGAGCTTC GTCGACGTGA TCCGCGGCTA TCTCGAAGAC
GAGCGGCGAT AA
 
Protein sequence
MKVLITGGAG FLGQRLAKQL LARGKLTGPN GAPRRIDELV LLDVVQAHDF DDARVTARVG 
DIADRAVLEA AIDARTHAVF HLAAIVSGQA EADFDLGMRI NLDASRLLLD VCRARGHRPR
VVFTSSVAVY GGALPELVRD DTALEPQSSY GAQKAIAELL LSDYTRRGFV DGRVLRLPTI
SVRPGRPNAA ASSFASGIVR EPLNGEQAVC PVPGGTRLWL LSPRRAIDAL IAGCELDGAA
LGNRRTINLP GLSVTVDDMI DALREVAGIE AVKLIRRAED ERVVKIVGSW PGRWDTSRAE
ALGLAGDASF VDVIRGYLED ERR