Gene BURPS1106A_2278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2278 
Symbol 
ID4901919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2263797 
End bp2264822 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content62% 
IMG OID640135507 
ProductNAD-dependent epimerase/dehydratase family protein 
Protein accessionYP_001066542 
Protein GI126452402 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.617261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGGCT TCATCGGTCA TCACCTGTCC AAGCGCATTC TTGAAACCAC CGATTGGGAA 
GTGTTCGGCA TGGACATGCA GACCGACCGG CTCGGCGATC TCGTCAAGCA CGAGCGGATG
CATTTCTTCG AAGGCGACAT CACGATCAAC AAGGAGTGGG TCGAGTATCA CGTGAAGAAG
TGCGACGTGA TCCTGCCGCT CGTCGCGATC GCGACGCCCG CCACCTACGT CAAACAGCCG
CTGCGCGTGT TCGAGCTCGA CTTCGAGGCG AACCTGCCGA TCGTGCGCTC GGCCGTCAAG
TACGGCAAGC ATCTCGTGTT TCCGTCGACG TCCGAGGTCT ATGGCATGTG CGCGGACGAG
CAGTTCGACC CGGATGCGTC CGCCCTCACC TACGGCCCGA TCAACAAGCC GCGCTGGATC
TACGCGTGCT CGAAGCAACT GATGGACCGC GTGATCTGGG GCTACGGGAT GGAAGGCCTG
AACTTCACGC TGTTCCGCCC GTTCAACTGG ATCGGCCCGG GCCTCGACTC GATCTACACG
CCGAAGGAAG GCAGCTCGCG CGTCGTCACG CAGTTCCTCG GCCACATCGT GCGCGGCGAG
AACATCAGCC TCGTCGACGG CGGCTCGCAA AAGCGCGCGT TCACGTACGT CGACGACGGC
ATCAGCGCAC TGATGAAGAT CATCGAGAAT TCGAACGGCG TCGCGACGGG CAAGATCTAC
AACATCGGGA ATCCGAACAA TAACTTCTCG GTGCGCGAAC TCGCGAACAA GATGCTCGAG
CTCGCGGCGG AGTTCCCCGA GTACGCCGAT TCGGCCAAGC GCGTGAAGCT CGTCGAGACG
ACCTCGGGCG CGTACTACGG CAACGGCTAT CAAGACGTGC AGAACCGCGT GCCGAAGATC
GAGAACACGA TGCAGGAGCT CGGCTGGGCA CCGCAGTTCA CGTTCGACGA CGCGCTGCGC
CAGATCTTCG AGGCGTACCG CGGCCACGTC GCCGACGCGC GCGCGCTCGT CGAGCAGCAC
GGCTGA
 
Protein sequence
MNGFIGHHLS KRILETTDWE VFGMDMQTDR LGDLVKHERM HFFEGDITIN KEWVEYHVKK 
CDVILPLVAI ATPATYVKQP LRVFELDFEA NLPIVRSAVK YGKHLVFPST SEVYGMCADE
QFDPDASALT YGPINKPRWI YACSKQLMDR VIWGYGMEGL NFTLFRPFNW IGPGLDSIYT
PKEGSSRVVT QFLGHIVRGE NISLVDGGSQ KRAFTYVDDG ISALMKIIEN SNGVATGKIY
NIGNPNNNFS VRELANKMLE LAAEFPEYAD SAKRVKLVET TSGAYYGNGY QDVQNRVPKI
ENTMQELGWA PQFTFDDALR QIFEAYRGHV ADARALVEQH G