Gene BMASAVP1_A1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A1941 
Symbol 
ID4678719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1922491 
End bp1924086 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content77% 
IMG OID639846204 
Productputative carbohydrate kinase 
Protein accessionYP_993259 
Protein GI121601232 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCTGC CTGCCGCCTC CCCGCTGCCC CGTTCGCTCG AACCGTTCGA TGCGCCGCCG 
ATTCATGCGA GCGCGCCGCT CCTGAGCGTC GCCGAGCTGC GCGACATCGA AACCGCGGCG
GCCGCCGCGC TGCCGCCCCA CACGCTGATG GAGCGCGCGG GCAAATCGGC CGCGCAGTGG
CTCGCCGCGC GGCTCGTGAG CGACCCGCGG CCCGTGTGGT TCGCGGTCGG CCCGGGCAAC
AACGGCGGCG ACGCGCTCGT CGCCGCGGCC GAGCTGCGCC GGCTCGGCTT CGCGGCCGAC
GCCTGGATGC CGATCGAGGT GAAGCCTGAC GACGCGCGCT GGGCGCTCGA GCGCGCGCGC
GCGGCGAACG TGCCGATCGA CGAGGCGGCG CCCGAATCGT TCGACGGCTA CGGCTGGCTC
GTCGACGGGC TGTTCGGCAT CGGCCTCGCA CGGCCGCTCG ACGGCGCGTT CGCCGCGATC
GCGCAGCGCA TCGCGGCGCG CGCGCGGCAC ACCGGCCGCG TGCTCGCGCT CGACGTGCCG
AGCGGCCTCG ACAGCGACAC CGGCGCGCGC GTCGGCGGCG GGACCGCCGT CACGGCCACC
TGCACGCTGT CGTTCATCGC CGCGAAGCCC GGCCTCTATA CCGGCGACGG GCGCGACCTC
GCGGGCGAAA TCCATGTCGC GCCCCTCGAT CTCGGCGAGC CGCCCGCGCC CGCGATCCGG
CTGAACGCGC CCGAGCTCTT CGAGGCGCGC CTGCCCGAGC GCGCGTTCGC ATCGCACAAG
GGCACGTACG GCAGCCTCGG GATCGTCGGC GGAGACACGG GCATGTGCGG CGCGCCGATC
CTCGCCGCGC GCGCGGCGCT CTTCGCCGGC GCGGGCAAGG TCCATGTCGG CTTCGTCGGC
ACGGGCGCGC CGCCGTACGA TCCGCCGTAT CCGGAGCTGA TGCTGCATCC GGCCGACGCG
CTGCCGAGCG CGTCGCTCAC CGCGCTCGCG ATCGGCTGCG GGCTCGGCGC GAGCGAGCGC
GCCGCGCGCG TGCTCGCGGC GCTGCTGCCG CTCGATGCGC CGAAGCTCAT CGACGCCGAC
GCGCTGAATC TGATCGCGAC GACGCCCGCG CTCGCGGCGA CGCTCGCCGC GCGCGGCCGC
ACAGGCGACG CCGCCGTCCT CACGCCGCAT CCGCTCGAGG CCGCGCGCCT GCTCGCCACC
GACGCGGCCG ACGTCCAGCG CGACCGCGTC GCCGCCGCGC GCGCGCTCTG CGCGCGCTTC
TCGGCGGTCG TCGTGCTGAA AGGGTCCGGC ACCGTGATCG CGGCGCCGGA CGGCCGCCTC
GCGATCAATC CGACCGGCAA CGCGGCGCTC GCCACCGGCG GCACGGGCGA CGTGCTGGGC
GGCCTGATCG GCGCGTTTCT TGCGCAGCGG ATGCCGCGCT ACGAAGCGGC GCTCGCGGGC
GTCTACCTGC ACGGGCTCGC CGCCGAGCGG CTGTGCGCGG CGGGCGCGGG CCCGGCCGGC
CTCGCCGCGG GCGAACTCGC GCCCGCCGTG CGCGCGCTCG TCAATCGGCT GTTTTATACG
CGGCCCGCCG CGCCGGACGA AGCGCCGCTA TACTGA
 
Protein sequence
MILPAASPLP RSLEPFDAPP IHASAPLLSV AELRDIETAA AAALPPHTLM ERAGKSAAQW 
LAARLVSDPR PVWFAVGPGN NGGDALVAAA ELRRLGFAAD AWMPIEVKPD DARWALERAR
AANVPIDEAA PESFDGYGWL VDGLFGIGLA RPLDGAFAAI AQRIAARARH TGRVLALDVP
SGLDSDTGAR VGGGTAVTAT CTLSFIAAKP GLYTGDGRDL AGEIHVAPLD LGEPPAPAIR
LNAPELFEAR LPERAFASHK GTYGSLGIVG GDTGMCGAPI LAARAALFAG AGKVHVGFVG
TGAPPYDPPY PELMLHPADA LPSASLTALA IGCGLGASER AARVLAALLP LDAPKLIDAD
ALNLIATTPA LAATLAARGR TGDAAVLTPH PLEAARLLAT DAADVQRDRV AAARALCARF
SAVVVLKGSG TVIAAPDGRL AINPTGNAAL ATGGTGDVLG GLIGAFLAQR MPRYEAALAG
VYLHGLAAER LCAAGAGPAG LAAGELAPAV RALVNRLFYT RPAAPDEAPL Y