Gene BURPS668_A2941 details


Gene Information

Locus tag: BURPS668_A2941
Symbol:
ID: 4888773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2788567
End bp: 2789607
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 73%
IMG OID: 640132877
Product: Fatty-acid desaturase
Protein accession: YP_001063932
Protein GI: 126442549
COG category: [I] Lipid transport and metabolism
COG ID: [COG1398] Fatty-acid desaturase
TIGRFAM ID:
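
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2788567
end_bp = 2789607
gene_length_bp = 1041
protein_length_aa = 346

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: exactly one terminal stop codon, which is not translated.
coding_codons = codons - 1

print(span_bp, codons, remainder, coding_codons)  # 1041 347 0 346
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.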


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCACG CGAGCGCCGA GAGCCCCGAT AGCGCCGGCT GCGCCGAGAC GCCGGCGCGC 
GCGCCGGCCG GCTCGGCGGC GGATACGGCC GCCCCGCCCG CGCCCGACGA GCGCGCCTCC
GGCCATCTGT CGCGCGCGTC GTCCGCGCGC CATCTGGGCG TCGCGGCGCT GCCCGCCGCC
GGCACGGCCG CCGCGATCGC GCTCTGGGCC GGGTTCGGCC TCGCGCCGCG CGTGCAGGAC
ATCGCCATGC TCGCGGTCTT CTACGTGCTC AACATTCTCG GCATGGAGCT CGCGCTGCAC
CGCTATTTCG CGCATCGCAC GTTCAAGGCG AAGCCGGCCG TGAAGATCGC GCTCGCGATC
CTCGGCTCGC TCGCGTACAT GGGGCCGCTG ATGTGGTGGG TGGCGATCCA CCGGCTGCAT
CACGCGAACG CCGACCGGCC GGGCGACCCG CACACGCCGC AACTCGGCGG GCGCGGCTTC
GCCGGCCGCG CGAAGGGCAT CCTGCACGGG CACGTCGGCT GGCTGTTCGA TCCATCGTCC
GCGCGCCCGA AGGGCTGGAA CCAATATGCG AACGACATGT ACCGCGACCC GACGCTGCTG
CGCATCCATC TCGCGTACGA CTACTGGCTG CTGCTCGGCC TGCTGCTGCC GGCCGCGCTC
GGCTGGCTGC TCGATCCTTC GTGGCGGGGC GCGCTGCTCG GCCTGCTGTG GGGCGGCACC
GTGCGGATCT TTCTCGCGAC GAACGCGATC TGGGCGGTCA ATTCGATCGG CCACGCGCTC
GGCGGCCGGC GGCCGTTTCC CGGCCGCGAC CAGAGCCGCA ACGCGGCGTG GCTCGCGCTC
GTCACGCTCG GCGCGGGCTG GCACAACAAC CATCACGCGT TTCCGCAGTA TGCGAGCACG
CGCCTGACCC GCTGGCAGAT CGACGTGACC GGCATGCTGA TCGCGCTGCT CGAACGGCTG
GGGCTCGTGT GGGACGTTCA GCACCCGGAC CGGAACGCGG TGCGCGAGCG GCTCGCGAAC
GCACGGCGCG ACGACGCGTA G
 
Protein sequence
MKHASAESPD SAGCAETPAR APAGSAADTA APPAPDERAS GHLSRASSAR HLGVAALPAA 
GTAAAIALWA GFGLAPRVQD IAMLAVFYVL NILGMELALH RYFAHRTFKA KPAVKIALAI
LGSLAYMGPL MWWVAIHRLH HANADRPGDP HTPQLGGRGF AGRAKGILHG HVGWLFDPSS
ARPKGWNQYA NDMYRDPTLL RIHLAYDYWL LLGLLLPAAL GWLLDPSWRG ALLGLLWGGT
VRIFLATNAI WAVNSIGHAL GGRRPFPGRD QSRNAAWLAL VTLGAGWHNN HHAFPQYAST
RLTRWQIDVT GMLIALLERL GLVWDVQHPD RNAVRERLAN ARRDDA
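
The protein sequence above is the translation of the gene sequence under translation table 11. The following is a minimal sketch of how that relationship could be checked; it is not the tool used to produce this record. The helper names (`clean`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignment string is used here. The sketch strips the 10-base spacing used in the sequence blocks and stops at the first stop codon.

```python
from itertools import product

# Codon order: first, second, third base each cycling through T, C, A, G.
BASES = "TCAG"
# Amino acid assignments shared by the standard code and table 11 ('*' = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_block = "ATGAAGCACG CGAGCGCCGA GAGCCCCGAT"
# Final line of the gene sequence; it starts at base 1021, so it is in frame.
last_block = "GCACGGCGCG ACGACGCGTA G"

print(translate(first_block))  # MKHASAESPD
print(translate(last_block))   # ARRDDA
```

The two outputs match the first ten and last six residues of the protein sequence listed in the record. Passing the full gene sequence to `translate` would allow the whole protein to be compared in the same way.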