Gene BURPS1106A_A2787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2787 
Symbol 
ID4904565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2716638 
End bp2717678 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content73% 
IMG OID640145890 
Productfatty acid desaturase family protein 
Protein accessionYP_001076816 
Protein GI126456520 
COG category[I] Lipid transport and metabolism 
COG ID[COG1398] Fatty-acid desaturase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCACG CGAGCGCCGA GAGCCCCGAT AGCGCCGGCT GCGCCGAGAC GCCGGCGCGC 
GCGCCGGCCG GCTCGGCGGC GGATACGGCC GCCCCGCCCG CGCCCGACGA GCGCGCCTCC
GGCCATCTGT CGCGCGCGTC GTCCGCGCGC CATCTGGGCG TCGCGGCGCT GCCCGCCGCC
GGCACGGCCG CCGCGATCGC GCTCTGGGCC GGGTTCGGCC TCGCGCCGCG CGTGCAGGAC
ATCGCCATGC TCGCGGTCTT CTACGTGCTC AACATTCTCG GCATGGAGCT CGCGCTGCAC
CGCTATTTCG CGCATCGCAC GTTCAAGGCG AAGCCGGCCG TGAAGATCGC GCTCGCGATC
CTCGGCTCGC TCGCGTACAT GGGGCCGCTG ATGTGGTGGG TGGCGATCCA CCGGCTGCAT
CACGCGAACG CCGACCGGCC GGGCGACCCG CACACGCCGC AACTCGGCGG GCGCGGCTTC
GCCGGCCGCG CGAAGGGCAT CCTGCACGGG CACGTCGGCT GGCTGTTCGA TCCGTCGTCC
GCGCGCCCGA AGGGCTGGAA CCAATATGCG AACGACATGT ACCGCGACCC GACGCTGCTG
CGCATCCATC TCGCGTACGA CTACTGGCTG CTGCTCGGCC TGCTGCTGCC GGGCGCGCTC
GGCTGGCTGC TCGATCCTTC GTGGCGGGGC GCGCTGCTCG GCCTGCTGTG GGGCGGCACC
GTGCGGATCT TTCTCGCGAC GAACGCGATC TGGGCGGTCA ATTCGATCGG CCACGCGCTC
GGCGGCCGGC GGCCGTTTCC CGGCCGCGAC CAGAGCCGCA ACGCGGCGTG GCTCGCGCTC
GTCACGCTCG GCGCGGGCTG GCACAACAAC CATCACGCGT TTCCGCAGTA TGCGAGCACG
CGCCTGACCC GCTGGCAGAT CGACGTGACC GGCATGCTGA TCGCGCTGCT TGAACGGCTG
GGCCTCGTGT GGGACGTTCA GCACCCGGAC CGGGACGCGG TGCGCGAGCG GCTCGCGAAC
GCACGGCGCG ACGACGCGTA G
 
Protein sequence
MKHASAESPD SAGCAETPAR APAGSAADTA APPAPDERAS GHLSRASSAR HLGVAALPAA 
GTAAAIALWA GFGLAPRVQD IAMLAVFYVL NILGMELALH RYFAHRTFKA KPAVKIALAI
LGSLAYMGPL MWWVAIHRLH HANADRPGDP HTPQLGGRGF AGRAKGILHG HVGWLFDPSS
ARPKGWNQYA NDMYRDPTLL RIHLAYDYWL LLGLLLPGAL GWLLDPSWRG ALLGLLWGGT
VRIFLATNAI WAVNSIGHAL GGRRPFPGRD QSRNAAWLAL VTLGAGWHNN HHAFPQYAST
RLTRWQIDVT GMLIALLERL GLVWDVQHPD RDAVRERLAN ARRDDA