Gene BURPS1710b_A1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1968 
Symbol 
ID3693766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2408420 
End bp2409586 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content64% 
IMG OID637732222 
Productputative heptosyltransferase (O-antigen related) 
Protein accessionYP_337119 
Protein GI76819408 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.78963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCCT CCATCAAGAT CAGAATCTAC AAGCGGCTGG ACCGGATGTT GTCGGATCTG 
GTCAGGGCCG TGCCGCATCC GAAACGCGCG CTCGGGCGGA CACCGACGCG CGTGCTGATC
ATCAAGCTCT CGGCGATGGG GGATTCGCTG TGCCTCTTTC CCACCGTTCG GCAACTGGCG
CTCGCGTTCC CGGGCGCGAC GATTGACTGG CTGACGTCCA ATCGCTCCAA TCCCGCGCTG
TTCGCCGAGC TGCCGTTCAT CCGCACGATC TTCGTCACGC CGCCGTCGCT CGTGCGCGCG
CTGACGTATC TGGGCGGGTG GCTCTTCAGG GCGCGCCGCT ACGACCTCAC GATCGATTAC
GACCAGTACT ACTGCATATC CGAGCTGATC GCGGGCATGT CGGCGTGCAG CGCGGGTTTC
AGGACGTCGC TCAAGGGCAC GACGTTCTCG CTGAGCGTCG AATACGATCC GCTGCTCAAC
GAGAAGGCGA TGTTCCGCAA GCTGACCGAG CGCGTCTTCG CGACGTACGG CGTCTCGATT
CCCGATTACC GGGCGGAACT GCCGGAACTG ATCGAGCGAT TCGTGCCGAG CGCGCAATTG
CAGACGTTGC GCGCGCGGCT GAAGGCGCAG GGCAAGCCGA TCGTCGGCAT TTATCCCGGC
TCGGGCGCGA ACGCGACGTT CAGGCGCTGG GGCGTCGGCA ATTACGTGGC GCTGATCGAG
CGCTACAAGG ATCGCTACGC GTTCGTGCTC CTCGGCGGCC CGGACGAGCG TGACCTGCAG
GCGGACCTGA AGGATATCGA CGGCGTGTTC AATCTGATCG ATTCGATGTC GTTCAAGGAG
GTCGCGTGGT TCCTGAAGCA TACGATCGAC CTGCTCGTCG GCAACGACGG CGGGCTGCTG
CACGTCGCCG AGAGCCAGGC GGTGGCGACC GTGGGGATAT TCGGGCCCGC GCTGTACCGG
AAGTGGGGGT CGTCGCTGGA GCGTTCGATC GGCGTCGAGA AGGAACTGCC GTGCCGGCCG
TGTCTGAAGA ACTATCTCGG CACCGTGCCG TCGGCGTGTT GTCTCGGCAC CACCGCATGC
CTGAGCGCGA TCTCGACCGA CGACGTTGCG CAGGCGATGC ATCGCGCCGT CCATCGGATT
CACGTCGTGC CGATCGCGCA TGCTTGA
 
Protein sequence
MQPSIKIRIY KRLDRMLSDL VRAVPHPKRA LGRTPTRVLI IKLSAMGDSL CLFPTVRQLA 
LAFPGATIDW LTSNRSNPAL FAELPFIRTI FVTPPSLVRA LTYLGGWLFR ARRYDLTIDY
DQYYCISELI AGMSACSAGF RTSLKGTTFS LSVEYDPLLN EKAMFRKLTE RVFATYGVSI
PDYRAELPEL IERFVPSAQL QTLRARLKAQ GKPIVGIYPG SGANATFRRW GVGNYVALIE
RYKDRYAFVL LGGPDERDLQ ADLKDIDGVF NLIDSMSFKE VAWFLKHTID LLVGNDGGLL
HVAESQAVAT VGIFGPALYR KWGSSLERSI GVEKELPCRP CLKNYLGTVP SACCLGTTAC
LSAISTDDVA QAMHRAVHRI HVVPIAHA