Gene BURPS668_A3170 details


Gene Information

Locus tag: BURPS668_A3170
Symbol:
ID: 4887681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2998114
End bp: 2999982
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 74%
IMG OID: 640133106
Product: heptosyltransferase family protein
Protein accession: YP_001064161
Protein GI: 126444459
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID:
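
The coordinates and lengths in this record are consistent with each other. The short sketch below checks that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length; the record does not state either convention explicitly.

```python
# Values copied from the gene record.
start_bp = 2998114
end_bp = 2999982

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1869 622
```

Both results match the Gene Length (1869 bp) and Protein Length (622 aa) fields.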


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCCCG CCGAGCGCGC GATGACGCTT CACGCGTGGC GGCGCGCGCG CCGCATTCTG 
TGCGTGAGGC TCGACAACAT GGGCGACGTG CTGATGACGA CGCCCGCGTT GCGCGCGCTG
AAGGAAAGCG GCGAAGGGCG GCACTTGACG CTGCTCACGT CGAGCGCGGC CGCGCCGCTC
GCCGCGCATC TGCCGATGAT CGACGACGTG TGGATCTATG ATGCGCCGTG GGTCAAGCAT
CCGGGCGCGG ACGACGGCGC GGCCGCGGAC GTCGCGATGA TCGAGCGGCT CCTGTGCGGC
GGCTTCGACG CCGCGGCGAT CTTCACCGTC TACAGCCAGA GCCCGCTGCC CGCCGCGATG
ATGTGCCGGC TCGCGGGGAT CGCGCTGCGG CTCGCGCATT GCCGCGAGAA TCCGTACCGG
CTGCTGACCG AGCGCATCGC GGAAACGGAG CCGCAGGTGC AGTTGCGCCA TGAAGTGGCG
CGGCAACTGG CGCTCGTGCG CGAGGTGGGC GCGACGACGC GCGACATGCG GCTCGCGTTC
GAGCCGGGCG ACGCCGCGCG GCGCGCTGTG CGCGCGCGGT TGCGTGCGGC GCGGCTTGCG
TGGCGCGCGC AGGCGGAGGG GGGCGCGGAA GGCGATGGGC GGGATCGCGC GCAGGCGCAC
GGGGCGGGGG CTTGCCTCGA GCCGGACGAC GCGGCGGCTT CGGCTGCTTC GGCTGCTTCG
GCGCGTTCGG CTGCGGCCGA GGCGAATGGC TCCGACGATT CGTTTGCGGC CCACGGGGCA
AATACTTCAA TCGCTGCGAC TGCTTCGGTT GTTCCGGCTG CCGCGGCTGC ATCGGCCCAG
TCGACTGCAT CGACCGATTC GACCGATTCG ACCGATTCGG CTGATGCGGC TGATGCGGCT
GATGCGGCTG ATTCGATCGC TTCAGCCGCT GCGGTTGCCG CAAGCAATCC GAACGATTCG
ACCGATCCGC CCACCGACCT CGGCCGCTGG CTCGTCGTTC ATCCGGGCGC GAGCGCGGCG
TCGCGGCGTT GGCCCGCCGA GCGCTTCGCG GACGTCGTCG CGCGCGTCGC CCGGTATTTC
GACGGCGTAG CGGTGACGGG CGGCGCGCAC GAGCGCGCGC TCGTCGAGAC TGTTTGCGCG
CGCGCCGGCA AGCGCGTGCT GCCATTGGCC GGCGCGCTGT CGATCGGCGA GCTCGGCGCG
CTGATCGAAG CGGCCGACCT GCTGCTCGCG AACAACAGCG GCCCCGTGCA TCTCGCGGCG
GCGCTGGGCA CGCCGGTGGT CGATCTGTAT GCGCTGACGA ATCCGCAGCA CACGCCGTGG
CGCGTGCCGA GCCGCGTGCT GAACGCCGAC GTGCCGTGCC GTCACTGTTA CCGCAGCGTC
TGCGATCAGC CGGGCCATCC TTGCCTCGAG CGCGTGAGCG TGGACGACGT CGTCGCGGCC
GTGCGCGAAC TGATGCGCGA AACCACGGGT TCGCACGGCG CGGGCGGCGG CGGCGCGGTG
TCGCCGATTC GCGGCGGCGC AGCGCGGCAC GGTACGCACG CCGGCTCCGT GCGCGGCGCG
GCCGCGCCCT GCGCCGTGCG GGGCGCCGCG CATGCGCGCG ACGCATCGGA GCCCGCCGTG
AGCGCGTCGG CGCTTGCCGT GCGCGCGGCC GGCGCGCACG CATCCGACGC GGGCTTGTCC
GCGGCCGGGG CATCCGCTCG CGGCGTATCC GCTGCCGGCG TGCCCGTCGA GGGCGGACCG
CTTGCCGACC CGCCGCACGA CGCACTGCCC ACGCCGCCCA TGCCGCCCCC ACGCGCCGCC
GTCCCCTCAA CCGCTTCGCC GCGCGCGGCG AACGTCGTTC CGATCACCTC CGCGACCGCA
AGGACCTGA
 
Protein sequence
MSPAERAMTL HAWRRARRIL CVRLDNMGDV LMTTPALRAL KESGEGRHLT LLTSSAAAPL 
AAHLPMIDDV WIYDAPWVKH PGADDGAAAD VAMIERLLCG GFDAAAIFTV YSQSPLPAAM
MCRLAGIALR LAHCRENPYR LLTERIAETE PQVQLRHEVA RQLALVREVG ATTRDMRLAF
EPGDAARRAV RARLRAARLA WRAQAEGGAE GDGRDRAQAH GAGACLEPDD AAASAASAAS
ARSAAAEANG SDDSFAAHGA NTSIAATASV VPAAAAASAQ STASTDSTDS TDSADAADAA
DAADSIASAA AVAASNPNDS TDPPTDLGRW LVVHPGASAA SRRWPAERFA DVVARVARYF
DGVAVTGGAH ERALVETVCA RAGKRVLPLA GALSIGELGA LIEAADLLLA NNSGPVHLAA
ALGTPVVDLY ALTNPQHTPW RVPSRVLNAD VPCRHCYRSV CDQPGHPCLE RVSVDDVVAA
VRELMRETTG SHGAGGGGAV SPIRGGAARH GTHAGSVRGA AAPCAVRGAA HARDASEPAV
SASALAVRAA GAHASDAGLS AAGASARGVS AAGVPVEGGP LADPPHDALP TPPMPPPRAA
VPSTASPRAA NVVPITSATA RT
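
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the pipeline that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons; because this gene begins with ATG, no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# These assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGAGCCCCG CCGAGCGCGC GATGACGCTT"))  # MSPAERAMTL
```

The output matches the first ten residues of the protein sequence, and translating the last nine bases (AGGACCTGA) gives RT followed by a stop, matching the end of the protein.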