Gene BURPS668_A0669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0669 
Symbol 
ID4886760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp642733 
End bp643989 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content61% 
IMG OID640130609 
Productheptosyltransferase 
Protein accessionYP_001061668 
Protein GI126443611 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.822838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGATG AGCGCGCGGC GAAGCGATGG CCCGCGCGCG GCGAAGACAT GAAAATACTG 
TTCATCAAGC TCTCCGCATT GGGCGACGTG CTGGCCAGCA CGCCGCTGTT CGCGTCGACC
AAGGCCGAGC ATCCGGACTG GTTCGTCGGG CACGTGGTGG CGCGTCCCTA TGCGGCTGCG
ACGCGAAACA ATGCGCACGT CGATGCGCAA TTCATTGTCG ATTCGCCGCT GTCCGGTGGC
GCGATTCGGA AAATCAGGGT GGCGGCGCGA ATATGGCGGT ACATGATGCG CGAACGATAC
GACATCGCGG TTGTGCTGCA TCGGAGTTTT GTACTACAAC TCATATGTCG CCTCGCATCG
GTCAGGAAAA CCATCGGCTA TGAAAGCCGA TTCTCATTCC TATTGAGCCA CTCCATTCCG
TTTTCGATGC AGGGAAATCG AAGCGGGCTG GAATTGCGTT TGCTGAAGTC GGCGGGAATC
ATCCACGACG AAAAGAAGAA ATTGAGGTTC GACATCGATT TCGGGAACGT GGACCGAAAC
CGGCTGCGCG CGTTGCCCGC CGCGTTCATC GCCGTCAACG CGGGCGGCGG CAACGCGGAT
GCGCAGGCAG CCAACAAGCT GTGGCCCGCC GAGCGTTACG GTGCATTGAT CAAGCGGTTG
CCGTTGCCGG TCGTGATGCT CGGACACGGC GCGGCGGATG AAGACATCAG GGATCGGGTC
GCGGCGACGG GGGCGAGGTT CGTCGACATG GTCGGCAAGA CGAATCTCGA CGAGACGGCG
GTCATCCTCG AACGCTCGCG TCTGTATGTG GGCAACGACA GCGCGCTTTT GTATCTCGCG
GCATCGCTCG GCGTGACGAC GATCGGGATC TACGGGCCTA CCGATCCCGC CGCGTTCAGT
CCGTTGGGCG CGAACAATCT GTGGCTGAGT GGCAAGACGT CCTGTGCACC GTGTTATTCG
TCGTTCGACG GGATCGGCGG GCGCATGTAC ACGTGCACGA ACAACATTTG CATGCAGGCC
GTTACGGTCG AATCCGTCAG CGAGAGAATC CATGCAGCCC TCCATCAAGA TCAGAATCTA
CAAGCGGCTG GACCGGATGT TGTCGGATCT GGTCAGGGCC GTGCCGCATC CGAAACGCGC
GCTCGGGCGG ACACCGACGC GCGTGCTGAT CATCAAGCTC TCGGCGATGG GGGATTCGCT
GTGCCTCTTT CCCACCGTTC GGCAACTGGC GCTCGCGTTC CCGGGGGCGA CGATTGA
 
Protein sequence
MADERAAKRW PARGEDMKIL FIKLSALGDV LASTPLFAST KAEHPDWFVG HVVARPYAAA 
TRNNAHVDAQ FIVDSPLSGG AIRKIRVAAR IWRYMMRERY DIAVVLHRSF VLQLICRLAS
VRKTIGYESR FSFLLSHSIP FSMQGNRSGL ELRLLKSAGI IHDEKKKLRF DIDFGNVDRN
RLRALPAAFI AVNAGGGNAD AQAANKLWPA ERYGALIKRL PLPVVMLGHG AADEDIRDRV
AATGARFVDM VGKTNLDETA VILERSRLYV GNDSALLYLA ASLGVTTIGI YGPTDPAAFS
PLGANNLWLS GKTSCAPCYS SFDGIGGRMY TCTNNICMQA VTVESVSERI HAALHQDQNL
QAAGPDVVGS GQGRAASETR ARADTDARAD HQALGDGGFA VPLSHRSATG ARVPGGDD