Gene BURPS668_3088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3088 
Symbol 
ID4884777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3029716 
End bp3030822 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content66% 
IMG OID640129016 
Productglycosyl transferase, group 4 family protein 
Protein accessionYP_001060100 
Protein GI126439021 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAGCT TCGCCGTCGG CTTCATCGTC TCGCTTCTCG TCACGCTGCT CATCGTCCGC 
TATGCGCACC TGCACGAACG ATTCTCGATC GACAACGATC TTGCCGGCGT GCAGAAATTC
CATGCGCGGC CGGTGCCGCG CGTGGGCGGC ACCGGCATCC TGATCGGGCT CGTCGTCGCG
ACGGCGCTGC TGTCGCGGCG ATACCCGGCG ATCGCGGGCG GCATCCTCGG GCTCGCCGCG
TGCGGGCTGC CCGCCTTCGC CTCCGGGCTG ATCGAAGACC TGACGAAGAA GGTGACGCCC
GCGGTGCGGC TCGTCTGCAC GATGGCGGCC GCGGCGCTCG CGTTCGCGCT GATGGGCATC
GCGATCACGC GCATCAGCGT GCCGCCCCTC GACTTCCTGC TCGGCTATAC GGCGATCTCG
GCCGCGGTCA CGGTGCTCGC CGTCGCCGCG CTCGCGAACG CGGTCAACAT CATCGACGGC
TTCAACGGCC TCGCGTCGAT GGTCGCGTTC ATGATGTTCG CGTCGCTCGC GTACGTCGCG
TTCCAGGTCG GCGACCCGGT CGTGATGTCC GGCTCGATCG TGATGATGGG CGCGATCATG
GGCTTTTTCA TCTGGAACTT CCCGGCGGGC CTCATCTTCC TCGGCGACGG CGGCGCGTAC
TTCATCGGCT TCATGCTCGC CGAGCTCGCG ATCTCGCTCG TGATGCGGCA CCGCGAAGTG
TCCGCGTGGT ATCCGGTGCT GCTGTTCATG TACCCGATCT TCGAGACCTG CTTCTCGATC
TACCGGAAGA AATTCGTTCG CGGCATGTCG CCGGGCATCC CGGACGGCGT GCATCTGCAC
ATGCTCGTCT ACAAGCGGCT GATGCGCTGG GCGGTGGGCA CGCGCGCCGC GCACGAGCTC
ACGCGCCGGA ACTCGCTGAC CTCGCCCTAT CTATGGCTGC TCTGCCTCGT CGCGGTGATC
CCCGCCACCC TGTTCTGGCA GCATACGATC CACCTGTTCG CGTTCGTGAT CGTGTTCGCG
CTCACTTACG TGTGGCTCTA CGTAAGCATC GTCCGGTTCA AGTCGCCGAG ATGGATGGTG
ATCCGCAAGC GGCTGCCGAA ACGGTGA
 
Protein sequence
MLSFAVGFIV SLLVTLLIVR YAHLHERFSI DNDLAGVQKF HARPVPRVGG TGILIGLVVA 
TALLSRRYPA IAGGILGLAA CGLPAFASGL IEDLTKKVTP AVRLVCTMAA AALAFALMGI
AITRISVPPL DFLLGYTAIS AAVTVLAVAA LANAVNIIDG FNGLASMVAF MMFASLAYVA
FQVGDPVVMS GSIVMMGAIM GFFIWNFPAG LIFLGDGGAY FIGFMLAELA ISLVMRHREV
SAWYPVLLFM YPIFETCFSI YRKKFVRGMS PGIPDGVHLH MLVYKRLMRW AVGTRAAHEL
TRRNSLTSPY LWLLCLVAVI PATLFWQHTI HLFAFVIVFA LTYVWLYVSI VRFKSPRWMV
IRKRLPKR