Gene BURPS668_2271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2271 
Symbol 
ID4882108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2254887 
End bp2255951 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content66% 
IMG OID640128200 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_001059307 
Protein GI126440518 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.519766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTG CGCAGATCGC TCCGTTGACC GAATCGGTGC CGCCGAAGCT CTACGGCGGC 
ACCGAGCGCG TCGTGTCGTA CATCACCGAG GCGCTCGTCG ATCTGGGGCA CGACGTGACG
CTGTTCGCGA GCGGCGATTC GATCACGCGC GCGAAACTCG ACGCAGTATG GCCGCGCGCA
TTGCGACTGG ATGCGTCGAT CCGCGACCGG ATCGCGCCGC ACATGCTGCT GATGGAGACC
GTCGCGCGCC GCGCGCGGGA TTTCGACGTG CTCCATTTCC ACATGGATTA CTACTCGTTC
TCGATCTTCA AGCGGCAGGA CACGCCGTTC GTGACGACGC TGCACGGCCG CCTCGATTTG
CCGGAGCAGC AGCCGGTGTT CGACACGTTC GACACCGCGC CCGTGATCTC GATCTCGAAC
GCGCAGCGCC ACCCGATGCC GCAGGCGAAA TGGCTGACAA CCGTCTATCA CGGGCTGCCG
GAGACGCTCT ACACGCCGCA GCCCGTCGAG CAGTCGTATC TTGCGTTCCT TGGCCGGATC
TCGCCGGAAA AGCGCGTCGA CACCGCGATC CGGATCGCGC AGCGCTGCGG GATGCGCATC
CGCATCGCGG CGAAGATCGA CGCGGCGGAC GAGGAGTACT TCGAGCGCGA GATCAAGCCG
CTCCTCGCGC TGCCGCACGT CGAATACATC GGCGAAATCG CCGATCACGA GAAGGCGGCG
TTCCTGTCCG GCGCGCACGC GCTGCTGTTT CCGATCGACT GGCCCGAGCC GTTCGGCCTC
GTGATGATCG AGGCGATGGC GTGCGGCACG CCCGTCATCG CGTTCAATCG CGGCGCGGTG
CCGGAGGTGA TCGACGAGGG CGTGTCGGGC TTCATCGTCG AGGACGAGAT CGGCGCCGCC
GCGGCGGTGA ACCGGCTGCA CATGCTGTCG CGCGAGCGGG TGCGCGCGCG CTTCGACGAG
CGTTTCACTT CGCGCCGGAT GGCGCAGCAA TACGTCGACG TCTATCAATC GCTGATCCGC
GCGCAGAAGC GCTCGCGCTT CAAGGTGATC GATTCGGCGA CTTGA
 
Protein sequence
MRIAQIAPLT ESVPPKLYGG TERVVSYITE ALVDLGHDVT LFASGDSITR AKLDAVWPRA 
LRLDASIRDR IAPHMLLMET VARRARDFDV LHFHMDYYSF SIFKRQDTPF VTTLHGRLDL
PEQQPVFDTF DTAPVISISN AQRHPMPQAK WLTTVYHGLP ETLYTPQPVE QSYLAFLGRI
SPEKRVDTAI RIAQRCGMRI RIAAKIDAAD EEYFEREIKP LLALPHVEYI GEIADHEKAA
FLSGAHALLF PIDWPEPFGL VMIEAMACGT PVIAFNRGAV PEVIDEGVSG FIVEDEIGAA
AAVNRLHMLS RERVRARFDE RFTSRRMAQQ YVDVYQSLIR AQKRSRFKVI DSAT