Gene BURPS1106A_0553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0553 
Symbol 
ID4901796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp520529 
End bp521536 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content74% 
IMG OID640133783 
ProductSIS domain-containing protein 
Protein accessionYP_001064836 
Protein GI126452572 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2222] Predicted phosphosugar isomerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.926392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAACG AGGCGCGCGA ATCGGCGCGC GTCGTCGCCG CGCAACTGGC GGACACGCGC 
CGCGTCGAGG CGCTCGCGCA GCACCTCGCC ACGCATGCGC CGCAAGTCGC GCTCACCGTC
GCGCGCGGCA GCTCCGATCA CGCGGCGAGC TACTTCGCGA GCCTGACGAT GAGCCGCCTC
GGCGTGCCCG TCGCGTCGCT GCCGATGTCG GTCGCCACGC TGCAGCAGGC GCCGCTGAAA
GTGCGGGGCC AGCTCGCGCT CGCGTTCTCG CAATCGGGCA AGAGCCCGGA TCTCGTCAAC
ACGATGGCCG CGCTGCGCGA GGCGGGCGCG CTGACGGTGG CCGCCGTCAA CGTGCTGCCG
TCGCCGCTCG CGCACGCGTG CGAGCACCCG TTGCCGCTGC TCGCCGGCCC GGAGCTGTCG
GTCGCCGCGA CGAAGAGCTA CATCGCGATG CTGTCGATTG CCGCGCAGCT CGTCGCGTTC
TGGCAGCGCG ACGCCGCGCT CGCGTCCGCG CTGCGCGGCC TGCCCGACGC GCTCGAGCAG
GCGGGCCGGC TCGACTGGTC GAGCGCCGTC GACGAACTGC GCGACGTCGA GCGGATGATC
GTGATCGGCC GCGGGCTCGG TCTCGCGATC GCGCAGGAGG CGGCGCTCAA GCTGAAGGAG
ACCTCGGGCA TCCAGGCCGA GGCGTTCTCG AGCGCCGAAG TGCGGCACGG CCCGATGGAG
CTGATCGAGC GCGACTACCC GCTGCTCGTG TTCGCGCCGC CCGGGCCCGA GCAGGAGAGC
CTGCTGCAGC TCGCGCGCGA CATGCGCGCG CGCGGCGCGC GCGTGCTGCT CGCCGCGCCG
GCGGGTACGC CCGATGCGAC GCTGCCGCTC GCGCGCACCG CGCACGCGGC GCTCGATCCG
ATCGCCGCGA TCCTCACGTT CTACGTGATG GCGGCCGGGC TCGCGCCCGC GCGCGGCCGC
GATCCCGATG CGCCGCGCCA TCTGCACAAG ATCACCGAAA CACACTGA
 
Protein sequence
MLNEARESAR VVAAQLADTR RVEALAQHLA THAPQVALTV ARGSSDHAAS YFASLTMSRL 
GVPVASLPMS VATLQQAPLK VRGQLALAFS QSGKSPDLVN TMAALREAGA LTVAAVNVLP
SPLAHACEHP LPLLAGPELS VAATKSYIAM LSIAAQLVAF WQRDAALASA LRGLPDALEQ
AGRLDWSSAV DELRDVERMI VIGRGLGLAI AQEAALKLKE TSGIQAEAFS SAEVRHGPME
LIERDYPLLV FAPPGPEQES LLQLARDMRA RGARVLLAAP AGTPDATLPL ARTAHAALDP
IAAILTFYVM AAGLAPARGR DPDAPRHLHK ITETH