Gene BURPS1106A_A0567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0567 
Symbol 
ID4905445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp559497 
End bp560714 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content69% 
IMG OID640143673 
Productchain length determinant protein 
Protein accessionYP_001074603 
Protein GI126458405 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAC TCGAAACCGG TGGCGACGGG CCGGCTGACG TCGGGCAGAG CGCACCCGTC 
GGCCCGGCGC TCAGCCGCGC CGACGTGCTG ATCGCGCTCG GTCACGGCAA GGGGCTGATC
GCGCGCATCG TCGCCGCGAC GGTGCTGCTC GGCATCGCGC TCGCGCTCGT GCTGCCGCCC
ATCTATCAGG CGAGCACCGT GCTGCTGCCG CCCGACGAAT CGCGCGGGCT GTTCGGCCAT
TCGATGAGCA GCCTCGACGT CATCGCCGGC GCGGCGATGG GCATCGAGAT GAAGACGCCC
GGCGAACTGT ATGTCGCGCT GTTGAAGAGC ACGTCGATCG AGGACGGCCT GATCCGGCAG
TTCGACCTGC GCAAGCGATA TCGCGTCGAC ACGATGCATG CCGCGCGCAA GGCGCTGCAG
TCGCGCGTGA ACATCACGAT CGACAAGAAG TCCGGCCTGC TGACGATCGC GGCCGACGAC
ACCGACCCGG CGGTCGCGGC GGAGCTGGCG AACGCACACG TCGCGGCGCT CGCGAAGCTG
CTCGAGCGCA TTGCGGTGAC GCAGGCGCAG CAGCGGCGCG CATTCCTCGA AAAGGAGGTG
GCCAAGGCGC GCATCGCGCT CGCCAATGCG CAGGACGCGT ATGTGAAGTT GCAGGCGAAA
TCCGGCATCG TCAGCGTCGA CGCGGACACG CAGCTCGCGA TCCGGCACAG CGCGGAGATC
CGTTCGTTGC TGGCCGCGAA GCAGATCGAG CTGAGCTCGC TCGGCACCTA TGCGACGGCC
GAGAATCCGC AGGTCAAGCG CATCGAGGCC GAGGTGTCGA CGCTCAAGGC GCAGCTCGAG
AAGATCGAGA ACGGCGACGC CGCGTCGCTC AGGGGATCGG ATGCGGGCAT GGCCACGCTG
CGCAGCTACC GTGAAATGAA GTATCAGGAG AGCGTCGTCG ACGTCCTGTC GAGGCAGCTC
GAGCTCGCGC GCGTCGACGA GGCGAAGAGC GGGCCGCTCG TGCAGCAGGT CGACGTGGCC
GCGCCGCCGG AGCGCAAGGC CAAGCCGTCA CGCCTGCTCA TCCTGCTCGC GAGCGTCGCG
GGCGGCTTCG TGCTGGCGGT GACGGCCGTC ATCGGCAGGG CGTTCGGCAG GCAGGCGGTG
GAGCGTGCGC GGCGAAGCGG CGACCTCGCG CGCCTCAGGC ATGCGTGGAC GATAACTTTC
AAGAGGACGC GATCGTGA
 
Protein sequence
MAELETGGDG PADVGQSAPV GPALSRADVL IALGHGKGLI ARIVAATVLL GIALALVLPP 
IYQASTVLLP PDESRGLFGH SMSSLDVIAG AAMGIEMKTP GELYVALLKS TSIEDGLIRQ
FDLRKRYRVD TMHAARKALQ SRVNITIDKK SGLLTIAADD TDPAVAAELA NAHVAALAKL
LERIAVTQAQ QRRAFLEKEV AKARIALANA QDAYVKLQAK SGIVSVDADT QLAIRHSAEI
RSLLAAKQIE LSSLGTYATA ENPQVKRIEA EVSTLKAQLE KIENGDAASL RGSDAGMATL
RSYREMKYQE SVVDVLSRQL ELARVDEAKS GPLVQQVDVA APPERKAKPS RLLILLASVA
GGFVLAVTAV IGRAFGRQAV ERARRSGDLA RLRHAWTITF KRTRS