Gene Information |
Locus tag | BURPS1106A_A0567 |
Symbol | |
ID | 4905445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 559497 |
End bp | 560714 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640143673 |
Product | chain length determinant protein |
Protein accession | YP_001074603 |
Protein GI | 126458405 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3524] Capsule polysaccharide export protein |
TIGRFAM ID | |
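The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_bp = gene_length_bp(559497, 560714)
protein_aa = protein_length_aa(gene_bp)
print(gene_bp, protein_aa)
```

Under these assumptions the computed values agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields listed in the record.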
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTGAAC TCGAAACCGG TGGCGACGGG CCGGCTGACG TCGGGCAGAG CGCACCCGTC GGCCCGGCGC TCAGCCGCGC CGACGTGCTG ATCGCGCTCG GTCACGGCAA GGGGCTGATC GCGCGCATCG TCGCCGCGAC GGTGCTGCTC GGCATCGCGC TCGCGCTCGT GCTGCCGCCC ATCTATCAGG CGAGCACCGT GCTGCTGCCG CCCGACGAAT CGCGCGGGCT GTTCGGCCAT TCGATGAGCA GCCTCGACGT CATCGCCGGC GCGGCGATGG GCATCGAGAT GAAGACGCCC GGCGAACTGT ATGTCGCGCT GTTGAAGAGC ACGTCGATCG AGGACGGCCT GATCCGGCAG TTCGACCTGC GCAAGCGATA TCGCGTCGAC ACGATGCATG CCGCGCGCAA GGCGCTGCAG TCGCGCGTGA ACATCACGAT CGACAAGAAG TCCGGCCTGC TGACGATCGC GGCCGACGAC ACCGACCCGG CGGTCGCGGC GGAGCTGGCG AACGCACACG TCGCGGCGCT CGCGAAGCTG CTCGAGCGCA TTGCGGTGAC GCAGGCGCAG CAGCGGCGCG CATTCCTCGA AAAGGAGGTG GCCAAGGCGC GCATCGCGCT CGCCAATGCG CAGGACGCGT ATGTGAAGTT GCAGGCGAAA TCCGGCATCG TCAGCGTCGA CGCGGACACG CAGCTCGCGA TCCGGCACAG CGCGGAGATC CGTTCGTTGC TGGCCGCGAA GCAGATCGAG CTGAGCTCGC TCGGCACCTA TGCGACGGCC GAGAATCCGC AGGTCAAGCG CATCGAGGCC GAGGTGTCGA CGCTCAAGGC GCAGCTCGAG AAGATCGAGA ACGGCGACGC CGCGTCGCTC AGGGGATCGG ATGCGGGCAT GGCCACGCTG CGCAGCTACC GTGAAATGAA GTATCAGGAG AGCGTCGTCG ACGTCCTGTC GAGGCAGCTC GAGCTCGCGC GCGTCGACGA GGCGAAGAGC GGGCCGCTCG TGCAGCAGGT CGACGTGGCC GCGCCGCCGG AGCGCAAGGC CAAGCCGTCA CGCCTGCTCA TCCTGCTCGC GAGCGTCGCG GGCGGCTTCG TGCTGGCGGT GACGGCCGTC ATCGGCAGGG CGTTCGGCAG GCAGGCGGTG GAGCGTGCGC GGCGAAGCGG CGACCTCGCG CGCCTCAGGC ATGCGTGGAC GATAACTTTC AAGAGGACGC GATCGTGA
Protein sequence | MAELETGGDG PADVGQSAPV GPALSRADVL IALGHGKGLI ARIVAATVLL GIALALVLPP IYQASTVLLP PDESRGLFGH SMSSLDVIAG AAMGIEMKTP GELYVALLKS TSIEDGLIRQ FDLRKRYRVD TMHAARKALQ SRVNITIDKK SGLLTIAADD TDPAVAAELA NAHVAALAKL LERIAVTQAQ QRRAFLEKEV AKARIALANA QDAYVKLQAK SGIVSVDADT QLAIRHSAEI RSLLAAKQIE LSSLGTYATA ENPQVKRIEA EVSTLKAQLE KIENGDAASL RGSDAGMATL RSYREMKYQE SVVDVLSRQL ELARVDEAKS GPLVQQVDVA APPERKAKPS RLLILLASVA GGFVLAVTAV IGRAFGRQAV ERARRSGDLA RLRHAWTITF KRTRS