Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2132 |
Symbol | |
ID | 4886764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2072872 |
End bp | 2073936 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640132069 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001063126 |
Protein GI | 126443656 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.855918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACC TCGTTCACAC GGCGCTCGCC GCCCTCGCCG ACACCCGCAC GCTAACCGGC ATCGATCTTT CGGATGCCGA TCTTTCCGGC CGCGACCTGT CCGGCTGCAC GTTCGAACGC GTGAGCCTGC GCGGCGCGAA CCTGTCGGCC GCGCAGCTCG ACGCGACGCG CTGGCTGCAC TGCGACCTGA CGGGCGCGCG CCTCGACGGC GCGACGCTCG GCGAATCGAG CTGGCACGCG GTCGCGCTGC GCGCGGCGAG CCTGCGCGCG ACGACGGGCG ACGCATTCGC GATGGCGGAA ACCGACCTCG CCGGCGCGAC GCTGACCGAC GCGCTGTGGG CGCGCGCGAC GTTCGAGCGC GGGGATTTCT CCGCCGCGCA GTGCGGGCGC GCGAAGCTGC TGCGCTGCGA GGCGGCCGAC TGCCGCTTCG AGCGCACCGA TTTCGCGAAC GCCGAGCTCG AGCGCTTCGT CGCGATGCGC GCCGAGCTGT CGAGCGCGCG CTTCGACGCC ACGCGGCTCA CCCACGCGTT CTTCGCCGAA GCGAATCTGC GCGGGCAACG CTTCGAGCGC TGCGATCTGA CGATGACCCA TTTCAGCCGC GCGGCGCTCG CCGGCTGCGA TTTCAGCGGT GCGTCGCTCA TGCAGACGAT GTTCTTCGAC GCGGACCTCG AACGCGCGAC GCTCGCCGGC GCGCGCGGCC GCCACGTGCG TTTCGCGGGC GCGACGCTCG ACGGCGCGAA TCTCGCGCAC GCCGCGTTCG ACGAATCCGA TTTCGCGCGC GCGCGGCTGC GGACGGCGAA CGCGCGCGGG CTGCGCGCGC GGATGTCGCT CTTCGCGCAC GCCGATTGCG CGGAGGCGAC GCTCGCGGGC GGCCACTTCG TCTACTGCGA CTTCTCGCAC GCCACGCTGT CGCACGCCGA CTGCACCGGC GCCGACTTCT CGCACGCGAA CCTGCACGGC CTGGCCGATC ACGCCGCCCG CTGGGACGGC GCGCGCAAGA CAGGCGCGCG CGCGACCGAT CCCGCGCTCG CGCACGCCGA ACGATGGTCC GCGCCCCAAC GATGA
|
Protein sequence | MSDLVHTALA ALADTRTLTG IDLSDADLSG RDLSGCTFER VSLRGANLSA AQLDATRWLH CDLTGARLDG ATLGESSWHA VALRAASLRA TTGDAFAMAE TDLAGATLTD ALWARATFER GDFSAAQCGR AKLLRCEAAD CRFERTDFAN AELERFVAMR AELSSARFDA TRLTHAFFAE ANLRGQRFER CDLTMTHFSR AALAGCDFSG ASLMQTMFFD ADLERATLAG ARGRHVRFAG ATLDGANLAH AAFDESDFAR ARLRTANARG LRARMSLFAH ADCAEATLAG GHFVYCDFSH ATLSHADCTG ADFSHANLHG LADHAARWDG ARKTGARATD PALAHAERWS APQR
|
| |