Gene Information |
Locus tag | BURPS1106A_A0252 |
Symbol | |
ID | 4904877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 243629 |
End bp | 246106 |
Gene Length | 2478 bp |
Protein Length | 825 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640143359 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001074295 |
Protein GI | 126455703 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins [COG5351] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
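The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the function name cds_lengths is hypothetical and not part of the database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one trailing stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(243629, 246106))  # (2478, 825)
```

Under these assumptions the result agrees with the Gene Length (2478 bp) and Protein Length (825 aa) fields.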
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCG TCAAACCGCT TGCCATCAGC CCGCTGACCC GCGTGTACCG GATGCACGGC CGGGAGTATC TCGGCGTCGC CGCGCTGTTG ATCGCGACGC TCGGCGACGA GCCGAAACTG CTGGCCGAAT CGGCGCTCTG GCGTCTGGCC GGCGACGAAC TGCGCGGCTA TCCGCTCGAC ATGGCGCTGC CGAAGGCGTG TCCGGAATTT CTCGTGTCCG GATACGCGTA CGGAAAGTAC GCGAGCGATC CGCACGCGTG CGCGTGCGAA GTGGGCGTGC GCATTGCCGG CCTCGAGAAG CGGCTGCGCG TCTGCGGCGA CCGGCAGTGG GCGGGCGCGC GCATCACCGC GCCGCGGCCG TTCGAGCGGC TACCGATCGA CTGGGATCTC GCTTACGGCG GCGCGGGTTG CGCGGACAAT CCGCGAGGCC GCGGCGCGCA CGCGCGGGAG GGCGCGCCGC GCGATCTGCC GAATGTCGAA TACGCGCACA GCCCGATGCG CTTTGCGCAC GAGCAACCCG CGCCCGCCGG CTTTTGCCCG GTCGACGCGG CATGGCCGGC GCGCGCCGGC CTGTACGGCG CGCTCGATCG GCAATGGCAG GAAGAGGATT GTCCGGGCTT TCCGCGCACG CTCGATCCGC GCTACTTCAA CATCGCGCCG GCCGATCAGC AACTGCCCGA GCTGCGGGCA TTCCCGGACG GCGCGCGCTA CGAACTGACG CACATGCATC CGGACCACGC GACGCTCGCG GGAAACCTGC CCGCGCTGCG CGCGAGATCG TTCGTGGTAC GTCGGGGCAG CGATGCGCCC GAGGAAATGC CGATGCGCTT GACGACCGCG TGGTTCGTTC CGCATCGCGA GCGCGTAATC CTGATCTATC ACGGCGTCAC GCCCGTTCGC GCGTTCGACG CGAGCGACGT GCAGACGGTG CTGTTCGGCG CGGAGGCGAG CGGGCACGCG AGGCCCGCCG ACTGGTATCG GCAGGTGATC GAGTGGCGCA CGCGGGACGA CAGGGCGGCG CTGTACGCGC TGCGCGACCG GGATCTGCTG CCCGAGCATG CGCTTGCGCC CGAAGCGGCG GCGACGCCCG AGCCGACGCA GCAGAGCGCG AAGCAGCGGC AGCTTCGCGA ACGGTTGAGC GTCTTTCCGG ATGCTCCGCG CGCACAGACG CCGGCGCCGG ATCGGCTGGC CGAATTCGTC GAGCAGCAGC AAGCGCTCGC CGACGAAAAG CGCGCCGCGC TGGAAGCCAT GCGGCGGGAA CTGGCGACCA GCGAAGTATT TTCGGTCGGC CGTCGGCGCG GCCCGCCCGG CCGGATCGCG CCCGCGGACG AAGAGCCCGC GCGGCACGCG GGCGCGTTGG CCGAATCGCC GGACATCCGG GCGCTCGAAC GCGACGCGGA CGAGCGTCTT CGCGGGCTGT ACCAGCAGTG CGCGCAACAT CAGGACGCAC CGGCCCGGCT GCACGGCGCG GCCGCGCGAG CGCGCCGCGA GTGCGTCGCG TCGGCCGCCG CGGCCGGCCA GTCGCTGCAA GTCGCCGATC TGACCGGCGC GGACCTCTCG GGAATGGACT TGCGCGGCGC GCGCCTGGCC GGCGCGATGC TGGAGAACGC CGATTTGAGC GACGCCGATC TGACGGGCGC GGATCTGTCG CGCACGGTGC TCGTGCGCGC CGATCTGACA CGTGCGAAGC TCGTCGATGC GCGCCTGACG GCGGCCAATC TGTCGCTCGC GCATTGCGAG CGGACGGATT TCTCCGGCTC GGATTTGAGT GACGGCATTT TCGAGCAGGT ACACCTACGA 
GATTGCCGCT TCAACGGCAG CGTGCTGGCG AGCACGCGCT TCGACGCGTG CCGGTTCGAT GCCGTCGATT TCGGTCGCGC GACGCTGCGC GAGCTGATCT TCATCGAACA ATCGTTCAGC GGCGTGAGCT TCTCGGATGC GACGATCCGC AAGATGCTGC TGATGCGTTG CGCGTTCGCC GACGTGCGGT TCTCGGCGGC GAGCATCGAC GGATTCGGGA TCGTCGAGAC GCAGGCGAGC GGGCAGCTCC GCTTCGATCG CGCGAGCGTG AACAAAGCGT GTTTCGTCGG GCGCTGCGAC ATCGGGCGCG CCGATTTCTC GTTCGCGACG CTGACGGAGG TCAATTTCCG CGAGACGCAG CTCGTCGAGG CGAACTTCGG CGGCGCGCGC ATCGGCAATT GCGATTTCAC CGATGCGTGC CTGCGAGCAG CCGATCTACG GGGCGCGAAG GCCGAGGGCA GCCCGTTCGT GCGCGCCGAT CTCACGCGCG CCGATCTTCG GGACACCGAT CTGATCGCCG CGTATCTGCG CGGCGCGAAG CTGGACGGCG CGGACCTTCG GCGCGCCAAC CTGTTTCGCG CGAACCTCTC GCAGATCCTC ACCGATGCCG ATACGCGCTG GCAGGGCGCG TACCTGAACC GGGCGGTGCG GTTTCCGCTG GCGGAGGCGC GCACATGA
|
Protein sequence | MKIVKPLAIS PLTRVYRMHG REYLGVAALL IATLGDEPKL LAESALWRLA GDELRGYPLD MALPKACPEF LVSGYAYGKY ASDPHACACE VGVRIAGLEK RLRVCGDRQW AGARITAPRP FERLPIDWDL AYGGAGCADN PRGRGAHARE GAPRDLPNVE YAHSPMRFAH EQPAPAGFCP VDAAWPARAG LYGALDRQWQ EEDCPGFPRT LDPRYFNIAP ADQQLPELRA FPDGARYELT HMHPDHATLA GNLPALRARS FVVRRGSDAP EEMPMRLTTA WFVPHRERVI LIYHGVTPVR AFDASDVQTV LFGAEASGHA RPADWYRQVI EWRTRDDRAA LYALRDRDLL PEHALAPEAA ATPEPTQQSA KQRQLRERLS VFPDAPRAQT PAPDRLAEFV EQQQALADEK RAALEAMRRE LATSEVFSVG RRRGPPGRIA PADEEPARHA GALAESPDIR ALERDADERL RGLYQQCAQH QDAPARLHGA AARARRECVA SAAAAGQSLQ VADLTGADLS GMDLRGARLA GAMLENADLS DADLTGADLS RTVLVRADLT RAKLVDARLT AANLSLAHCE RTDFSGSDLS DGIFEQVHLR DCRFNGSVLA STRFDACRFD AVDFGRATLR ELIFIEQSFS GVSFSDATIR KMLLMRCAFA DVRFSAASID GFGIVETQAS GQLRFDRASV NKACFVGRCD IGRADFSFAT LTEVNFRETQ LVEANFGGAR IGNCDFTDAC LRAADLRGAK AEGSPFVRAD LTRADLRDTD LIAAYLRGAK LDGADLRRAN LFRANLSQIL TDADTRWQGA YLNRAVRFPL AEART
|
| |
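The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of stripping the spacing, computing GC content, and translating the gene sequence. The helper names (clean, gc_fraction, translate) are hypothetical. The codon table is built by hand, on the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may serve as starts. That difference does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate from the first base; drop one trailing stop codon if present."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence shown above.
print(translate("ATGAAAATCG TCAAACCGCT TGCCATCAGC"))  # MKIVKPLAIS
```

Passing the full Gene sequence field to translate should give a string comparable to the Protein sequence field once both have been run through clean.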