Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0343 |
Symbol | |
ID | 4887422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 312539 |
End bp | 315016 |
Gene Length | 2478 bp |
Protein Length | 825 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640130284 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001061349 |
Protein GI | 126442493 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins [COG5351] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.1938 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCG TCAAACCGCT TGCCATCAGC CCGCTGACCC GCGTGTACCG GATGCACGGC CGGGAGTATC TCGGCGTCGC CGCGCTGTTG ATCGCGACGC TCGGCGACGA GCCGAAACTG CTGGCCGAAT CGGCGCTCTG GCGTCTGGCC GGCGACGAAC TGCGTGGCTA TCCGCTCGAC ATGGCGCTGC CGAAGGCGTG TCCGGAGTTT CTCGTGTCCG GATACGCGTA CGGAAAGTAC GCGAGCGATC CGCACGCGTG CGCGTGCGAA GTGGGCGTGC GCATTGCCGG CCTCGAGAAG CGGCTGCGCG TCTGCGGCGA CCGGCAGTGG GCGGGCGCGC GCATCACCGC GCCGCGGCCG TTCGAGCGGC TACCGATCGA CTGGGATCTC GCTTACGGCG GCGCGGGTTG CGCGGACAAT CCGCGAGGCC GCGGCGCGCA CGCGCGGGAG GGCGCGCCGC GCGATCTGCC GAATGTCGAA TACGCGCACA GCCCGATGCG CCTTGCGCAC GAGCAACCCG CGCCCGCCGG CTTTTGCCCG GTCGACGCGG CATGGCCGGC GCGCGCCGGC CTGTACGGCG CGCTCGATCG GCAATGGCAG GAAGAGGATT GTCCGGGCTT TCCGCGCACG CTCGATCCGC GCTACTTCAA CATCGCGCCG GCCGATCAGC AACTGCCCGA GCTGCGGGCA TTCCCGGACG GCGCGCGCTA CGAACTGACG CACATGCATC CGGACCACGC GACGCTCGCG GGAAACCTGC CCGCGCTGCG CGCGAGATCG TTCGTGGTAC GTCGGGGCAG CGATGCGCCC GAGGAAATGC CGATGCGCTT GACGACCGCG TGGTTCGTTC CGCATCGCGA GCGCGTGATC CTGATCTATC ACGGCGTCAC GCCCGTTCGC GCGTTCGACG CGAGCGACGT GCAGACGGTG CTGTTCGGTG CGGAGGCGAG CGGGCACGCG AGGCCCGCCG ACTGGTATCG GCAGGTGATC GAGTGGCGCA CGCGGGACGA CAGGGCGGCG CTGTACGCGC TGCGCGACCG GGATCTGCTG CCCGAGCATG CGCTTGCGCC CGAAGCGGCG GCGACGCCCG AGCCGACGCA GCAGAGCGCG AAGCAGCGGC AGCTTCGCGA ACGGTTGAGC GTCTTTCCGG ATGCTCCGCG CGCACAGACG CCGGCGCCGG ATCGGCTGGC CGAATTCGTC GAGCAGCAGC AAGCGCTCGC CGACGAAAAG CGCGCCGCGC TGGAAGCCAT GCGGCGGGAA CTGGCGACCA GCGAAGTATT TTCGGTCGGC CGTCGGCGCG GCCCGCCCGG CCGGATCGCG TCCGCGGACG AAGAGCCCGC GCGGCACGCG GGCGCGTCGG CCGAATCGCC GGACATCCGG GCGCTCGAAC GCGACGCGGA CGAGCGTCTT CGCGGGCTGT ACCAGCAGTG CGCGCAACAT CAGGACGCGC CGGCCCGGCT GCACGGCGCG GCCGCGCGAG CGCGCCGCGA ATGCGTCGCG TCGGCCGCCG CGGCCGGCCA GTCGCTGCAA GGCGCCGATC TGACCGGCGC GGACCTCTCG GGAATGGACT TGCGCGGCGC GCGCCTGGCC GGCGCGATGC TGGAGAACGC CGATTTGAGT GACGCCGATC TGACGGGTGC GGATCTGTCG CGCACGGTGC TCGTGCGCGC CGATCTGACA CGTGCGAAGC TCGTCGATGC GCGCCTGACG GCGGCCAATC TGTCGCTCGC GCATTGCGAG CGGACGGATT TCTCCGGCTC GGATTTGAGT GACGGCATTT TCGAGCAGGT ACACCTACGA GATTGCCGCT TCAACGGCAG CGTGCTGGCG AGCACGCGCT TCGACGCGTG CCGGTTCGAT GCCGTCGATT TCGGTCGCGC GACGCTGCGC GAGCTGATCT TCATCGAACA ATCGTTCAGC GGCGTGAGCT TCTCGGATGC GACGATCCGC AAGATGCTGC TGATGCGTTG CGCGTTTGCC GACGTGCGGT TCTCGGCGGC GAGCATCGAC GGATTCGGGA TCGTCGATAC GCAGGCGAGC GGGCAGCTCC GCTTCGATCG CGCGAGCGTG AACAAAGCGT GTTTCGTCGG GCGCTGCGAC ATCGGGCGCG CCGATTTCTC GTTCGCGACG CTGACGGAGG TCAATTTCCG CGAGACGCAG CTCGTCGAGG CGAACTTCGG CGGCGCGCGC ATCGGCAATT GCGATTTCAC CGATGCGTGC CTGCGGGCAG CCGATCTACG GGGCGCAAAG GCCGAGGGCA GCCCGTTCGT GCGCGCCGAT CTCACGCGCG CCGATCTTCG GGACACCGAT CTGATCGCCG CGTATCTGCG CGGCGCGAAA CTGGACGGCG CGGACCTTCG GCGCGCCAAC CTGTTTCGCG CGAACCTCTC GCAGATCCTC ACCGATGCCG ATACGCGCTG GCAGGGCGCG TACCTGAACC GGGCGGTGCG GTTTCCGCTG GCGGAGGCGC GCACATGA
|
Protein sequence | MKIVKPLAIS PLTRVYRMHG REYLGVAALL IATLGDEPKL LAESALWRLA GDELRGYPLD MALPKACPEF LVSGYAYGKY ASDPHACACE VGVRIAGLEK RLRVCGDRQW AGARITAPRP FERLPIDWDL AYGGAGCADN PRGRGAHARE GAPRDLPNVE YAHSPMRLAH EQPAPAGFCP VDAAWPARAG LYGALDRQWQ EEDCPGFPRT LDPRYFNIAP ADQQLPELRA FPDGARYELT HMHPDHATLA GNLPALRARS FVVRRGSDAP EEMPMRLTTA WFVPHRERVI LIYHGVTPVR AFDASDVQTV LFGAEASGHA RPADWYRQVI EWRTRDDRAA LYALRDRDLL PEHALAPEAA ATPEPTQQSA KQRQLRERLS VFPDAPRAQT PAPDRLAEFV EQQQALADEK RAALEAMRRE LATSEVFSVG RRRGPPGRIA SADEEPARHA GASAESPDIR ALERDADERL RGLYQQCAQH QDAPARLHGA AARARRECVA SAAAAGQSLQ GADLTGADLS GMDLRGARLA GAMLENADLS DADLTGADLS RTVLVRADLT RAKLVDARLT AANLSLAHCE RTDFSGSDLS DGIFEQVHLR DCRFNGSVLA STRFDACRFD AVDFGRATLR ELIFIEQSFS GVSFSDATIR KMLLMRCAFA DVRFSAASID GFGIVDTQAS GQLRFDRASV NKACFVGRCD IGRADFSFAT LTEVNFRETQ LVEANFGGAR IGNCDFTDAC LRAADLRGAK AEGSPFVRAD LTRADLRDTD LIAAYLRGAK LDGADLRRAN LFRANLSQIL TDADTRWQGA YLNRAVRFPL AEART
|
| |