Gene Information |
Locus tag | BURPS1106A_A2143 |
Symbol | |
ID | 4904708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2095685 |
End bp | 2097943 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640145248 |
Product | hypothetical protein |
Protein accession | YP_001076176 |
Protein GI | 126458419 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
|
|
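The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the stop codon; both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2095685, 2097943)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values listed here, the computed lengths agree with the Gene Length (2259 bp) and Protein Length (752 aa) fields.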
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.137262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACT CATGCGTGAA ATCCCCGGAC CCGGTGCCCG GCCGGCCCGC GAGCGGCGGG GCCGGTGCGC GCGCGCTCGC CCGCCTGCGC GCGTTGTGGC GCGTGTGCTC GCGCGCGGCG CGGCCGCGCG AGCCCGCGCA TGCGGCGAAC CGGCTCGCGA TCGACGCGCT GCCCGACGAG TGGGCCGAGC TCGCGCCGGG CGGCCTGTAT GCGGTGTACG CGGCGGCGTG CACGAGCGCG TGCGACGCGC TGATCTGGGA CAGCGTGCGG GACGCGCGCA CGCGCGACGT CACGGTGGTG CTCGCGCGCG AGCGCGCGGC GGTCGCGACG CGGCTGCGCG AGCTCGGCTT CGTCGACGGC ATGCACGCGC GCGGCTGGCC GCGGCGGTTG AACGTGCTGG CGATGCCGCC GGGCGATATC GCGGCGCGCG GCGCGGCGCG TGAGGGCGCG CCCGCGCCCG TGCCTGCGCC CGCGTTCTCA CGCCTCGTCG GCGGCCTGCG CGCGCTGAGG CGCTACCGCT TCCGTTCGAA CGCGCTGTAT TTCGTCGAAG GCGCGGAGCG CTGGTTCAGT TGGCACGATC CGGTCGCGCT GACGCACGAG GGGTGGGCGC TGGCCGGCTG GTGCCGTTCG CATCGGATCG CGCTCGTGCT GCTGATCGAT CCGCGGGCGT CGCAAGCGGC CGCGAGCCGC GCCGATGCGC GGCACACGGC CCCGCTGCCC GACGCACCCG ATGCACCGGA CGCATCCGAG GCGGGTGACG GCGTGCTCGG CGGCGCGCGC GCGGAGCACG GCGCCGACGA TCGCACCACG CTCTTCGTCG CCGATCGCAC GCGCGCCGCG CGCGGCGGCT TTCACGGCGC GTGCGCGGGC GTCGCGCAAT TGCAGCGCAC GCACGGCGAG CTGCGCTGGC GGGTCGATTT CTGGCGCTCG CGCGGCGCGG TCGCCACGGG CGAGGTGCGC GCGCTGCGCT TCATCGGCGA CGGACGGCTC GCGGCCGTGC CGGCGGCCGG CGCGCACGCG GCGGGCGGCG GCGCGCGGCT CGCGTTCGAC GAGGCGCGCG TCGTCGTCAG CCGCCGCGTG GTCGAGCGCG AATCGTGGGT GCCGGGCGAT TGGGAAGTCG TCGACGACAA CGACGCGGTG CTCGCCGCGT GCGCCGGCGC GCATGCGGCG AGCGCGGTGC TGGCGTTTAC CGGCCGCGCG CAGCTCGAAG CGCTGTGCGC GACGATCCAT GCGCTGCGCC TGCGGTGCGG CGGCGCGCTG AAGATCGTCG TCGTCGAGCG CGGCGAGGCG ATGCGCCATC AATTCGAGCT GCTCGCGCTG AACCTCGGCG CGAACCAGGT CGTCGCGCGC AACCTGCCGT TCTCGCGCGT GCTCGCGGTG CTGCGCTCGC TGCAGGGCCA GTTGCACGCG CGCCCAGTCG CGGCCGACTA TCGGGCTGCG CTCGCCGCGT CGCTCGGCGA CACGGCGCTC GGCTATCTGC CTGTCGGCGC GTTCTGCTCG CAGGCGCGCG CGGTGCTCGA GCGCAGCGCG GTGCTCGCGC TGTCGCATAC GCTCGTGAAG CTGACGCTGC TGCCCGGCGT CGCGCACGCG CACGCGTTGC GCGCGTGCAC GCCGCGCCGC GCGGGCGACG TGCTGACCGC CGACGCGCAG CACCTGTATC TGTTCCTGTT CGCCTGCGAG CTCGCCGATG CGAACGACGT GCTCGGCCAC CTCTTCGACG TGCCCGTCGA GCGGATCTCG GATCGCGTCG TGCATCTCGC GCAGGACAGC ATCGAGCATG AGCTGAATGC GCTCGACGCG 
GCGAACCGGC GCGCGCCGAT CGCGGACTAC AGCGATCTCT TTTCGCCGGC GGCGGTGGCG ACGCGCGCGG CCGGCGCGCG CGCCTCGGCC GGCGCTCCGG CGGCGGCGCG CGACGGCGAA CTGTCCGCCG AGCCTATGTC GCCGCATGTG CCGCCGCATG CGCCGCATGT GTCGGGCGCG CCGACGCCGC CGGGCACGCG CGCCGCGGCG CACGGACCAC CGTGGTGCCC GGCGTTCGCG CTGTCGTCCG CATCGCAGAC CTCGCGGACA TCGCCCGCCT CGGCGCAGAT CGTCGTACCG CCGCCGCAGG CGCCGTCGAA CGTATCGCAC GCGCCGCTGT CCGCGACGCC GCGCGCGCCG CGACCGCGCC GACCGCACGA CGCCGGCGCG GTCGCCGGCG TGCGCACCCG CACCGCCACG CGCGACGCGA TGCCGTTGCG CCCCAGGGAG GCTGAATGA
|
Protein sequence | MNDSCVKSPD PVPGRPASGG AGARALARLR ALWRVCSRAA RPREPAHAAN RLAIDALPDE WAELAPGGLY AVYAAACTSA CDALIWDSVR DARTRDVTVV LARERAAVAT RLRELGFVDG MHARGWPRRL NVLAMPPGDI AARGAAREGA PAPVPAPAFS RLVGGLRALR RYRFRSNALY FVEGAERWFS WHDPVALTHE GWALAGWCRS HRIALVLLID PRASQAAASR ADARHTAPLP DAPDAPDASE AGDGVLGGAR AEHGADDRTT LFVADRTRAA RGGFHGACAG VAQLQRTHGE LRWRVDFWRS RGAVATGEVR ALRFIGDGRL AAVPAAGAHA AGGGARLAFD EARVVVSRRV VERESWVPGD WEVVDDNDAV LAACAGAHAA SAVLAFTGRA QLEALCATIH ALRLRCGGAL KIVVVERGEA MRHQFELLAL NLGANQVVAR NLPFSRVLAV LRSLQGQLHA RPVAADYRAA LAASLGDTAL GYLPVGAFCS QARAVLERSA VLALSHTLVK LTLLPGVAHA HALRACTPRR AGDVLTADAQ HLYLFLFACE LADANDVLGH LFDVPVERIS DRVVHLAQDS IEHELNALDA ANRRAPIADY SDLFSPAAVA TRAAGARASA GAPAAARDGE LSAEPMSPHV PPHAPHVSGA PTPPGTRAAA HGPPWCPAFA LSSASQTSRT SPASAQIVVP PPQAPSNVSH APLSATPRAP RPRRPHDAGA VAGVRTRTAT RDAMPLRPRE AE
|
| |
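The gene sequence above is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch, not the database's own pipeline: it builds the codon table used by NCBI translation table 11, whose codon-to-amino-acid assignments are the same as the standard code, and computes a GC fraction. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first 30 bases of the record are used as a demonstration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids as the standard code; it differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAACGACT CATGCGTGAA ATCCCCGGAC"
print(translate(prefix))
```

The translated prefix matches the first block of the protein sequence listed above (MNDSCVKSPD). Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.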