Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2229 |
Symbol | |
ID | 4887849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2156964 |
End bp | 2159228 |
Gene Length | 2265 bp |
Protein Length | 754 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 640132166 |
Product | hypothetical protein |
Protein accession | YP_001063223 |
Protein GI | 126445299 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.281505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACT CACGCGTGAA ATCCCCGGAC CCGGTGCCCG GCCGGCCCGC GAGCGGCGGG GCCGGTGCGC GCGCGCTCGC CCGCCTGCGC GCGTTGTGGC GCGTGTGCTC GCGCGCGGCG CGGCCGCGCG AGCCCGCGCA TGCGGCGAAC CGGCTCGCGA TCGACGCGCT GCCCGACGAG TGGGCCGAGC TCGCGCCGGG CGGCCTGTAT GCGGTGTACG CGGCGGCGCG CACGAGCGCG TGCGACGCGC TGATCTGGGA CAGCGTGCGG GACGCGCGCA CGCGCGACGT CACGGTGGTG CTCGCGCGCG AGCGCGCGGC GGTCGCGACG CGGCTGCGCG AGCTCGGCTT CGTCGACGGC ATGCACGCGC GCGGCTGGCC GCGGCGGTTG AACGTGCTGG CGATGCCGCC GGGCGATATC GCGGTGCGCG GCGCGGCGCG TGAGGGCGCA CCCGCGTCCG CGCCCGCGCC CGCGCCCGCG TTCTCACGCC TCGTCGGCGG CCTGCGCGCG CTGAGGCGCT ACCGCTTCCG TTCGAACGCG CTGTATTTCG TCGAAGGCGC GGAGCGCTGG TTCAGTTGGC ACGATCCGGT CGCGCTGACG CACGAGGGGT GGGCGCTGGC CGGCTGGTGC CGTTCGCATC GGATCGCGCT CGTGCTGCTG ATCGATCCGC GGGCGTCGCA AGCGGCCGCG AGCCGCGCCG ATGCGCGGCA CACGGCCCCG CTGCCCGACG CACCCGATGC ACCTGACGCA TCCGACGCGG GTGACGGCGT GCTCGGCGGC GCGCGCGCGG AGCACGGCGC CGACGATCGC ACCACGCTCT TCGCCGCCGA TCGCGCGCGC GCCGCGCGCG GCGGCTTTCA CGGCGCGTGC GCGGGCGTCG CGCAATTGCA GCGCACGCAC GGCGAGCTGC GCTGGCGGGT CGATTTCTGG CGCTCGCGCG GCGCGGTCGC CACGGGCGAG GTGCGCGCGC TGCGCTTCAT CGGCGACGGA CGGCTCGCGG CCGTGCCGGC GGCCGGCGCG CACGCGGCGG GCGGCGGCGC GCGGCTCGCG TTCGACGAGG CGCGCGTCGT CGTCAGCCGC CGCGTGGTCG AGCGCGAATC GTGGGTGCCG GGCGATTGGG AAGTCGTCGA CGACAACGAC GCGGTGCTCG CCGCGTGCGC CGGCGCGCAT GCGGCGAGCG CGGTGCTGGC GTTTACCGGC CGCGCGCAGC TCGAAGCGCT GTGCGCGACG ATCCATGCGC TGCGCCTGCG GTGCGGCGGC GCGCTGAAGA TCGTCGTCGT CGAGCGCGGC GAGGCGATGC GCCATCAATT CGAGCTGCTC GCGCTGAACC TCGGCGCGAA CCAGGTCGTC GCGCGCAACC TGCCGTTCTC GCGCGTGCTC GCGGTGCTGC GCTCGCTGCA GGGCCAGTTG CACGCGCGCC CGGTCGCGGC CGACTATCGG GCCGCGCTCG CCGCGTCGCT CGGCGACACG GCGCTCGGCT ATCTGCCCGT CGGCGCGTTC TGCTCGCAGG CGCGCGCGGT GCTCGAGCGC AGCGCGGTGC TCGCGCTGTC GCATACGCTC GTGAAGCTGA CGCTGCTGCC CGGCGTCGCG CACGCGCACG CGTTGCGCGC GTGCACGCCG CGCCGCGCGG GCGACGTGCT GACCGCCGAC GCGCAGCATC TGTATCTGTT CCTGTTCGCC TGCGAGCTCG CCGATGCGAA CGACGTGCTC GGCCACCTCT TCGACGTGCC CGTCGAGCGG ATCTCGGATC GCGTCGTGCA TCTCGCGCAG GACAGCATCG AGCATGAGCT GAATGCGCTC GACGCGGCGA ACCGGCGCGC GCCGATCGCG GACTACAGCG ATCTCTTTTC GCCGGCGGCG GTGGCGACGC GCGCGGCCGG CGCGCGCGCC TCGGCCGGCG CTCCGGCGGC GGCGCGCGAC GGCGAACCGT CCGCCGAGCC TGTGTCGCCG CATGTGCCGC CGCATGCGCC GCATGTGTCG GGCGCGCCGA CGCCGCCGGG CACGCGCGCC GCGGCGCACG GACCACCGTG GCGCCCGGCG TTCGCGCTGT CGTCCGCGTC TCAGACCTCG CGGACATCGC CCGCCTCGGC GCAGATCGTC GTACCGCCGC CGCAGGCGCC GTCGAACGTA TCGCACGCGC CGCTGTCCGC GACGCCGCGC GCGCCGCGAC CGCGCCGGCC GCACGACGCC GGCGCGGTCG CCGGCGTGCG CACCCGCACC GCCACGCGCG ACGCGATGCC GTTGCGCCCC AGGGAGGCTG AATGA
|
Protein sequence | MNDSRVKSPD PVPGRPASGG AGARALARLR ALWRVCSRAA RPREPAHAAN RLAIDALPDE WAELAPGGLY AVYAAARTSA CDALIWDSVR DARTRDVTVV LARERAAVAT RLRELGFVDG MHARGWPRRL NVLAMPPGDI AVRGAAREGA PASAPAPAPA FSRLVGGLRA LRRYRFRSNA LYFVEGAERW FSWHDPVALT HEGWALAGWC RSHRIALVLL IDPRASQAAA SRADARHTAP LPDAPDAPDA SDAGDGVLGG ARAEHGADDR TTLFAADRAR AARGGFHGAC AGVAQLQRTH GELRWRVDFW RSRGAVATGE VRALRFIGDG RLAAVPAAGA HAAGGGARLA FDEARVVVSR RVVERESWVP GDWEVVDDND AVLAACAGAH AASAVLAFTG RAQLEALCAT IHALRLRCGG ALKIVVVERG EAMRHQFELL ALNLGANQVV ARNLPFSRVL AVLRSLQGQL HARPVAADYR AALAASLGDT ALGYLPVGAF CSQARAVLER SAVLALSHTL VKLTLLPGVA HAHALRACTP RRAGDVLTAD AQHLYLFLFA CELADANDVL GHLFDVPVER ISDRVVHLAQ DSIEHELNAL DAANRRAPIA DYSDLFSPAA VATRAAGARA SAGAPAAARD GEPSAEPVSP HVPPHAPHVS GAPTPPGTRA AAHGPPWRPA FALSSASQTS RTSPASAQIV VPPPQAPSNV SHAPLSATPR APRPRRPHDA GAVAGVRTRT ATRDAMPLRP REAE
|
| |