Gene Information |
Locus tag | BURPS668_A2708 |
Symbol | |
ID | 4887398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2591871 |
End bp | 2592875 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640132644 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001063700 |
Protein GI | 126444626 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.792624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAGGA CGCCGGCGCT TTGTGCGAGG CAGACGATCA TGCCCCCGAC CCCATTCGAT CCGCTCGCGT TGCGCGAGCA CCGGCTGTTC GAATCGCGCG ACCTCGACGA GACGCGCGAG CGGATCTCGC GCGTGATGCA GCCGCATGCG CTGTTGCCGG ACGGCTCGCG GCACGGGCCG TCGCACATGG ATTACGTGCG CCTCGGCGGG CTCGGCATCG GAACCATCGC GTTCGGCGAC GCGATGCGGG TGCATCTCGA TGCGGTGGAC GGCTATCATC TGCTGATGTT TTGTCTGACG GGTTCCGCGC AGGTCCGCAC GATGGGCCGC GCGTTCGACG TCGACGCGCA CACGGGCGTG CTGTGCGCGC CGGGCGAGCC GTTCGACGCG CACCTGTCGC GCGATTGCGA GCAGTTCGTC CTCCGTATCG ATGCGGCGAC CCTCGCCGCG CACGCGGGCG ACGCGGCGGC GGCGCTCGAT CCCGTGATCG GCATCGACGA TTCGGCGCTG AGCGCGTGGA TGCAGCAACT GCAGCTCGTC GCGCGCTCGC CGGAGCTGCT CGCGAGCGCA AGCGCGAACC CGCGCGTCGC GACGCGGCTC GAACAGTTGC TGCTCGATCT GCTGATCGAC GGGCATCCGC CCGCCGCGCC GCCCGCGCGG CGCGCCGATC CGGCGCCAGG CTTCGTGCGG CGCGCGCAGG AGTTCATCGG CGCGCAGCTC GCCCAGCCGC TGCAGCTCGC CGACATCGCG CAGGCCGCGG GCGTACCCGA GCGCACGCTG CGCGACGGCT TCCTGCAGTT TCGCGGGACG AGCCCGATGC AATACCTGCG CCAGCGGCGC CTCGAGCGCG CGCGCGAGCT GCTGCGCACG GCCGCGCCCG AGCGCCGGAT CGCCGAGATC GCGCTCGATT GCGGTTTCGC GCACTTCGGC CGCTTCGCGA TCGCCTACCG CGAACGGTTC GGCGAGCTGC CGTCCGCGAC GCTCGCCGAT CGGCGCGACG CCTGA
|
Protein sequence | MFRTPALCAR QTIMPPTPFD PLALREHRLF ESRDLDETRE RISRVMQPHA LLPDGSRHGP SHMDYVRLGG LGIGTIAFGD AMRVHLDAVD GYHLLMFCLT GSAQVRTMGR AFDVDAHTGV LCAPGEPFDA HLSRDCEQFV LRIDAATLAA HAGDAAAALD PVIGIDDSAL SAWMQQLQLV ARSPELLASA SANPRVATRL EQLLLDLLID GHPPAAPPAR RADPAPGFVR RAQEFIGAQL AQPLQLADIA QAAGVPERTL RDGFLQFRGT SPMQYLRQRR LERARELLRT AAPERRIAEI ALDCGFAHFG RFAIAYRERF GELPSATLAD RRDA
|
| |
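The coordinate and length fields in this record can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names (span_length, coding_codons, gc_fraction) are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 2591871
END_BP = 2592875
GENE_LENGTH_BP = 1005
PROTEIN_LENGTH_AA = 334


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # End - Start + 1 should equal the reported gene length.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    # (gene length / 3) - 1 stop codon should equal the reported protein length.
    assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

To compare against the reported GC content, pass the Gene sequence field to gc_fraction as a string; the grouping spaces do not need to be removed first.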