Gene Information |
Locus tag | BURPS668_1336 |
Symbol | |
ID | 4881971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1305090 |
End bp | 1306100 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640127264 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001058379 |
Protein GI | 126438967 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACAAGG CGACCGTATC GTCGGCCTAT GCGCTTTTCA TGCTGATGCT CGCGGAAGAG CGCGGCATCG CCGATGCGGA CCTTCTCGCG CATACGGGCG TCACGCGCAC GAAGCTCGAG GAGCCTAACG CGCGCATCAC GCCGCTGCAG CAGGCGGCGA TCGTGTTCAA TTTGCTCGGG ATGACGAATG ATCCGTCGAT CGCGATCGAG ATCGGCCTGC GAAGCAGCCT GACGAAATCG GGGCTGATCG GCTTCGGCCT GATGAGCTGT GCGACGCTCG GCGAGGCGAT CCAGCTCGGC ATTCGCTATC TGCCGACGCG CGTGCCGTTC TTCTCGGTGC GGTTCACGGA ATTCGAGCAC ACGGTGCAGA TCGACATTCT CGAAGCGTTT CCGCTCGGCA GGCTGCGGCA GTTCGCCGTC GAGAACTTCA TGGTCGAGAC GGCGATCCTG TTCAACTCGC TGCTGACGCC TTCGCATGAC AAGACGATGA AGGCGAACGC CGAGCTCTGC TTCGAGTGGC CCGAGCCGCC TTATTTCGCA CGTTATCGTG ATCGCCTGCC GCGCTGCCAT TTCGGCGCTC CGGCCAATCA GATCCGTTGC GAGGCCGCGC TGCTCGACGA GCCGATCAAG ACCGCGAACG CGCACACGGC GCAGATGATC GTCCAGCAGT GCGAGGCGGA GCTCGCGCGG CTCGGGTATG CGGAGAGCAT CGTCGAGCGC GTGCGCAATC TGCTGATTCG CGGCAGCCAC GGCTATCCGT CGCTCGACGC GCTCGCGCGC GAGCTCCATC TGTCCGAGCG CACGCTCAAG CGCAAGCTGA GCGACTATGG CACGACGTAT TCGGCGCTGC TCGACGAGAT CCGGCTGCGC GACGCGCTGC GCCTGCTCGA AGGCACGACG CTGACGGTCG AGGAGATCGC GGCACGCGTC GGCTATACGG ATCGCGCCAA TTTCAGCCGC GCGTTTCGGC GCTGGACCGG CACGTCGCCG AGCGACCGGC GCCGGACGTG A
|
Protein sequence | MNKATVSSAY ALFMLMLAEE RGIADADLLA HTGVTRTKLE EPNARITPLQ QAAIVFNLLG MTNDPSIAIE IGLRSSLTKS GLIGFGLMSC ATLGEAIQLG IRYLPTRVPF FSVRFTEFEH TVQIDILEAF PLGRLRQFAV ENFMVETAIL FNSLLTPSHD KTMKANAELC FEWPEPPYFA RYRDRLPRCH FGAPANQIRC EAALLDEPIK TANAHTAQMI VQQCEAELAR LGYAESIVER VRNLLIRGSH GYPSLDALAR ELHLSERTLK RKLSDYGTTY SALLDEIRLR DALRLLEGTT LTVEEIAARV GYTDRANFSR AFRRWTGTSP SDRRRT
|
| |
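The lengths reported under Gene Information follow from the coordinates by simple arithmetic. The sketch below shows that relationship, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the listed 1011 bp) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers written for this illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces between blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Values taken from the record for BURPS668_1336.
length = gene_length(1305090, 1306100)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa

# GC content of the first two 10-base blocks of the gene sequence only,
# not of the whole gene.
print(gc_percent("TTGAACAAGG CGACCGTATC"))
```

Passing the full Gene sequence string to `gc_percent` would give a figure to compare against the reported GC content, which the record appears to round to a whole percent.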