Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0681 |
Symbol | |
ID | 4885788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 652828 |
End bp | 653748 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640130621 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001061680 |
Protein GI | 126443029 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTGCA CCGATCTTGC CCCGACAAGC CTGCGCGAAG CGTTTCGCGG CGCGACGATC GAGACGATCG ACGCGCGCGT CGAAGGCTAT GCGAGCGTCG TGCTCACGCG GGTGCGCCAT GCGGCGCACG GCTTCGGCTT CGTCGAGTTC ATGCCGCCCG CCGACGGCTA TCTGGTGGGC ATGTCGCTGA CGCCGCCCGC CGCGCGCGCG CCACGCGCGG CGCCGGACGC CGCGGCCGAT GCCGATGCCG ACGCGCCGCA TCGCGCCGCA TGCGTCGCCG TGCAGATCCT GCAGGATGAC GAACCGTTTC GCGCGGACCT GCTGCAGCCG TTCGACATGC TGTTCCATGC GCTGCCGCGC CGCACGCTCG CCGAGCTCGC CGCCGATCTG CGCATGGGCG GCGTGCGCGA GCTGGCGAGC CCGGCGGGCG GCTGCGACGC GGTGCTGCAG AGTCTCGTCG GCGTGCTGCT CGCCGCATCG GCGGGATCGT CCGCGCGCAG CCCGCTGCTC GCGGGCCACG TCGCGCGCGC GATGCAGATC CACATCGTGC AGCAATACGG CGTGGCCGCG CCGAGCACGC CCGCGAAGGG CGGGCTCGCC GGCTGGCAGC TCGAGCGGGC GAAAGCGATC CTCACCGAGA ACATCGCGGG CGAGGTGCCG ATCTCGCAGG TCGCGTCCGC GTGCGGCCTG TCGCGCAGCT ACTTCATCAA GGCGTTTCGG CAGACGGTCG GCACGACGCC GCATCGCTGG CTGCTCGAGC ACAAGATCGA GCGCGTGAAG CGCAGCCTGC TGTCGCAGTC CGCGCCGATC GCCGATATCG CGCATCAGTG CGGCTTCTCG GATCAGGCGC ACCTGACGCG TGTCTTCACG AGCATGATCG GCACGCCGCC CGCCGCGTGG CGCCGTGTGA ACCGCCCATA G
|
Protein sequence | MNCTDLAPTS LREAFRGATI ETIDARVEGY ASVVLTRVRH AAHGFGFVEF MPPADGYLVG MSLTPPAARA PRAAPDAAAD ADADAPHRAA CVAVQILQDD EPFRADLLQP FDMLFHALPR RTLAELAADL RMGGVRELAS PAGGCDAVLQ SLVGVLLAAS AGSSARSPLL AGHVARAMQI HIVQQYGVAA PSTPAKGGLA GWQLERAKAI LTENIAGEVP ISQVASACGL SRSYFIKAFR QTVGTTPHRW LLEHKIERVK RSLLSQSAPI ADIAHQCGFS DQAHLTRVFT SMIGTPPAAW RRVNRP
|
| |