Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1906 |
Symbol | |
ID | 4885137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1867864 |
End bp | 1868853 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640127834 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001058941 |
Protein GI | 126439829 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.166677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGCCCG ATCTCGAAGT CGTCGACGTG CGCCGCGGCG AGTCGTTCAA GGCGTGGTCG CATGGCTACC CGTACCGCAC GGTGCGGTGG CACTTTCATC CGGAGTTCGA AGTGCATCTG ATCGTCGAGA CGACGGGGCA GATGTTCGTC GGCGATTATG TCGGCGGCTT CGGCCCCGGC AATCTCGTGC TGATGGGACC GAACCTGCCG CACAACTGGG TGAGCGACGT GCCGGAAGGC AGGACGATCG CCGAGCGCAA TCTCGTCGTG CAGTTCGGCC AGGCGTTCGT GTCGCGCTGC GCGGACAGCT TCACCGAATG GCGGCACGTC GAGGCGCTGC TCGCCGACGC GCGCCGCGGC GTGCAGTTCG GCCCGCGCAC GAGCGAGGCG ATCAAGCCGC TCTTCGCCGA GCTGATTCAC GCGCGGGGGT TGCGGCGCAT CGTGCTGTTC CTGTCGATGC TGCAGATCCT CATCGACGCG ACCGACCGCG AGCTGCTCGC GAGCCCCGCG TACGAAGCCG ATGCGTCGAG CTTCGCGTCG ACGCGCATCA ACCACGTGCT CGCGTACCTC GGCAAGAACC TCGCGAACGA GCTGCGCGAG ACCGATCTCG CGCGGCTCGC CGGGCAGAGC GTGAGCGCGT TCTCGCATTA CTTCCGGCGG CACACCGGTT TGCCGTTCGT CCAGTACGTG AACCGGATGC GGATCAATCT CGCGTGCCAA CTGCTGATGG ACGGCGACGC GAGCATCACC GACATCTGCT TCAGGAGCGG CTTCAACAAC CTGTCGAACT TCAATCGCCA GTTCCTCGCG GTGAAGGGCA TGTCGCCGTC GCGCTTTCGC CGCTATCAGG CGCTGAACGA CGCGAGCCGC GAGGCGTCCG AGGCCGCCGC GCAGCGCGGC GCGGGCATCG CCGGCGCGCC GGCGATCGTG CCGGCCGCGC GGGCGCGCGG CGAGGCGCGT GCGCCCGCCG AAGTCCTGCT GTCCGGCTGA
|
Protein sequence | MQPDLEVVDV RRGESFKAWS HGYPYRTVRW HFHPEFEVHL IVETTGQMFV GDYVGGFGPG NLVLMGPNLP HNWVSDVPEG RTIAERNLVV QFGQAFVSRC ADSFTEWRHV EALLADARRG VQFGPRTSEA IKPLFAELIH ARGLRRIVLF LSMLQILIDA TDRELLASPA YEADASSFAS TRINHVLAYL GKNLANELRE TDLARLAGQS VSAFSHYFRR HTGLPFVQYV NRMRINLACQ LLMDGDASIT DICFRSGFNN LSNFNRQFLA VKGMSPSRFR RYQALNDASR EASEAAAQRG AGIAGAPAIV PAARARGEAR APAEVLLSG
|
| |