Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_1919 |
Symbol | |
ID | 4899722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1873815 |
End bp | 1874804 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640135149 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001066184 |
Protein GI | 126452445 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.480868 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGCCCG ATCTCGAAGT CGTCGACGTG CGCCGCGGCG AGTCGTTCAA GGCGTGGGCG CATGGCTACC CGTACCGCAC GGTGCGGTGG CACTTTCATC CGGAGTTCGA AGTGCATCTG ATCGTCGAGA CGACGGGGCA GATGTTCGTC GGCGATTATG TCGGCGGCTT CGGCCCCGGC AATCTCGTGC TGATGGGACC GAACCTGCCG CACAACTGGG TGAGCGACGT GTCGGAAGGC AGGACGATCG CCGAGCGCAA TCTCGTCGTG CAGTTCGGCC AGGCGTTCGT GTCGCGCTGC GCGGACAGCT TCACCGAATG GCGGCACGTC GAGGCGCTGC TCGCCGACGC GCGCCGCGGC GTGCAGTTCG GCCCGCGCAC GAGCGAGGCG ATCAAGCCGC TCTTCGCCGA GCTGATTCAC GCGCGGGGGT TGCGGCGCAT CGTGCTGTTC CTGTCGATGC TGCAGATCCT CATCGACGCG ACCGACCGCG AGCTGCTCGC GAGCCCCGCG TACGAAGCCG ATGCGTCGAG CTTCGCGTCG ACGCGCATCA ACCACGTGCT CGCGTACCTC GGCAAGAACC TCGCGAACGA GCTGCGCGAG ACCGATCTCG CGCGGCTCGC CGGGCAGAGC GTGAGCGCGT TCTCGCATGA CTTCCGGCGG CACACCGGTT TGACGTTCGT CCAGTACGTG AACCGGATGC GGATCAATCT CGCGTGCCAA CTGCTGATGG ACGGCGACGC GAGCATCACC GACATCTGCT TCAGGAGCGG CTTCAACAAC CTGTCGAACT TCAATCGCCA GTTCCTCGCG GTGAAGGGCA TGTCGCCGTC GCGCTTTCGC CGCTATCAGG CGCTGAACGA CGCGAGTCGC GAGGCGTCCG AGGCCGCCGC GCTGCGCGGC GCGGGCATCG CCGGCGCGCC GGCGATCGTG CCGGCCGCGC GGGCGCGCGG CGAGGCGCGT GCGCCCGCCG AAGTCCTGCT GCCCGGCTGA
|
Protein sequence | MQPDLEVVDV RRGESFKAWA HGYPYRTVRW HFHPEFEVHL IVETTGQMFV GDYVGGFGPG NLVLMGPNLP HNWVSDVSEG RTIAERNLVV QFGQAFVSRC ADSFTEWRHV EALLADARRG VQFGPRTSEA IKPLFAELIH ARGLRRIVLF LSMLQILIDA TDRELLASPA YEADASSFAS TRINHVLAYL GKNLANELRE TDLARLAGQS VSAFSHDFRR HTGLTFVQYV NRMRINLACQ LLMDGDASIT DICFRSGFNN LSNFNRQFLA VKGMSPSRFR RYQALNDASR EASEAAALRG AGIAGAPAIV PAARARGEAR APAEVLLPG
|
| |