Gene Information |
Locus tag | BURPS1106A_A1793 |
Symbol | |
ID | 4903346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1764086 |
End bp | 1765384 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640144899 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001075827 |
Protein GI | 126455961 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
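The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1764086
end_bp = 1765384

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS spans a whole number of codons; the last one is assumed to be the
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1299 0 432
```

The printed values match the listed Gene Length (1299 bp) and Protein Length (432 aa).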
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCCC GATATCCCGG CGATTCGAAT TATCGATTCG CACAGATTTT GTGCTTTAAC GCGGAAACGT ATTTTCGTGG TGCATCGCCC TGCACAAGAA AATATCGCGC ACGGCGATCC CGATTCGCGG TGCGCGCGCC TCAGGCTCGC GCGAGCGAAC CGCGCGCGGC GGTGCCGGAC GCCCGTTTCG CATCGCCGCG GCGATTCCCA CTAGAATGGT GTCCGTCGGC GCACGGCGCG CGAGCGGCGG CGACCGGCGC GGCCGCGCGG CACGGTATGA AACTTGCGTC TCATGGATGG CGCGCGCCCC GCGCACACAT GCCGACGACA TCGAAACACC CGCCGGCGTG CGCGGCGCCC AGCACGCGCA CGGAGCGGCC TCGCCCGCCG CCGGCTGTTT CGTTTCATCG GAATCGGGGT TCGACCGTGG CCAAGCTAGA CCATCGCAAC CAGTCGCGTT ACTGGCACTC TCCCGGCATT TCAGGGGTCG ATCTGTTGCT CGCCGACTTC ACGACGCACG ACTACGCGCC GCACGTGCAC GATTCGCTTG TCGTCGCCGT CACGGAAGTC GGCGGTTCGG TGTTCAAGAG CCGCGGGCAG ACGCGCCTCG CCGAGCCGAA CGCCGTGCTC GTGTTCAATC CGTGCGAGCC GCATTCGGGG CGCATGGGCG GCAGCAGCCG CTGGCGCTAC CGGTCGTTCT ACCTCGCGGA AGCGGGCCTT TCCCGCGTGC TGACGTTGCT CGGCATGGCG CAGCCGCGCT TTTTCACGTC GAACGTGCTC GACGATCCTC AGCTCGTCGA ACAGTTTCTC ACCCTGCACC GCGCGATGGA CGAGCAGGAC GATCTGCTGC GGCAGCAGGA ACTGCTCGTC AGCAGCTTCG GCACGCTGTT TTCGCGGCAC GGGCTCCAGG CCGGGCTCGG CGCCGGCCCC GGCTTCGGCA CGAAGGCGGG CCTGCCGGCG CTCAAGCCCG CGCTCGATCT GATGAACGAT TGCTTCGACC ACGCGCTCAC CCTCGAGCAG ATCGCGGCGG CGGCGGGCCT CACGTCGTTC CAGCTGATCA CCGCGTTCAA CCGCGTGATC GGCCTCACAC CGCACGCGTA CCTGAACCAG TTGAGGTTGC GCGCGGCGCT GCGCGAGCTG CAGGCCGGCC GCTCGCTCGC CGACGCCGCG CTGACATCGG GCTTCTACGA TCAAAGCGCG CTTTGCAACC ACTTCAAGCG CACGTTCGGG ATGACGCCGA TGCAGTACAC GCGCGCGCTC GCGCCTGGCA AGCGCGCGCT CGCGCCGATC GGAATCTGA
|
Protein sequence | MDARYPGDSN YRFAQILCFN AETYFRGASP CTRKYRARRS RFAVRAPQAR ASEPRAAVPD ARFASPRRFP LEWCPSAHGA RAAATGAAAR HGMKLASHGW RAPRAHMPTT SKHPPACAAP STRTERPRPP PAVSFHRNRG STVAKLDHRN QSRYWHSPGI SGVDLLLADF TTHDYAPHVH DSLVVAVTEV GGSVFKSRGQ TRLAEPNAVL VFNPCEPHSG RMGGSSRWRY RSFYLAEAGL SRVLTLLGMA QPRFFTSNVL DDPQLVEQFL TLHRAMDEQD DLLRQQELLV SSFGTLFSRH GLQAGLGAGP GFGTKAGLPA LKPALDLMND CFDHALTLEQ IAAAAGLTSF QLITAFNRVI GLTPHAYLNQ LRLRAALREL QAGRSLADAA LTSGFYDQSA LCNHFKRTFG MTPMQYTRAL APGKRALAPI GI
|
| |
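The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the permitted start codons), so the sketch builds the standard table. The helper names `translate` and `gc_percent` are hypothetical, and only the first three and last sequence blocks from the record are used as input; the full sequence can be pasted in the same way, since whitespace between blocks is ignored.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a + strand CDS, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final nine bases of the gene sequence above.
first_blocks = "ATGGACGCCC GATATCCCGG CGATTCGAAT"
last_codons = "GGAATCTGA"

print(translate(first_blocks))  # MDARYPGDSN
print(translate(last_codons))   # GI
```

The two outputs correspond to the first block and the last two residues of the listed protein sequence. Calling `gc_percent` on the full gene sequence gives a value to compare against the GC content field.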