Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0752 |
Symbol | |
ID | 4905498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 744407 |
End bp | 745285 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640143858 |
Product | LysR family transcriptional regulator |
Protein accession | YP_001074788 |
Protein GI | 126456587 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | [TIGR03418] putative choline sulfate-utilization transcription factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACCT TGTCCGCGTT CGAAGCGGCC GCGCGCCTCG CGAGCTTCAC GGCCGCCGCG CGCGAGCTCG GCTCGACGCA GCCGGCCGTC AGCCAGCGCG TCGTTCAGCT CGAAGAGGAT CTCGGCACGC CGCTCTTCGA GCGCGGGCGC CGCGGCGTCA CGCTGACCGA GGACGGCGCG CGGCTCTTCG CGGCGGTCCG CCAAAGCCTC GACGCGCTGC GCGCCGCGAC GGCGGACATC CGCAGCCGCC GCGCGAACGG CACATTCACG CTCGTCACCG ATTTCGGCTT CGCCACCTAC TGGCTGATGC CGCGCCTCGA CGATCTGAAG CGCGCGATGC CCGGCGTCGA CGTGCGCGTC GTCACGTCTC AGGACATCGA TCCGCAGCGC GAGCACGCCG ACGTCGCGAT CCTGTTCGGC GCCGGCGACT GGCCGGGCTG CACATCGACG CGGCTCTTTC AGGAGCACGT GACGCCCGTG TGCTCGCCCG CGTTTCGCAC CGCGCATGCC GATATCGCGC GGCCGGCCGA CCTGTTGCGC GCGCCGCTGC TGCACGTGCA GCCGACGCGC CCCGAGCGCT GGCTCGCGTG GCGCGATTGG TTCGACGCGC ATGGGCTCGC CGCGCCGCCC GAGCCGCACG GGCTGACGTT CAACAGCTAC TCGCTCGTGA TTCAAGCGGC GCTGATGAAT CAGGGTGTCG CGCTCGGCTG GGCGCCGCTC GTCGACACGC CGATCGCGGC CGGCCAGCTC GTGCGGCTCG TCGACGCGCC CGTCGTCACG CCGCGCGGCT ACTACCTCGT CCGGCCGCCC GCGCGGCCGG AGGCGCGCGC GGTGCCGCTC TTTCGCCGCT GGCTGCTCGG CGCATGCGCA TGCGCATGA
|
Protein sequence | MQTLSAFEAA ARLASFTAAA RELGSTQPAV SQRVVQLEED LGTPLFERGR RGVTLTEDGA RLFAAVRQSL DALRAATADI RSRRANGTFT LVTDFGFATY WLMPRLDDLK RAMPGVDVRV VTSQDIDPQR EHADVAILFG AGDWPGCTST RLFQEHVTPV CSPAFRTAHA DIARPADLLR APLLHVQPTR PERWLAWRDW FDAHGLAAPP EPHGLTFNSY SLVIQAALMN QGVALGWAPL VDTPIAAGQL VRLVDAPVVT PRGYYLVRPP ARPEARAVPL FRRWLLGACA CA
|
| |