Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A2120 |
Symbol | |
ID | 3692876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 2583111 |
End bp | 2584130 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637732374 |
Product | LysR family regulatory protein |
Protein accession | YP_337271 |
Protein GI | 76818532 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | [TIGR03418] putative choline sulfate-utilization transcription factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.246446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCGT CGGATCGGAA CGGCGGCGCG ACAGGCTGCC GCCGCCGGCC GGATCTCACT ATTGCCGGCC GTAATGCGGC CGTGAAGGCA TCGAATGCTT ATGAGCCATC AGAGGTGCTT ATGACGAGAC CAGACCGTCT GCCGCCGATG CAGACCTTGT CCGCGTTCGA AGCGGCCGCG CGCCTCGCGA GCTTCACGGC CGCCGCGCGC GAGCTCGGCT CGACGCAGCC GGCCGTCAGC CAGCGCGTCG TTCAGCTCGA AGAGGATCTC GGCACGCCGC TCTTCGAGCG CGGGCGCCGC GGCGTCACGC TGACCGAGGA CGGCGCGCGG CTCTTCGCGG CGGTCCGCCA AAGCCTCGAC GCGCTGCGCG CCGCGACGGC GGACATCCGC AGCCGCCGCG CGAACGGCAC ATTCACGCTC GTCACCGATT TCGGCTTCGC CACCTACTGG CTGATGCCGC GCCTCGACGA TCTGAAGCGC GCGATGCCCG GCGTCGACGT GCGCGTCGTC ACGTCTCAGG ACATCGATCC GCAGCGCGAG CACGCCGACG TCGCGATCCT GTTCGGCGCC GGCGACTGGC CGGGCTGCAC ATCGACGCGG CTCTTTCAGG AGCACGTGAC GCCCGTGTGC TCGCCCGCGT TTCGCACCGC GCATGCCGAT ATCGCGCGGC CGGCCGACCT GTTGCGCGCG CCGCTGCTGC ACGTGCAGCC GACGCGCCCC GAGCGCTGGC TCGCGTGGCG CGATTGGTTC GACGCGCATG GGCTCGCCGC GCCGCCCGAG CCGCACGGGC TGACGTTCAA CAGCTACTCG CTCGTGATTC AAGCGGCGCT GATGAATCAG GGTGTCGCGC TCGGCTGGGC GCCGCTCGTC GACACGCCGA TCGCGGCCGG CCAGCTCGTG CGGCTCGTCG ACGCGCCCGT CGTCACGCCG CGCGGCTACT ACCTCGTTCT GCCGCCCGCG CGGCCGGAGG CGCGCGCGGT GCCGCTCTTT CGCCGCTGGC TGCTCGGCGC ATGCGCATGA
|
Protein sequence | MESSDRNGGA TGCRRRPDLT IAGRNAAVKA SNAYEPSEVL MTRPDRLPPM QTLSAFEAAA RLASFTAAAR ELGSTQPAVS QRVVQLEEDL GTPLFERGRR GVTLTEDGAR LFAAVRQSLD ALRAATADIR SRRANGTFTL VTDFGFATYW LMPRLDDLKR AMPGVDVRVV TSQDIDPQRE HADVAILFGA GDWPGCTSTR LFQEHVTPVC SPAFRTAHAD IARPADLLRA PLLHVQPTRP ERWLAWRDWF DAHGLAAPPE PHGLTFNSYS LVIQAALMNQ GVALGWAPLV DTPIAAGQLV RLVDAPVVTP RGYYLVLPPA RPEARAVPLF RRWLLGACA
|
| |