Gene BURPS1710b_A2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2120 
Symbol 
ID3692876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2583111 
End bp2584130 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content72% 
IMG OID637732374 
ProductLysR family regulatory protein 
Protein accessionYP_337271 
Protein GI76818532 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID[TIGR03418] putative choline sulfate-utilization transcription factor 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.246446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCGT CGGATCGGAA CGGCGGCGCG ACAGGCTGCC GCCGCCGGCC GGATCTCACT 
ATTGCCGGCC GTAATGCGGC CGTGAAGGCA TCGAATGCTT ATGAGCCATC AGAGGTGCTT
ATGACGAGAC CAGACCGTCT GCCGCCGATG CAGACCTTGT CCGCGTTCGA AGCGGCCGCG
CGCCTCGCGA GCTTCACGGC CGCCGCGCGC GAGCTCGGCT CGACGCAGCC GGCCGTCAGC
CAGCGCGTCG TTCAGCTCGA AGAGGATCTC GGCACGCCGC TCTTCGAGCG CGGGCGCCGC
GGCGTCACGC TGACCGAGGA CGGCGCGCGG CTCTTCGCGG CGGTCCGCCA AAGCCTCGAC
GCGCTGCGCG CCGCGACGGC GGACATCCGC AGCCGCCGCG CGAACGGCAC ATTCACGCTC
GTCACCGATT TCGGCTTCGC CACCTACTGG CTGATGCCGC GCCTCGACGA TCTGAAGCGC
GCGATGCCCG GCGTCGACGT GCGCGTCGTC ACGTCTCAGG ACATCGATCC GCAGCGCGAG
CACGCCGACG TCGCGATCCT GTTCGGCGCC GGCGACTGGC CGGGCTGCAC ATCGACGCGG
CTCTTTCAGG AGCACGTGAC GCCCGTGTGC TCGCCCGCGT TTCGCACCGC GCATGCCGAT
ATCGCGCGGC CGGCCGACCT GTTGCGCGCG CCGCTGCTGC ACGTGCAGCC GACGCGCCCC
GAGCGCTGGC TCGCGTGGCG CGATTGGTTC GACGCGCATG GGCTCGCCGC GCCGCCCGAG
CCGCACGGGC TGACGTTCAA CAGCTACTCG CTCGTGATTC AAGCGGCGCT GATGAATCAG
GGTGTCGCGC TCGGCTGGGC GCCGCTCGTC GACACGCCGA TCGCGGCCGG CCAGCTCGTG
CGGCTCGTCG ACGCGCCCGT CGTCACGCCG CGCGGCTACT ACCTCGTTCT GCCGCCCGCG
CGGCCGGAGG CGCGCGCGGT GCCGCTCTTT CGCCGCTGGC TGCTCGGCGC ATGCGCATGA
 
Protein sequence
MESSDRNGGA TGCRRRPDLT IAGRNAAVKA SNAYEPSEVL MTRPDRLPPM QTLSAFEAAA 
RLASFTAAAR ELGSTQPAVS QRVVQLEEDL GTPLFERGRR GVTLTEDGAR LFAAVRQSLD
ALRAATADIR SRRANGTFTL VTDFGFATYW LMPRLDDLKR AMPGVDVRVV TSQDIDPQRE
HADVAILFGA GDWPGCTSTR LFQEHVTPVC SPAFRTAHAD IARPADLLRA PLLHVQPTRP
ERWLAWRDWF DAHGLAAPPE PHGLTFNSYS LVIQAALMNQ GVALGWAPLV DTPIAAGQLV
RLVDAPVVTP RGYYLVLPPA RPEARAVPLF RRWLLGACA