Gene BURPS1106A_A2364 details

Gene Information

Locus tag: BURPS1106A_A2364
Symbol:
ID: 4905113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2340167
End bp: 2341687
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 69%
IMG OID: 640145469
Product: GntR family transcriptional regulator
Protein accession: YP_001076397
Protein GI: 126455967
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
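
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is only an illustrative check, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is not translated into an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(2340167, 2341687)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1521 bp) and Protein Length (506 aa) fields.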


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0684851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCCTCT CCGACTGGCT TGCCGCGCGA CTCGAGCGGC ACTCGGCCGA GCCGATGTAC 
CGGCAACTGC TGCGGCTGAT GCAGCAGGCG ATCCTCTCGG GGGAGCTCAC GCCCGGCACG
AAGCTGCCGA GCTCGCGCAC GCTCGCCACC GATCTCGCGA TTGCGCGCAA CACCGTGCTG
CACGTGTACG ACCAACTGAC GACGGAAGGC TACGTGCTGA CGACGACGGG CAGCGGCACC
TACGTCGCCG ACACGCGCCC CGATGCAGCG GCGATGCGCG CGCAAGCCGG CGCGCCCGCG
CATCCGTCAG ACGAAACGCG CGACGCGCAG CAGCGCGCGC ACGGCGCGCT GTCGGCGCGC
GGCGCGCAAC TGATCCGGCA TGCGGGCGTG TCGCGCCGCC AGTGGGGCGC GTTCATGCCG
GGCGTGCCCG ACGTGTCCGA GTTTCCGACG CGTACATGGA GCCGCCTGCA GGCGCGGCTA
TGGAAGGAAG CGAATCCGGA GCTGCTGACC TATGCGCCGG GCGGCGGCTA CCGGCCGTTG
CGGCGCGCGC TGTCCGACTA CCTGCGCGTC GCGCGCTCGG TCAAATGCTC GCCCGATCAG
ATCATCATCA CGACGGGCAT TCATCAGTCG ATCGACCTGT CGGTGCGCCT GCTCGCCGAC
GTCGGCGATC GCGCGTGGGT CGAGGAGCCG TGCTACTGGG GCGTGCGCAG CGTGCTGCAG
GCGGCGGGGC TCGCACTCGC GCCCGTGCCC GTCGATCAGG AAGGGCTCGC GCCGCGCGCG
CAGGACCTGA AGCGCCCGCC GCGGCTCGTG CTCGTCACGC CGTCGCATCA ATATCCGCTC
GGCATGGTGA TGAGCCTCGC GCGGCGCCGG ATGCTGCTCG AATATGCGCG GCAGCATCAA
TGCTGGATCA TCGAAGACGA CTACGACAGC GAGTTCCGCT ACGGCAGCCG TCCGCTCGCG
TCGCTACAGG GGCTCGACGA CGCGGGCCGC GTGATCTACG TCGGCAGTCT CGGCAAGATG
CTGTTCCCGA GCCTGCGCCT CGGCTACATG GTCGTGCCCG AGCACCTCGT CGAGACCTTC
CGGACCGGGC TGTCGGAGCT GTATCGCGAA GGGCAACTGA TGCAGCAGGC GGTGCTCGCC
GAATTCATCA TGGACGGCTA TCTGACGTCG CACGTGCGCC GGATGCGCGC GCTGTACGGC
GAGCGCCGCC AGTTGCTGAT CGACGCGATC CACTCGCGCT TCGGCGATGC GCTGCCGGTG
ATGGGCGACG AGGCGGGCCT GCACCTCGTG ATCGGATTGC CGAACGGCTG CGACGACCGG
GCGATCACGC AGACCGCGTT CGACGCGGGG GTGATCGTGC GCCCGCTCAC GACGTACTAC
AACCATGCGG ATACCGCGCG CGAGGGATTG CTGCTCGGCT ACGCATGCGT GCCGAACGAG
CGCATCGCGC CCGCGTTCGA GACGCTCGCG CAGATAATCG AAGCGCATCT GAATCGGCGT
GCGAAGCAAC GGGCCGCGTG A
 
Protein sequence
MILSDWLAAR LERHSAEPMY RQLLRLMQQA ILSGELTPGT KLPSSRTLAT DLAIARNTVL 
HVYDQLTTEG YVLTTTGSGT YVADTRPDAA AMRAQAGAPA HPSDETRDAQ QRAHGALSAR
GAQLIRHAGV SRRQWGAFMP GVPDVSEFPT RTWSRLQARL WKEANPELLT YAPGGGYRPL
RRALSDYLRV ARSVKCSPDQ IIITTGIHQS IDLSVRLLAD VGDRAWVEEP CYWGVRSVLQ
AAGLALAPVP VDQEGLAPRA QDLKRPPRLV LVTPSHQYPL GMVMSLARRR MLLEYARQHQ
CWIIEDDYDS EFRYGSRPLA SLQGLDDAGR VIYVGSLGKM LFPSLRLGYM VVPEHLVETF
RTGLSELYRE GQLMQQAVLA EFIMDGYLTS HVRRMRALYG ERRQLLIDAI HSRFGDALPV
MGDEAGLHLV IGLPNGCDDR AITQTAFDAG VIVRPLTTYY NHADTAREGL LLGYACVPNE
RIAPAFETLA QIIEAHLNRR AKQRAA
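
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used to produce this record: the helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, and `START_CODONS` is a simplified subset of the initiation codons allowed by translation table 11. Table 11 uses the standard codon assignments, but a recognized start codon such as GTG is read as methionine at the first position.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a recognized start codon as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons[0] in START_CODONS:
        protein[0] = "M"  # initiator codon is translated as methionine
    if protein[-1] == "*":
        protein.pop()  # the stop codon contributes no residue
    return "".join(protein)


# First seven codons of the gene sequence in this record.
print(translate_cds("GTGATCCTCT CCGACTGGCT T"))  # MILSDWL
```

The printed prefix matches the start of the protein sequence above, where the leading GTG codon corresponds to the initial M. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.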