Gene BURPS1106A_A1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1600 
Symbol 
ID4904413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1568457 
End bp1569938 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content74% 
IMG OID640144706 
ProductGntR family regulatory protein 
Protein accessionYP_001075634 
Protein GI126456808 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATCTTC AGATCAGCCT CCGCGATCGC CGCGACCTCG CCGGCCAGAT CTATCGGCAA 
CTGCGCGCGG CCATCGTCGA CGGACGCCTC GCCGCCGGCG CGCGGCTGCC TTCGACGCGC
GAGCTCGCGC GCCAGCTCGG CGTATCGCGC AAGACGACGC TCGACGCGTT CGAGCGCCTT
GCGAGTGAAG GCTTTCTGCT CGCGCGCACC GGCAACGGCA CGTTCGTCGC GGGCGGCCTG
ACGCGCGTGC CCGGCGCGCG CGCGCCCGCG GCGGACGGCT CCGGCACGCG GGCGCAGGCG
CCGCGCGCCG ATGGCCGCGA CGCCGCGCTC GCCCGTGCGC ACGCGGGCTG GTCCGACGTG
CCGGCCGGGC TGTCGATGCC GCCGCCGCAC GCGCCGCTCG CGTTCGACTT CCGCGGCGGC
GTCACCGACA AGGTGCATTT TCCATGGGAC GACTGGCGGC GCAGCATCCA GCACGCGCTG
CGCGCGCAGG CGCGCAGCAG CGGCGACTAT CGCGACCCCG CGGGCGAGCC CGAGCTGCGC
GACGCGATCG CGCGCTACGT CGGCTTCAGC CGCGCGGTGG CGTGCGACTG GCGGGACGTG
ATCGTCACGC AGGGCGCGCA GCAGGCGCTC GATCTGCTCG CACGCGTGCT CGTGCGCCCG
GGCGACATCG TCGCCGTCGA AGAGCCCGGC TACCCACCCG CCCGCGCGGC GTTCGCGGCG
CTCGGCGCGT GCGTGGTCGG CGTGCCTGTC GACGCGCACG GGCTCGGCGT CGCGCAACTG
CCGGACGACG CGCGCTTCGT CTACGTGACG CCGTCGCACC AGTTCCCGCT CGGCATGCCG
ATGAGCCTCG AGCGGCGCGT CGCGCTGCTC GAATGGGCGC AGCGGCGGCG CGCGGTGATC
GTCGAAGACG ATTACGACTG CGAATTCCGC TTCGAGGGCC GGCCGCTGGA GCCGCTGAAA
AGCCTCGATC GCGCGGGGCT CGTCGCATAC GTCGGCACGT TCTCGAAGAC GCTGTTTCCG
GAACTGCGCA TCGGCTACGC GGTGCCGCCG CGCTCGCTGA ACCTCGCGTT GCGCAAGGCC
AAGCAGATCG TCGACTGGCA CGCGTGCACG CTCACGCAGA CGGCGCTCGC GCGGTTCATC
GCCGACGGCG CGTTCGCCCG CCATCTGCGG CGCATCCATC GCCGCTACGA CGCGCGCCGC
CGGCTGCTCG CCGCGCACCT GCACGGCGCG CTGTCGCACT GGCTGCGGCC CGTGATTCCC
GCGGCCGGCA TTCATCTGAC CGCGCGCTTC GCCACGGGTA TCGACGAGCG CGCGGCGCTC
GACGCCGCCC GCGACGCCGA CATCGACCTG CACGGCATCT CGGCGTTCTA CGCGGACAGC
CCGGCCGAGT CGGGCATCCT GTTCGGGTAT GGCGGCATCG ACGCGCCGGC GATCGACGTC
GCGCTCACAC GGCTGGCGGG CGTGTTGAAG CGATGCGCGT GA
 
Protein sequence
MDLQISLRDR RDLAGQIYRQ LRAAIVDGRL AAGARLPSTR ELARQLGVSR KTTLDAFERL 
ASEGFLLART GNGTFVAGGL TRVPGARAPA ADGSGTRAQA PRADGRDAAL ARAHAGWSDV
PAGLSMPPPH APLAFDFRGG VTDKVHFPWD DWRRSIQHAL RAQARSSGDY RDPAGEPELR
DAIARYVGFS RAVACDWRDV IVTQGAQQAL DLLARVLVRP GDIVAVEEPG YPPARAAFAA
LGACVVGVPV DAHGLGVAQL PDDARFVYVT PSHQFPLGMP MSLERRVALL EWAQRRRAVI
VEDDYDCEFR FEGRPLEPLK SLDRAGLVAY VGTFSKTLFP ELRIGYAVPP RSLNLALRKA
KQIVDWHACT LTQTALARFI ADGAFARHLR RIHRRYDARR RLLAAHLHGA LSHWLRPVIP
AAGIHLTARF ATGIDERAAL DAARDADIDL HGISAFYADS PAESGILFGY GGIDAPAIDV
ALTRLAGVLK RCA