Gene BURPS668_A2503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2503 
Symbol 
ID4885886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2417373 
End bp2418893 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content69% 
IMG OID640132440 
ProductGntR family transcriptional regulator 
Protein accessionYP_001063497 
Protein GI126444443 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCCTCT CCGACTGGCT TGCCGCGCGA CTCGAGCGGC ACTCGGCCGA GCCGATGTAC 
CGGCAACTGC TGCGGCTGAT GCAGCAGGCG ATCCTCTCGG GGGAGCTCAC GCCCGGCACG
AAGCTGCCGA GCTCGCGCAC GCTCGCCACC GATCTCGCGA TCGCGCGCAA CACCGTGCTG
CACGTGTACG ACCAACTGAC GACGGAAGGC TACGTGCTGA CGACGACGGG CAGCGGCACC
TACGTCGCCG ACACGCGCCC CGATGCCGCG GCGATGCGCG CGCAAGCCGG CGCGCCCGCG
CATCCGTCAG ACGAAACGCG CGACGCGCAG CAGCGCGCGC ACGGCGCACT GTCGGCGCGC
GGCGCGCAAC TGATCCGGCA CGCGGGCGTG TCGCGCCGCC AGTGGGGCGC GTTCATGCCG
GGCGTGCCCG ACGTGTCCGA GTTTCCGACG CGTACATGGA GCCGCCTGCA GGCGCGGCTG
TGGAAGGAAG CGAATCCGGA GCTGCTGACC TATGCGCCGG GCGGCGGCTA CCGGCCGTTG
CGGCGCGCGC TGTCCGACTA CCTGCGTGTC GCGCGCTCGG TCAAATGCTC GCCCGATCAG
ATCATCATCA CGACGGGCAT TCATCAGTCG ATCGACCTGT CGGTGCGCCT GCTCGCCGAC
GTCGGCGATC GCGCGTGGGT CGAGGAGCCG TGCTACTGGG GCGTGCGCAG CGTGCTGCAG
GCGGCGGGGC TCGCACTCGC GCCCGTGCCC GTCGATCAGG AAGGGCTCGC GCCGCGCGCG
CAGGACCTGA AGCGCCCGCC GCGGCTCGTG CTCGTCACGC CATCGCATCA ATATCCGCTC
GGCATGGTGA TGAGCCTCGC GCGGCGCCGG ATGCTGCTCG AATATGCGCG GCAGCATCAA
TGCTGGATCA TCGAAGACGA CTACGACAGC GAGTTCCGCT ACGGCAGCCG TCCGCTCGCG
TCGCTACAGG GGCTCGACGA CGCGGGCCGC GTGATCTACG TCGGCAGTCT CGGCAAGATG
CTGTTCCCGA GCCTGCGCCT CGGCTACATG GTCGTGCCCG AGCACCTCGT CGAGACCTTC
CGGACCGGGC TGTCGGAGCT GTATCGCGAA GGGCAACTGA TGCAGCAGGC GGTGCTCGCC
GAATTCATCA TGGACGGCTA TCTGACGTCG CACGTGCGCC GGATGCGCGC GCTGTACGGC
GAGCGCCGCC AGTTGCTGAT CGACGCGATC CACGCGCGCT TCGGCGATGC GCTGCCGGTG
ATGGGCGACG AGGCGGGCCT GCACCTCGTG ATCGGATTGC CGAACGGCTG CGACGACCGG
GCGATCACGC AGACCGCATT CGACGCGGGG GTGATCGTGC GCCCGCTCAC GACGTACTAC
AACCATGCGG ATACCGCGCG CGAGGGATTG CTGCTCGGCT ACGCATGCGT GCCGAACGAG
CGCATCGCGC CCGCGTTCGA TACGCTCGCG CAGATAATCG AAGCGCATCT GAATCGGCGT
GCGAAGCAAC GGGCCGCGTG A
 
Protein sequence
MILSDWLAAR LERHSAEPMY RQLLRLMQQA ILSGELTPGT KLPSSRTLAT DLAIARNTVL 
HVYDQLTTEG YVLTTTGSGT YVADTRPDAA AMRAQAGAPA HPSDETRDAQ QRAHGALSAR
GAQLIRHAGV SRRQWGAFMP GVPDVSEFPT RTWSRLQARL WKEANPELLT YAPGGGYRPL
RRALSDYLRV ARSVKCSPDQ IIITTGIHQS IDLSVRLLAD VGDRAWVEEP CYWGVRSVLQ
AAGLALAPVP VDQEGLAPRA QDLKRPPRLV LVTPSHQYPL GMVMSLARRR MLLEYARQHQ
CWIIEDDYDS EFRYGSRPLA SLQGLDDAGR VIYVGSLGKM LFPSLRLGYM VVPEHLVETF
RTGLSELYRE GQLMQQAVLA EFIMDGYLTS HVRRMRALYG ERRQLLIDAI HARFGDALPV
MGDEAGLHLV IGLPNGCDDR AITQTAFDAG VIVRPLTTYY NHADTAREGL LLGYACVPNE
RIAPAFDTLA QIIEAHLNRR AKQRAA