Gene BURPS668_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1424 
Symbol 
ID4882382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1391125 
End bp1392666 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content74% 
IMG OID640127352 
ProductGntR family transcriptional regulator 
Protein accessionYP_001058467 
Protein GI126439073 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGAC GCGCCCGCAT CATCGAGATT CCGTCGCTCG GCATGCTCGA GCGCACGGCC 
GGCAACCTGA GCCGGCAACT GGCGCAGGCG CTGCGGGACG CCGTTCGCCG CGGCGAGGTG
ATGCCGGGCG ACGCGCTGCC GTCCACGCGG CTGCTCGCCG CTTCGCTGCG CATCGCGCGC
GGCACGGTGA TCGACGCCTA CGAGCAGCTG ATTGCGGAAG GGTTTCTGGA GTCGCGCGGC
GGGGTGGGCA CGCGCGTCGC GCCCGCGCTC GCCGAGCCGC GCGGCGCGCG CCGCCGCGCG
GCCGCGGCGG CGCGTTCGGC CCTGCCGCCG CCCGCCGCCG AGTATGCGCG CGTCGCGCGC
GAGTTCGCGC CGTTGCCGCC GGCGCCGTTC GCGATTTCGG TGCCGGCCGG CGCGACGGCG
CCCGACGACG TGTGGCGCCG CCTCGGCAAC CGCCTGCGGG CGAGGGGGCC CGCCGCGCCG
GCCGGCTACT CGGATCCGCT CGGCGTGCGG GCGTTGCGCG AGGCGATCGC CGGCTACGTG
CGCAAGTCGC GCTCCGTGCA TTGCGCGCCC GATCAGATCA TCGTCACGAG CGGCGCGCAG
CAGGGGCTCT ATCTCGCGTG CCAGGTGCTG CTGGGCGCGC ACGATCGCGC GTGGGTCGAA
AATCCCGCGT ATCGCGGGCT CACCGCGATT CTCGAATGCA CGGGGCGGCG CGACGCGATG
GTGCGCGTGC CGGTCGACGC GGAGGGCATC GACGTCGATG CGGGCGTCCG GCTCGCGCGC
GATGCGCGCG CGGCGTTCGT CACGCCGTCG CATCAATATC CGCTCGGCAT GCCGATGAGC
ATGGCGCGGC GCGCCGCGCT GCTCGCATGG GCGCGCGCGA GCGGCGCATG GGTGGTCGAG
GACGATTACG ACAGCGAGCT GCGCTACGAG GGCTATCCGT TTCCGTCGCT GCAGGGGCTC
GATCCGGCGC GCGTCGTCTA TCTCGGCACG TTCAGCAAGA TCCTGTTTCC GTCGTTGCGG
CTCGGTTATC TGATCGTGCC GGACGAACTG GTCGACGCGT TGCGCGGCGC GCGCGTGCTG
ATGGATCGAC ACGCGCCGAC CGCGGACCAG CACGTGCTCG CGGCCTTCAT CGCCGGCGGG
CATTTCGATC GCCACATTCG CCGCGTGCGA GGCGTGTATG CGGAGCAGCG CGCGCAACTG
ATCGATACGG TCGGCAGGCT GCTGTCGGGC GATCTCGCGT GGCTGCAGCC GGGCGATCAG
GGGATGCACG CGGTGCTCTG GCTCGCGGCG GGCGTCGACG ACCTGCGCGT TGCGGCGATG
GCCGCGCAGG CGGGCGTCGC GGTCCGCCCG GTGTCGCCGA TGTTCGCGCC GGGCACGGCA
CGCCCGGGCC TCGTGCTCGG CTTCGGCGGC TTCGGCCGCG AGCAGATGGA CGCGGCCGCG
CGCCGGCTCG CCGAGGTGAT CGCCGCGGCG AGCGGCTCGG CGGTGCCGCG TGCCGGGGCC
GGGCGTCGGC GTCGGGATGG CGCCGGCGGA ACCGACGCGT GA
 
Protein sequence
MARRARIIEI PSLGMLERTA GNLSRQLAQA LRDAVRRGEV MPGDALPSTR LLAASLRIAR 
GTVIDAYEQL IAEGFLESRG GVGTRVAPAL AEPRGARRRA AAAARSALPP PAAEYARVAR
EFAPLPPAPF AISVPAGATA PDDVWRRLGN RLRARGPAAP AGYSDPLGVR ALREAIAGYV
RKSRSVHCAP DQIIVTSGAQ QGLYLACQVL LGAHDRAWVE NPAYRGLTAI LECTGRRDAM
VRVPVDAEGI DVDAGVRLAR DARAAFVTPS HQYPLGMPMS MARRAALLAW ARASGAWVVE
DDYDSELRYE GYPFPSLQGL DPARVVYLGT FSKILFPSLR LGYLIVPDEL VDALRGARVL
MDRHAPTADQ HVLAAFIAGG HFDRHIRRVR GVYAEQRAQL IDTVGRLLSG DLAWLQPGDQ
GMHAVLWLAA GVDDLRVAAM AAQAGVAVRP VSPMFAPGTA RPGLVLGFGG FGREQMDAAA
RRLAEVIAAA SGSAVPRAGA GRRRRDGAGG TDA