Gene BURPS1710b_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1023 
SymboldarR 
ID3690151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1071970 
End bp1073007 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content69% 
IMG OID637727479 
ProductAraC family transcription regulator 
Protein accessionYP_332435 
Protein GI76808651 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.243872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCGC AAACTCCCCT CCGGCATCGG ACGACGACGA CCGTCGATGT CGTGATCTAT 
CCGGGATTCA AGGCGATCGA GGCCGTCGGC GTCATCAACG TGTTCGACTA CGCGAACGCG
CGGCTCGCCG CCGCGGGGCT CGCGCCCGTC TACGATCTCC AGATTGCCGC GCCCGCGAAG
GGCGCGGTCA AGTCCGACAC CCTCATCGTG CTCGAGGCGA CGAAGGCGCT CGACACGCTC
GCGGTGCCCG ACACGGCGAT CGTCGTCGGC GCGCGCGACA TCGAGCGGGC GCTGCGCGAC
ACGTCGATGC TCGTCGGATG GTGCCGCGAC GTGTCCGCGC GCATCGGCCG GATGGTCGGG
CTGTGCTCGG GCTGCTTCTT TCTCGCCGAA GCCGGCATGC TGGACGGCCG GCGCGCGACG
ACGCACTGGA GCGTCGCCCC CCTGTTGCGG GCGCGTTATC CGGCGGTGAA GGTGGAGCCC
GACGCGATCT TCGTTCGCGA GGGCAACGTG TGGACGTCGG CGGGCGTCAC GGCCGGCCTC
GATCTCGCGC TCGCGATGGT CGAGGAGGAT CTCGGTCGCG AGATCGCGCT CGCCGTCGCG
CGCGATCTCG TGATTTACCT GAAGCGGCCG GGCGGCCAGT CGCAGTTCAG CGTGTACCTG
GCGAGCCAGA TGACCGCGCA CGCGTCGATC CGCGACATTC AGGACTGGAT TCTGAACGCG
CTCGACGCGC GGCTGAGCAT CGCGCAGCTC GCCAGGCGCG CCGCGATGAG CGAGCGCAAC
TTCATTCGCG TGTTCGTGCG CGAAACCGGC TATCGTCCGG CCGAATTCAT CGAAATCGCG
CGGCTCGAAA AAGCGCGCCG CCTGCTCGAG CAGGAAGCGC TGCCGCTGAA GACGGTGGCC
GTGCGCAGCG GGTTTCGTTC CGACGACCAA TTGCGGCGCG TGTTCATGCG CCGCCTCGGC
GTGACGCCCG GCGCGTATCG CGAGCGGTTC TCCGGCACCG GCGTGCGCGA AGCGCGGGGG
AGCGGCGACG TGGATTGA
 
Protein sequence
MAAQTPLRHR TTTTVDVVIY PGFKAIEAVG VINVFDYANA RLAAAGLAPV YDLQIAAPAK 
GAVKSDTLIV LEATKALDTL AVPDTAIVVG ARDIERALRD TSMLVGWCRD VSARIGRMVG
LCSGCFFLAE AGMLDGRRAT THWSVAPLLR ARYPAVKVEP DAIFVREGNV WTSAGVTAGL
DLALAMVEED LGREIALAVA RDLVIYLKRP GGQSQFSVYL ASQMTAHASI RDIQDWILNA
LDARLSIAQL ARRAAMSERN FIRVFVRETG YRPAEFIEIA RLEKARRLLE QEALPLKTVA
VRSGFRSDDQ LRRVFMRRLG VTPGAYRERF SGTGVREARG SGDVD