Gene BURPS1106A_A0825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0825 
Symbol 
ID4904793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp821593 
End bp822720 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content70% 
IMG OID640143931 
ProductAraC family transcriptional regulator 
Protein accessionYP_001074861 
Protein GI126455915 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.707047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCTC CGCTCAATTA CACTGATCGT TTTTGCCATA CCGCGCCGCA GCGCGCACCC 
GCGATGAAGC ACGAAGAAAA GAAAGGCACC GTTTCGATCG AGCTCGTCGA GTCGAGCCTC
GCGCTGTCGC GGCGGCGCGG CGTCGACGAC GCTTCGCTCC TCGCGCAGGC GGGCATTGCC
GGCGCGCTGC TCGCGCAGCC GAACGCGCGC GTGTCCGCGC GGCAGTACGG CGCGCTGTGG
AACGCGATCG CGCGCGCGCT CGACGACGAG TTCTTCGGCC AGGACTCACA CCCGATGCGC
TGCGGCAGCT TCATCGCGAT GAGCCAGGCG GCGCTCACCG CGCGCAACGG GCTGCGCGCG
CTCGCGCGCG CGGTCAACTT CATGCACTGC GTGCTCGACG ATCTGCACGC GCAGCTCGAC
GCGAGCGCCG AGCGCGTACG GCTGCGCTTC GTGCATCGCA ACAGCGCGAA CCCGCCGGAG
ATGTTCGCGT ATGCGACCTA TTTCGTCATC GTCTACGGCC TCACGTGCTG GCTGATCGGG
CGGCGCATTC CGCTGCTGCA CGCGAGCTTT CGCTGCGGCG AGCCGCGCGC GGTCCACGAA
TATCGGCTGA TGTTCTGCGA CGACATGCGT TTCGACGAGC CCGATTCGTA TGTCGATTTC
GATCCGGCGT TCGCCGCGCT GCCCATCGTG CAGACGGCGC AGACGCTCAA GCCGTTCCTG
CGCGACGCGC CCGCGAGCTT CATCGTCAAG TATCGCAACC CGCACGCGCT CGGCGAGCGC
GTGCGCGCGG CGCTGCGCGC GCTGCCGCCC GCCGCGTGGC CGACCGCGCG CGCGCTCGCC
GCGCGGCTGC ACGTGGCCGA GGCGACGCTG CGGCGCAAGC TGAAGCAGGA AGGCCATTCG
TATCAATCGA TCAAGGACGC GCTGCGGCGC GATCTCGCGT GCGAGGCGCT CGCCGATCCG
GCCCGCACGG TCGCCGACGT CGCCGCGGCG ACGGGCTTCG CCGAGCCGAG CGCGTTCTAC
CGCGCGTTTC GCAAGTGGCG CGGGATGAGC CCCGCCGACT ACCGCGACGC CGCGCTCGCC
GCGCGCGCGG CCGCTTCGCG CTTTCGCCGG AAACCGCCTA CTCTTTAA
 
Protein sequence
MLAPLNYTDR FCHTAPQRAP AMKHEEKKGT VSIELVESSL ALSRRRGVDD ASLLAQAGIA 
GALLAQPNAR VSARQYGALW NAIARALDDE FFGQDSHPMR CGSFIAMSQA ALTARNGLRA
LARAVNFMHC VLDDLHAQLD ASAERVRLRF VHRNSANPPE MFAYATYFVI VYGLTCWLIG
RRIPLLHASF RCGEPRAVHE YRLMFCDDMR FDEPDSYVDF DPAFAALPIV QTAQTLKPFL
RDAPASFIVK YRNPHALGER VRAALRALPP AAWPTARALA ARLHVAEATL RRKLKQEGHS
YQSIKDALRR DLACEALADP ARTVADVAAA TGFAEPSAFY RAFRKWRGMS PADYRDAALA
ARAAASRFRR KPPTL