Gene BURPS668_A0917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0917 
Symbol 
ID4888750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp896519 
End bp897646 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content70% 
IMG OID640130857 
ProductAraC family transcriptional regulator 
Protein accessionYP_001061916 
Protein GI126442564 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.323818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCTC CGCTCAATTA CACTGATCGT TTTTGCCATA CCGCGCCGCA GCGCGCACCC 
GCGATGAAGC ACGAAGAAAA GAAAGGCACC GTTTCGATCG AGCTCGTCGA GTCGAGCCTC
GCGCTGTCGC GGCGGCGCGG CGTCGACGAC GCTTCGCTCC TCGCGCAGGC GGGCATTGCC
GGCGCGCTGC TCGCGCAGCC GAACGCGCGC GTGTCCGCGC GGCAGTACGG CGCGCTGTGG
AACGCGATCG CGCGCGCGCT CGACGACGAG TTCTTCGGCC AGGACTCACA CCCGATGCGC
TGCGGCAGCT TCATCGCGAT GAGCCAGGCG GCGCTCACCG CGCGCAACGG GCTGCGCGCG
CTCGCGCGCG CGGTCAACTT CATGCACTGC GTGCTCGACG ATCTGCACGC GCAGCTCGAC
GCGAGCGCCG AGCGCGTACG GCTGCGCTTC GTGCATCGCA ACAGCGCGAA CCCGCCGGAG
ATGTTCGCGT ATGCGACCTA TTTCGTCATC GTCTACGGCC TCACGTGCTG GCTGATCGGG
CGGCGCATTC CGCTGCTGCA CGCGAGCTTT CGCTGCGGCG AGCCGCGCGC GGTCCACGAA
TATCGGCTGA TGTTCTGCGA CGACATGCGT TTCGACGAGC CCGATTCGTA TGTCGATTTC
GATCCGGCGT TCGCCGCGCT GCCCATCGTG CAGACGGCGC AGACGCTCAA GCCGTTCCTG
CGCGACGCGC CCGCGAGCTT CATCGTCAAG TATCGCAACC CGCACGCGCT CGGCGAGCGC
GTGCGCGCGG CGCTGCGCGC GCTGCCGCCC GCCGCGTGGC CGACCGCGCG CGCGCTCGCC
GCGCGGCTGC ACGTGGCCGA GGCGACGCTG CGGCGCAAGC TGAAGCAGGA AGGCCATTCG
TATCAATCGA TCAAGGACGC GCTGCGGCGC GATCTCGCGT GCGAGGCGCT CGCCGATCCG
GCCCGCACGG TCGCCGACGT CGCCGCGGCG ACGGGCTTCG CCGAGCCGAG CGCGTTCTAC
CGCGCGTTTC GCAAGTGGCG CGGGATGAGC CCCGCCGACT ACCGCGACGC CGCGCTCGCC
GCGCGCGCGG CCGCTTCGCG CTTTCGCCGG AAACCGCCTA CTCTTTAA
 
Protein sequence
MLAPLNYTDR FCHTAPQRAP AMKHEEKKGT VSIELVESSL ALSRRRGVDD ASLLAQAGIA 
GALLAQPNAR VSARQYGALW NAIARALDDE FFGQDSHPMR CGSFIAMSQA ALTARNGLRA
LARAVNFMHC VLDDLHAQLD ASAERVRLRF VHRNSANPPE MFAYATYFVI VYGLTCWLIG
RRIPLLHASF RCGEPRAVHE YRLMFCDDMR FDEPDSYVDF DPAFAALPIV QTAQTLKPFL
RDAPASFIVK YRNPHALGER VRAALRALPP AAWPTARALA ARLHVAEATL RRKLKQEGHS
YQSIKDALRR DLACEALADP ARTVADVAAA TGFAEPSAFY RAFRKWRGMS PADYRDAALA
ARAAASRFRR KPPTL