Gene BURPS1710b_A1981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1981 
Symbol 
ID3693405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2418341 
End bp2419495 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content72% 
IMG OID637732235 
ProductAraC family transcriptional regulator 
Protein accessionYP_337132 
Protein GI76819530 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.382238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATACCG GCACGCAGCG GCAAACGGCG CGCCGCCGCG GCGTGCGCTC GACGCCCGCG 
AACACTCGCC GGCCGCGCCG GCGCGCTGCC CGGCACGCGC CGGCACGACC TTCCCGCACC
GCAAACGGGC ACGGTGCGGA AACCGCGGCA CATGCGTCTG CGCGCGGCGA GTGCCGATGC
GGCGCGCCGC CGTATGCCGC GCGCGGGCGA GCGCTCGCGT CACGTCCCGC CCACCGCTTC
CGGAACTGTT CCATGAGCTC GCCCGCACGC CATTCGCCGC CGCAAACGAT CACGAAGGAC
TCGCTCGGCT GCACGTCGAG CGGCTTTCTG ACGGATATCG AACGCGAAAC GGCGCTCTTG
CGCTTCTATC GGAAACACAC GACGGGCCTG CGCCTCGAGC AGGTCTCGAC ACCCGCGTCC
GGGCGCGGCG TGCTGATCGG CATCTCGCTG TCCGGCGGCC ATCGCCGCAA GATCCTCCGC
GGCCGCCGGT CGGTGACGCA CGACTTCCGC GCCGATTCGG TCTACGTGCG CGATTTCTCC
GAAGACTATC GCGCGGACAT GATGTCGGAT TTCGACTTCG CGCTCGTCGA GCTGTCGCCG
TCGTTCATCG ACGGCCTCGC CGAGCGCGGC CGCCCGGCGC GCATCGCCGG CGTCGCGCCG
ACGCTCGCGC ACGACGACCG CCTGCTCGGC GAGCTCGGGC GCGCGCTCGC GCTCGCGCTG
CAGTCGGGCG ACGCGGCCGA TGCCATGCTC GTCGATCAAT TGGGCATCGC GATCGGCACG
CACGCGATGC ACGCGTACGG CGGCCTGCGC GCCGACGAGC CGAAGCAGCG GCGGCGCCTG
TCGGCGCCGC TCGAGCGGCG CGCGAAGGAG ATGCTGGCGG CGGGCGCGAC GTCGGTGGAC
GAAATCGCGC GTGCATGCCG CGTGTCGCGC GGCTACTTCA TCAACGCGTT CAGCGCGACG
ACGGGCAAGA CGCCGCATCA GTGGCTGATC GAGCAACGCA TCGAAGCGGC GAAGCATCTG
CTCGCGCACG GCGACTGGAC GCTCGCGCGC ATCGCCGAGC ATTGCGGCTT CTCGAGCCAG
AGCCATTTCA CGCAGAGCTT CGCGAAGGCC ATCGGCCTGC CGCCCGGCGC GTGGCGCCGG
CGCGCGCGCG CCTGA
 
Protein sequence
MHTGTQRQTA RRRGVRSTPA NTRRPRRRAA RHAPARPSRT ANGHGAETAA HASARGECRC 
GAPPYAARGR ALASRPAHRF RNCSMSSPAR HSPPQTITKD SLGCTSSGFL TDIERETALL
RFYRKHTTGL RLEQVSTPAS GRGVLIGISL SGGHRRKILR GRRSVTHDFR ADSVYVRDFS
EDYRADMMSD FDFALVELSP SFIDGLAERG RPARIAGVAP TLAHDDRLLG ELGRALALAL
QSGDAADAML VDQLGIAIGT HAMHAYGGLR ADEPKQRRRL SAPLERRAKE MLAAGATSVD
EIARACRVSR GYFINAFSAT TGKTPHQWLI EQRIEAAKHL LAHGDWTLAR IAEHCGFSSQ
SHFTQSFAKA IGLPPGAWRR RARA