Gene BURPS668_A0033 details

Gene Information

Locus tag: BURPS668_A0033
Symbol:
ID: 4885772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 23433
End bp: 24437
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 51%
IMG OID: 640129974
Product: helix-hairpin-helix DNA-binding motif-containing protein
Protein accession: YP_001061039
Protein GI: 126443025
COG category: [K] Transcription
COG ID: [COG5499] Predicted transcription regulator containing HTH domain
TIGRFAM ID:
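
The coordinates and lengths listed above are consistent with one another, which a few lines of arithmetic can confirm. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(23433, 24437)
    print(length, "bp")                  # compare with the Gene Length field
    print(protein_length(length), "aa")  # compare with the Protein Length field
```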


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000119331
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
TTGCGGCAAG CAGACCTTGT TCCATATTTC GGAACGCGCA GTCGAGTTTC TGAGGTCCTT 
TCCAGGAAGC GACCTCTGAC GGTGCAAATG ATTCGCGCTC TGTCCGTCGG ACTTGGGATA
TCCGCAGAGA CGCTTGTTGG TGCAAACTCC ATCGATTCCG ATGCGTCAAA GAAAGAAGAC
GTCGATTGGT CTCGGTTCCC TGTCAAGGAG ATGGTTGCCC GCGGCTGGCT GAAGAAACTT
GCATCGCAGG CAGTTGGCTC AACCGAGGAT TTAGTGAAGG GCTTTATCTC AAGTGCAGGA
CTGCAATTCG GTACCGCCGC TTTTCGCCGA GGAATGAGCG GTGACGCGTA CTCGCCGAGC
ACGATGTACT CACTGTACGC GTGGCTTTCA AGGGTAATTC AACGCGGGCG TGAACGAAAA
GCGAGCCTTG GGAAATATGA TCCGTCGATG TTCTCTGCCG CATTTCTGCG TGAGCTTGCA
CAACTGAGTT GGTCAGAGCA TGGACCGCTA CTTGCGGTTG AATATCTTGA GCGTAGAGGT
ATTGCTGTCG TTGTTGAGCC ACATTTGAAA GGAACGTTGC TTGATGGCGC AGCACTCAAA
GATGCGGACG GCACTCCGAT CATTGGACTG ACTTTGCGGT TTGACCGACT GGACAGTTTC
TGGTTTACAT TACTGCATGA GGTTGCTCAT ATCTGGAAAC ACGTTGGTCA TGATGAAACT
TTTTTGGACA ATCTTGACGT GTCGCCAGAA GACAAACGCG AGCTGGAGGC GAACCGTCTG
GCCAAAGAGG CACTGATACC ACGTGTTGCG TGGAAGCGTA GTGATGCATA TCTAAATCCG
AGTCCAGAGA CCATCGACAA GCTTTCTCGA GAACTCAAGA TTCACCCGGC AATCATCGCA
GGTCGTTTGA GGAAAGAATC CGAAAACTAC AAGCTCTTCA ATGAGCTTAT CGGATACAAC
GAAGTACGGA AGCACTTTAA TCTAACCGCA AGTTCAGAGG TTTGA
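
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. The following is a minimal sketch, assuming the GC content field is the fraction of G and C bases in the gene sequence. The helper names `clean_sequence` and `gc_content` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demonstration.
    fragment = clean_sequence("TTGCGGCAAG CAGACCTTGT")
    print(len(fragment), gc_content(fragment))
```

Applying `gc_content(clean_sequence(...))` to the full gene sequence gives a value that can be compared with the GC content field.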
 
Protein sequence
MRQADLVPYF GTRSRVSEVL SRKRPLTVQM IRALSVGLGI SAETLVGANS IDSDASKKED 
VDWSRFPVKE MVARGWLKKL ASQAVGSTED LVKGFISSAG LQFGTAAFRR GMSGDAYSPS
TMYSLYAWLS RVIQRGRERK ASLGKYDPSM FSAAFLRELA QLSWSEHGPL LAVEYLERRG
IAVVVEPHLK GTLLDGAALK DADGTPIIGL TLRFDRLDSF WFTLLHEVAH IWKHVGHDET
FLDNLDVSPE DKRELEANRL AKEALIPRVA WKRSDAYLNP SPETIDKLSR ELKIHPAIIA
GRLRKESENY KLFNELIGYN EVRKHFNLTA SSEV
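
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The sketch below shows how the protein sequence relates to the gene sequence. It is a plain-Python illustration, not the database's own pipeline. It assumes the sequence is given 5' to 3' in the coding frame. The name `translate_cds` is hypothetical.

```python
# Amino acid assignments in TCAG codon order; table 11 uses the same
# assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate_cds("TTGCGGCAAG CAGACCTTGT TCCATATTTC"))
```

The 30-base fragment in the example yields MRQADLVPYF, the first ten residues of the protein sequence.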