Gene BURPS668_0861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0861 
Symbol 
ID4883223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp840783 
End bp841817 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content69% 
IMG OID640126789 
ProductDJ-1/PfpI family protein/transcriptional regulator, AraC family 
Protein accessionYP_001057912 
Protein GI126439934 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCGC AAACTCCCCT CCGGCATCGG ACGACGACCG TCGATGTCGT GATCTATCCG 
GGATTCAAGG CGATCGAGGC CGTCGGCGTC ATCAACGTGT TCGACTATGC GAACGCGCGG
CTCGCCGCCG CGGGGCTCGC GCCCGTCTAC GATCTCCAGA TCGCCGCGCC CGCGAAGGGC
GCGGTCAAGT CCGACACCCT CATCGTGCTC GAGGCGACGA AGGCGCTCGA CACGCTCGCG
GTGCCCGACA CGGCGATCGT CGTCGGCGCG CGCGACATCG AGCGGGCGCT GCGCGACACG
TCGATGCTCG TCGGATGGTG CCGCGACGTG TCCGCGCGCA TCGGCCGGAT GGTCGGGCTG
TGCTCGGGCT GCTTCTTTCT CGCCGAAGCC GGCATGCTGG ACGGCCGGCG CGCGACGACG
CACTGGAGCG TCGCCCCCCT GTTGCGGGCG CGTTATCCGG CGGTGAAGGT GGAGCCCGAC
GCGATCTTCG TTCGCGAGGG CAACGTGTGG ACGTCGGCGG GCGTCACGGC CGGCCTCGAT
CTCGCGCTCG CGATGGTCGA GGAGGATCTC GGCCGCGAGA TCGCGCTCGC CGTCGCGCGC
GATCTCGTGA TTTACCTGAA GCGGCCGGGC GGCCAGTCGC AGTTCAGCGT GTACCTGGCG
AGCCAGATGA CCGCGCACGC GTCGATCCGC GACATTCAGG ACTGGATTCT GAACGCGCTC
GACGCGCGGC TGAGCATCGC GCAGCTCGCC AGGCGCGCCG CGATGAGCGA GCGCAACTTC
ATTCGCGTGT TCGTGCGCGA AACCGGCTAT CGTCCGGCCG AATTCATCGA AATCGCGCGG
CTCGAAAAAG CGCGCCGCCT GCTCGAGCAG GAAGCGCTGC CGCTGAAGAC GGTGGCCGTG
CGCAGCGGGT TTCGTTCCGA CGACCAATTG CGGCGCGTGT TCATGCGCCG CCTCGGCGTG
ACGCCCGGCG CGTATCGCGA GCGGTTCTCC GGCACCGGCG TGCGCGAAGC GCGGGGGAGC
GGCGACGTGG ATTGA
 
Protein sequence
MAAQTPLRHR TTTVDVVIYP GFKAIEAVGV INVFDYANAR LAAAGLAPVY DLQIAAPAKG 
AVKSDTLIVL EATKALDTLA VPDTAIVVGA RDIERALRDT SMLVGWCRDV SARIGRMVGL
CSGCFFLAEA GMLDGRRATT HWSVAPLLRA RYPAVKVEPD AIFVREGNVW TSAGVTAGLD
LALAMVEEDL GREIALAVAR DLVIYLKRPG GQSQFSVYLA SQMTAHASIR DIQDWILNAL
DARLSIAQLA RRAAMSERNF IRVFVRETGY RPAEFIEIAR LEKARRLLEQ EALPLKTVAV
RSGFRSDDQL RRVFMRRLGV TPGAYRERFS GTGVREARGS GDVD