Gene BURPS668_A3005 details


Gene Information

Locus tag: BURPS668_A3005
Symbol:
ID: 4887797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2856176
End bp: 2857696
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 69%
IMG OID: 640132942
Product: sigma-54 dependent transcriptional regulator
Protein accession: YP_001063997
Protein GI: 126443116
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
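
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2856176, 2857696)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the coordinates covering the stop codon.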


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000526118
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGACAAGA AAGAAAAAGT GGAGACGAAT GCACCGATTT CGGGCGGCTG GGTTCGGCTG 
CCCGCCGATT ACGGCGACGT GCTGCGGCGC GCGGCGGAGT CGCTGTTCAA GACCTTCGAG
CACTCGAGCG TCGGCACGCT GATCGTCGAC AAGGATGCGC GCGTCGTCTG GATCAATCAG
CGTTACGCGG CGCGTTTCGG GTTCGCCGAT CCGCAGCAGG CGATCGGCCG CGATTGCGAA
GCGGTGATTC CGCACAGCCT GATGCGCGAG GTGGTCGCGA CCGGCCGCCC GATCCTGCTC
GACATCATGG AGACGGGCCG CGAGCCGCTC GTCGTCACGC GCCTGCCGCT GACGGACGAC
GCGGGCGAGA CCGTCGGCGC GATCGGCTTC GCGCTGTTCG ACGAGCTGAA GACGCTCACG
CCGCTCTTTT CGCGCTACAT GCAGGTCCAG CAGGAGCTGA TCGCGACGCA ACGCTCGCTC
GCGCAGGCGC GGCGGGCGAA ATACACGTTC GCGAGCTTCG TCGGCACGAG CGCGGTGAGC
CTCGAGACGA AGCGGCAGGG GCGGCGCGCC GCGCAGGTCG ATTCGCCGGT GCTGCTGCTC
GGCGAGACGG GCACCGGTAA GGAGCTGCTC GCGCATGCGA TCCACGCGGC GTCCGCGCGG
GCATTGAAGC CGCTCGTGAC CGTCAACGTC GCGGCGATTC CCGATGCGCT GCTCGAAACC
GAGTTCTTCG GCGCGGCGCC GGGCGCGTAC ACGGGCGCGG ATCGCAAGGG GCGCGTCGGC
AAGTTCGAGC TTGCCGACGG CGGCACGCTC TTTCTCGACG AAATCGGCGA CATGCCGGTG
CCGCTGCAGG GCAAGCTGCT GCGCGTGCTG CAGGACAAGG AGTTCGAGCC GGTCGGCTCG
AACCGGATCG TGCGCGCGAA TGTGCGGATC ATCGCGGCGA CGTCGGCCGA ATTGCCGGCG
CTCGTCGCGG AAGGGCGCTT TCGCGCGGAC CTTTATTACC GGCTGAACGT GCTGACGATC
CATGCGCCGC CGCTGCGCGA GCGCGCATCG GACATCGAGG CGCTCGTCTA CACGATGCTC
GAGGAACTCG CCGCGCAGCA TGGGCTGGCC GAGCACTGCG AACTGACCGA CGACGCGCTG
CGCCTGCTGT GCGCGTATCC GTGGCCCGGC AACGTGCGCG AACTGCGCAA CACGCTCGAG
CGCGCGCTGA TGCTGTCCGA TCGCGCGTTG ATCGATGCGC GCGCGCTCGC GCCGTTCATC
GGGCCGGCGC GCGGCGCGGG GGGCGGTGTC GGGGCGGGCG GCGTCGGTCC GGCCGCGGTC
GCCATCGCGG CGCAGACTGC CATGGCCGAT ACGCGCGCGG CGGCGTCATC CTATGCGGAC
GCATTCGCCG CGTGGGAGCG TCAATTCCTG ATCGACGCGC TTGCCGCGTC CAACGGCAAG
GTGACGGAAG CGGCCGCGCG CATCGGCATC GGGCGTGCGA CGTTCTACAA GAAGCTCGCG
ACGCTCGGCA TCGATACGTA G
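
The GC content field reported for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown here; `gc_fraction` is a hypothetical helper, and the example call uses only the first block of the sequence rather than the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces and line breaks of the block layout."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First 10-base block of the gene sequence: 4 G + 1 C out of 10 bases.
print(gc_fraction("GTGGACAAGA"))  # 0.5
```

Passing the full gene sequence to the same function and multiplying by 100 gives a percentage that can be compared with the reported value.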
 
Protein sequence
MDKKEKVETN APISGGWVRL PADYGDVLRR AAESLFKTFE HSSVGTLIVD KDARVVWINQ 
RYAARFGFAD PQQAIGRDCE AVIPHSLMRE VVATGRPILL DIMETGREPL VVTRLPLTDD
AGETVGAIGF ALFDELKTLT PLFSRYMQVQ QELIATQRSL AQARRAKYTF ASFVGTSAVS
LETKRQGRRA AQVDSPVLLL GETGTGKELL AHAIHAASAR ALKPLVTVNV AAIPDALLET
EFFGAAPGAY TGADRKGRVG KFELADGGTL FLDEIGDMPV PLQGKLLRVL QDKEFEPVGS
NRIVRANVRI IAATSAELPA LVAEGRFRAD LYYRLNVLTI HAPPLRERAS DIEALVYTML
EELAAQHGLA EHCELTDDAL RLLCAYPWPG NVRELRNTLE RALMLSDRAL IDARALAPFI
GPARGAGGGV GAGGVGPAAV AIAAQTAMAD TRAAASSYAD AFAAWERQFL IDALAASNGK
VTEAAARIGI GRATFYKKLA TLGIDT
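
The protein sequence corresponds to the gene sequence translated with translation table 11, in which GTG can act as a start codon and is then read as methionine. The following is a minimal sketch of that translation in plain Python, assuming the input begins at the start codon and is in frame; `translate_cds` is a hypothetical helper, and the codon table is the standard amino acid assignment that table 11 shares with the standard code.

```python
# Standard codon assignments, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at the first stop codon.

    Assumes the sequence starts at a valid start codon (an assumption, not checked here).
    """
    cleaned = "".join(seq.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = cleaned[index:index + 3]
        amino_acid = "M" if index == 0 else CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGGACAAGA AAGAAAAAGT GGAGACGAAT"))  # MDKKEKVETN
```

The output matches the first block of the protein sequence above; the initial GTG is read as M only because it is the start codon, while the internal GTG is read as V.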