Gene Bcep18194_C6649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_C6649 
Symbol 
ID3733971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007509 
Strand
Start bp158141 
End bp159160 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content71% 
IMG OID637760356 
ProductAraC family transcriptional regulator 
Protein accessionYP_366343 
Protein GI78059768 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.700604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0354217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCCGC AGATGATTTC GCCGGATTTC GTCGACGACG CGCTCGCGTG CCTGCGCCGG 
CAAGGGCTTC CGACGGAGCC CGTGCTACGC ACCGCCGGCC TGCCGGCCGC CGTGCGCGAG
CCTGTCACGC CGCAGCAGTA CGGCCGGCTG TGGCTCGCGA TCGCCGGTGC GCTCGACGAC
GAGTTCTTCG GCCTCGCCGC ACGCCCGATG CGGCATGGCA GCTTCACGCT GCTGTGCCAT
GCGGTGCTGC ACGCCGGCAC GCTCGAGAAG GCGCTGCGGC GCGCGCTGCA GTTCCTGCGC
GTGGTGCTGG ACGAGCCGCA TGGCGAGCTT GTCGTGGCCG ACGGGCAGGC GCAGATCGTG
CTGACGCAGA CGGGCGCGCC CTACCCGGCG TTCGCGTACC GGACGTTCTG GCTGATCCTC
CTCGGCGTCG CGTGCTGGCT GATCGGCCGG CGCATCCCGC TCCAGCGCAT CGACTTCGCG
TGCCCGAGCC CCGACCAGCG CAGCGACTAT CACCAGTTCT TCGGCGTGCC CGTGCATTTC
GACCGGCCCG ACAGCCGGCT CGCGTTCAAC GCCGCGTACC TTGCGCTGCC GACGATCCGC
TCCGAGCAGG CGTTGAAGAC TTTCCTGCGC GGCGCGCCCG GCAACCTGCT GGTTCGCTAC
CGGCACGACA CGGGCTGGGT CGCGAAGACG CGCGCGCAAC TGAAAACGCT ACCGGCGGCG
GAGTGGCCCG ACTTCGACAC GCTGGCCGTG CGCCTCGGCA CGACGCCCGC GACGCTGCGG
CGGCGTCTGC GCAGCGAAGG GCAAAGCTTC GCGGCGATCA AGGACGAGCT GCGCGGCGCG
CTGGCGCAGT CGCTGTTGCG CGGGGATGCG CTCAGCGTGG CGGAGATCGC GGCCGAGCTC
GGGTTTACCG AGCCGAGCGC GTTTCATCGC GCGTTCCGGA AATGGACGGG CACGAGTCCT
GGTGCGTTCC GGCGGGATGT GCATGCGGCG GAGGGGGAAC CGGGGGTGGC GAGCGGATGA
 
Protein sequence
MGPQMISPDF VDDALACLRR QGLPTEPVLR TAGLPAAVRE PVTPQQYGRL WLAIAGALDD 
EFFGLAARPM RHGSFTLLCH AVLHAGTLEK ALRRALQFLR VVLDEPHGEL VVADGQAQIV
LTQTGAPYPA FAYRTFWLIL LGVACWLIGR RIPLQRIDFA CPSPDQRSDY HQFFGVPVHF
DRPDSRLAFN AAYLALPTIR SEQALKTFLR GAPGNLLVRY RHDTGWVAKT RAQLKTLPAA
EWPDFDTLAV RLGTTPATLR RRLRSEGQSF AAIKDELRGA LAQSLLRGDA LSVAEIAAEL
GFTEPSAFHR AFRKWTGTSP GAFRRDVHAA EGEPGVASG