Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_C6649 |
Symbol | |
ID | 3733971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007509 |
Strand | - |
Start bp | 158141 |
End bp | 159160 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637760356 |
Product | AraC family transcriptional regulator |
Protein accession | YP_366343 |
Protein GI | 78059768 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.700604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0354217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCCGC AGATGATTTC GCCGGATTTC GTCGACGACG CGCTCGCGTG CCTGCGCCGG CAAGGGCTTC CGACGGAGCC CGTGCTACGC ACCGCCGGCC TGCCGGCCGC CGTGCGCGAG CCTGTCACGC CGCAGCAGTA CGGCCGGCTG TGGCTCGCGA TCGCCGGTGC GCTCGACGAC GAGTTCTTCG GCCTCGCCGC ACGCCCGATG CGGCATGGCA GCTTCACGCT GCTGTGCCAT GCGGTGCTGC ACGCCGGCAC GCTCGAGAAG GCGCTGCGGC GCGCGCTGCA GTTCCTGCGC GTGGTGCTGG ACGAGCCGCA TGGCGAGCTT GTCGTGGCCG ACGGGCAGGC GCAGATCGTG CTGACGCAGA CGGGCGCGCC CTACCCGGCG TTCGCGTACC GGACGTTCTG GCTGATCCTC CTCGGCGTCG CGTGCTGGCT GATCGGCCGG CGCATCCCGC TCCAGCGCAT CGACTTCGCG TGCCCGAGCC CCGACCAGCG CAGCGACTAT CACCAGTTCT TCGGCGTGCC CGTGCATTTC GACCGGCCCG ACAGCCGGCT CGCGTTCAAC GCCGCGTACC TTGCGCTGCC GACGATCCGC TCCGAGCAGG CGTTGAAGAC TTTCCTGCGC GGCGCGCCCG GCAACCTGCT GGTTCGCTAC CGGCACGACA CGGGCTGGGT CGCGAAGACG CGCGCGCAAC TGAAAACGCT ACCGGCGGCG GAGTGGCCCG ACTTCGACAC GCTGGCCGTG CGCCTCGGCA CGACGCCCGC GACGCTGCGG CGGCGTCTGC GCAGCGAAGG GCAAAGCTTC GCGGCGATCA AGGACGAGCT GCGCGGCGCG CTGGCGCAGT CGCTGTTGCG CGGGGATGCG CTCAGCGTGG CGGAGATCGC GGCCGAGCTC GGGTTTACCG AGCCGAGCGC GTTTCATCGC GCGTTCCGGA AATGGACGGG CACGAGTCCT GGTGCGTTCC GGCGGGATGT GCATGCGGCG GAGGGGGAAC CGGGGGTGGC GAGCGGATGA
|
Protein sequence | MGPQMISPDF VDDALACLRR QGLPTEPVLR TAGLPAAVRE PVTPQQYGRL WLAIAGALDD EFFGLAARPM RHGSFTLLCH AVLHAGTLEK ALRRALQFLR VVLDEPHGEL VVADGQAQIV LTQTGAPYPA FAYRTFWLIL LGVACWLIGR RIPLQRIDFA CPSPDQRSDY HQFFGVPVHF DRPDSRLAFN AAYLALPTIR SEQALKTFLR GAPGNLLVRY RHDTGWVAKT RAQLKTLPAA EWPDFDTLAV RLGTTPATLR RRLRSEGQSF AAIKDELRGA LAQSLLRGDA LSVAEIAAEL GFTEPSAFHR AFRKWTGTSP GAFRRDVHAA EGEPGVASG
|
| |