Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3411 |
Symbol | |
ID | 4899972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 3328132 |
End bp | 3329019 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640136637 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001067648 |
Protein GI | 126453663 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACC GTCTGAAAGA CCTCCTCGCG CGTTTCGAAC TGCACGCCCG CGTATTTCAC TTCGGTGCGC TGCCCGGCGC GTCGACATTC GAGATTGGCG CGGACGGCTT CCATATGCAC CTGGTGCGCA CGGGGGCGGT CAACGTGACG GGCGAGGCGC TCGGGCGGCA CGCGGTGCCC GAGCCGGGCG CGGTGTTCAT CGGGCGCCCC GGCAAGTACC GGATCGAGGC GCGCGGCGAC GCGCCTGTCG AAGTCTTGTC CGCGGCGATC GAATTCGGGC TCGGCGATGA GAATCCGCTG TTGCGCGGCT TGCCCGATCT GCTCGCGATT CCGCTTGCGT CGATGTCGCC GCTCGGCGGC GTCCAGCAGG CGCTCTTCGC GGAAGCGAGC GCGCCCGCGT GCGGGCATGA CACGGTGATC AACCGGCTGA CCGAGGTGCT CGTCGTGCAG TTGCTGCGCT TCGTGATGCG CAACCGGCTG GTGGCGAGCG GATCGCTCGC CGGCCTGTCC GACGCGCGGC TCGCGAAGGC GTTGAACGCG ATGCACGCGG ATCCGGCGCT GCCGTGGTCG CTCGAGCGGA TGGCTGCGAT CGCGGGCATG TCGCGCTCGC GATTCGCCGC GCACTTCGCG GGCACGGTCG GCCTGCCGCC CGGCGAATAC CTGCTCCAAT GGCGCGTCGG GCTCGCGAAG ACGCTGCTCA GGCGCGGCTA TGCGGTGAAG GAAATCGCGC CGGAAGTCGG CTACGGCAGC GCGAGCGCGC TCACGCGCGC ATTCGCGCAA TCCACCGGGC AAGCGCCGAC CGATTGGCTC GCGCGGGCGG GCGACGCACC TGGCGCGATC GGAGCGGCAG CCGATTCGAT GCCGGACGTC GGCGTGCGAG CCGCCTGA
|
Protein sequence | MIDRLKDLLA RFELHARVFH FGALPGASTF EIGADGFHMH LVRTGAVNVT GEALGRHAVP EPGAVFIGRP GKYRIEARGD APVEVLSAAI EFGLGDENPL LRGLPDLLAI PLASMSPLGG VQQALFAEAS APACGHDTVI NRLTEVLVVQ LLRFVMRNRL VASGSLAGLS DARLAKALNA MHADPALPWS LERMAAIAGM SRSRFAAHFA GTVGLPPGEY LLQWRVGLAK TLLRRGYAVK EIAPEVGYGS ASALTRAFAQ STGQAPTDWL ARAGDAPGAI GAAADSMPDV GVRAA
|
| |