Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_0373 |
Symbol | |
ID | 3688981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 390186 |
End bp | 391151 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637726829 |
Product | AraC family transcriptional regulator |
Protein accession | YP_331787 |
Protein GI | 76811172 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGTACG AAGGAAGTCC GATGAGCTTC GAACCCCTTT TCGCCGACGG CGGCGCGCGC CTGCAGCGCC GCATCCTCGC GCTGTTCGGG CAGCTCGCGC CGAACGAAGG CTATACGCAC GCCGCGCTCG ACGGCGTGCG CTTCATTCGC TCGAACCGGC CCGTGCCGCG GATGCCCGTG CTGTACGACC CGAGCATCGT CGTCGTGTGC CAGGGGCGCA AGCGCGGCTA TGTCGGCGAG CAGAGCTTCG TCTACGACGC CCAGCAGTAC CTCGTGCTGT CGGTGCCGCT GCCGTTCGAA TGCGAGACGT TCGCGACGCC GGACGAGCCG TTTCTCGCCA TTTCGATTCG CATCGATCTC GCGGTGATCG CCGAGCTCGC GATCCTGCTC GACGAGACGC ACGGCGCGTC GGTCAGCGAG CCGCGCGGCA TCTATTCGAC GCCGCTCGAC GCGCCGCTCG CGGACGCGGT GCTGCGCCTG CTCGAGGCGC TTGCGTCGCC CGCCGATACG CGCGTGCTCG GCCCGGCGAT CATGCGCGAG ATCGGCTATC GCGTGCTGAC GGGCGCGCAG GGCGACGCGA TTCGCGCGGC GCTCGCGCAG CAGCACCATT TCGGGCGCGT CGCGCGGGCG CTGCGGCGCA TCCATGCGGA CCTGAGCGGC GAGCTCGACG TCGAGACGCT CGCGAGCGAG GCGGGGATGA GCCTTGCCGT GTTCCATGCG CAGTTCAAGA GCGTGACGGC GACCTCGCCG ATGCAGTACG TGAAGGCGAC GCGGCTCCAT CATGCGCGGC TGATGATGGT GCAGGACGGG CTGAATGCGG GAGCGGCGGC GGCGCGGGTC GGCTATGCGA GCGCGTCGCA GTTCAGTCGG GAGTTCAAGC GGCTGTTCGG GCGCAGTCCG AGCGACGAAG TGCGCTGGGT GCAGGCGGGC GGCCGGCAGC CGGTCGCGCT CGAAGCGGGA GAGTGA
|
Protein sequence | MQYEGSPMSF EPLFADGGAR LQRRILALFG QLAPNEGYTH AALDGVRFIR SNRPVPRMPV LYDPSIVVVC QGRKRGYVGE QSFVYDAQQY LVLSVPLPFE CETFATPDEP FLAISIRIDL AVIAELAILL DETHGASVSE PRGIYSTPLD APLADAVLRL LEALASPADT RVLGPAIMRE IGYRVLTGAQ GDAIRAALAQ QHHFGRVARA LRRIHADLSG ELDVETLASE AGMSLAVFHA QFKSVTATSP MQYVKATRLH HARLMMVQDG LNAGAAAARV GYASASQFSR EFKRLFGRSP SDEVRWVQAG GRQPVALEAG E
|
| |