Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0461 |
Symbol | |
ID | 4900408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 425237 |
End bp | 426202 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640133691 |
Product | DNA-binding transcriptional activator GcvA |
Protein accession | YP_001064744 |
Protein GI | 126454788 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.796194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCC ATCGACTGCC GATGCTGAAT GCGCTGCGCG TGTTCGAGGC CGCCGCGCGG CACGAGAGCT TCTCGCGCGC GGCAGACGAG CTGTCCGTCA CGCACGGCGC GGTCAGCCAC CAGATGCGCG CGCTGGAAGC CGAGCTCGGC GTGCCGCTGT TCGTGCGCCA CGGCAAGCGG CTCGCGCTGA CGGACGCGGG CGGCCGCTAC GCGCAGCAGG TGCGCGCCGC GCTCGCGCTG CTCGCCGACG CGACCCGCGA GGTTCGCGCG AGCGAGCGCG ACAAGCGGCT CGTCGTGTCG ACGCTGCCGT CGTTCGCCGC GCGCTGGATC ACGCCGAGGA TCGGCCCGTT CATCGAACGG CATCCGGAAA TCGACCTGGA GCTGCGGGCA AGCGATTCGC TCGTCGATTT CGCGCGCGAC GACGTCGATG TCGCGATCCG CTTCGGCCAC GGCGTCTATC CGGGGCTGCA CGTCGAGCCG CTGCTCGACG AGACGTTCTT TCCCGTCTGC GCGCCGACGC TCAACGGCGG CATGCTGCCC GAGACGCCCG CCGATCTCGT CCGCTATCCG CTGCTGCGCT CGGACGACGA GCTGTGGCGG CCGTGGTTCG ACGCGGCGGG GCTCGACACG CTGACCGAGC CGAAGCGCGG CGTGCTGTAT CAGGATTCGT CGAATCTGCT GCAGGCGGCG ATCGACGGCC AGGGCATCGC GCTCGTGCGG CGCTCGCTCG CGGTGCCGGA GGTGGCGGCC GGCCGGATCG TGCGGCTCTT CGACATCGCG GGGCCGAGCC CGTGGCACTA CTTCTTCGTG TGCCCGCCGT CGCTCGCGCA AACGCCGCGC GTGCAGGCGC TCAGGAACTG GCTGCTGGAC GAGATCGCGC GCTTCAGGGC GCTGTGCGCG GCGCAGGAGA CGCGGCACGC GGCGGCCTAT GCGGCCGCGC GTGCGCGCGG CAAGGAGGGA AACTAG
|
Protein sequence | MNIHRLPMLN ALRVFEAAAR HESFSRAADE LSVTHGAVSH QMRALEAELG VPLFVRHGKR LALTDAGGRY AQQVRAALAL LADATREVRA SERDKRLVVS TLPSFAARWI TPRIGPFIER HPEIDLELRA SDSLVDFARD DVDVAIRFGH GVYPGLHVEP LLDETFFPVC APTLNGGMLP ETPADLVRYP LLRSDDELWR PWFDAAGLDT LTEPKRGVLY QDSSNLLQAA IDGQGIALVR RSLAVPEVAA GRIVRLFDIA GPSPWHYFFV CPPSLAQTPR VQALRNWLLD EIARFRALCA AQETRHAAAY AAARARGKEG N
|
| |