Gene Information |
Locus tag | BURPS668_0441 |
Symbol | |
ID | 4881815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 410828 |
End bp | 411793 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640126369 |
Product | DNA-binding transcriptional activator GcvA |
Protein accession | YP_001057494 |
Protein GI | 126441649 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
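The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; both are assumptions about the record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 410828
end_bp = 411793

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 966 321
```

Under these assumptions the computed values agree with the listed gene length of 966 bp and protein length of 321 aa.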
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGCC ATCGACTGCC GATGCTGAAT GCGCTGCGCG TGTTCGAGGC CGCCGCGCGG CACGAAAGCT TCTCGCGCGC GGCAGACGAG TTGTCCGTCA CGCACGGCGC GGTCAGCCAC CAGATGCGCG CGCTGGAGGC GGAGCTCGGC GTGCCGCTGT TCGTGCGCCA CGGCAAGCGG CTCGCGCTGA CGGACGCGGG CGGCCGCTAC GCGCAGCAGG TGCGCGCCGC GCTCGCGCTG CTCGCCGACG CGACCCGCGA GGTTCGCGCG AGCGAGCGCG ACAAGCGGCT CGTCGTGTCG ACGCTGCCGT CGTTCGCCGC GCGCTGGATC ACGCCGCGCA TCGGCCCGTT CATCGAACGG CATCCGGAAA TCGACCTGGA GCTGCGGGCA AGCGATTCGC TCGTCGATTT CGCGCGCGAC GACGTCGATG TCGCGATCCG CTTCGGCCAC GGCGTCTATC CGGGGCTGCA CGTCGAGCCG CTGCTCGACG AGACGTTCTT TCCCGTCTGC GCGCCGACGC TCAACGGCGG CATGCTGCCC GAGACGCCCG CCGATCTCGT CCGCTATCCG CTGCTGCGCT CGGACGACGA GCTGTGGCGG CCGTGGTTCG ACGCGGCGGG GCTCGACACG CTGACCGAGC CGAAGCGCGG CGTGCTGTAT CAGGATTCGT CGAATCTGCT GCAGGCGGCG ATCGACGGCC AGGGCATCGC GCTCGTGCGG CGCTCGCTCG CGGTGCCGGA GGTGGCGGCC GGCCGGATCG TGCGGCTCTT CGACATCGCG GGGCCGAGCC CGTGGCACTA CTTCTTCGTG TGCCCGCCGT CGCTCGCGCA AACGCCGCGC GTGCAGGCGC TCAGGAGCTG GCTGCTGGAC GAGATCGCGC GCTTCAGGGC GCTGTGCGCG GCGCAGGAGG CGCAGCACGC GGCGGCCTAT GCGGCCGCGC GTGCGCGCGG CAAGGAGGGA AACTAA
|
Protein sequence | MNSHRLPMLN ALRVFEAAAR HESFSRAADE LSVTHGAVSH QMRALEAELG VPLFVRHGKR LALTDAGGRY AQQVRAALAL LADATREVRA SERDKRLVVS TLPSFAARWI TPRIGPFIER HPEIDLELRA SDSLVDFARD DVDVAIRFGH GVYPGLHVEP LLDETFFPVC APTLNGGMLP ETPADLVRYP LLRSDDELWR PWFDAAGLDT LTEPKRGVLY QDSSNLLQAA IDGQGIALVR RSLAVPEVAA GRIVRLFDIA GPSPWHYFFV CPPSLAQTPR VQALRSWLLD EIARFRALCA AQEAQHAAAY AAARARGKEG N
|
| |
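The gene sequence is listed in space-separated blocks of 10 bases. It begins with ATG and ends with TAA, so it appears to be given in coding orientation already, even though the gene lies on the minus strand. The sketch below shows one way to clean such a listing, compute a GC fraction, and translate it. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the amino acid assignments of the standard genetic code, which translation table 11 shares (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Amino acid assignments in NCBI codon order (bases ordered T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def clean(raw):
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence from the record above.
fragment = clean("ATGAACAGCC ATCGACTGCC GATGCTGAAT")
print(translate(fragment))  # MNSHRLPMLN
```

The translated fragment matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` in the same way would allow the GC content and protein sequence fields to be compared against the record.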