Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_0698 |
Symbol | |
ID | 4677206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008784 |
Strand | + |
Start bp | 702489 |
End bp | 703520 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639843222 |
Product | LysR family transcriptional regulator |
Protein accession | YP_990303 |
Protein GI | 121596886 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | [TIGR03418] putative choline sulfate-utilization transcription factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.18558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGCGCTC ATATGGAATC GTCGGATCGG AACGGCGGCG CGACAGGCAG CCGCCGCCGG CCGGATCTCA CTATTGCCGG CCGTAATGCG GCCGTGAAGG CATCGAATGC TTATGAGCCA TCAGAGGTGC TTATGACGAG ACCAGACCGT CTGCCGCCGA TGCAGACCTT GTCCGCGTTC GAAGCGGCCG CGCGCCTCGC GAGCTTCACG GCCGCCGCGC GCGAGCTCGG CTCGACGCAG CCGGCCGTCA GCCAGCGCGT CGTTCAGCTC GAAGAGGATC TCGGCACGCC GCTCTTCGAG CGCGGGCGCC GCGGCGTCAC GCTGACCGAG GACGGCACGC GGCTCTTCGC GGCGGTCCGC CAAAGCCTCG ACGCGCTGCG CGCCGCGACG GCGGACATCC GCAGCCGCCG CGCGAACGGC ACATTCACGC TCGTCACCGA TTTCGGCTTC GCCACCTACT GGCTGATGCC GCGCCTCGAC GATCTGAAGC GCGCGATGCC CGGCGTCGAC GTGCGCGTCG TCACGTCTCA GGACATCGAT CCGCAGCGCG AGCACGCCGA CGTCGCGATC CTGTTCGGCG CCGGCGACTG GCCGGGCTGC ACATCGACGC GGCTCTTTCA GGAACACGTG ACGCCCGTGT GCTCGCCCGC GTTTCGCACC GCGCATGCCG ATATCGCGCG GCCAGCCGAC CTGTTGCGCG CGCCGCTGCT GCACGTGCAG CCGACGCGCC CCGAGCGCTG GCTCGCGTGG CGCGATTGGT TCGACGCGCA TGGGCTCGCC GCGCCGCCCG AGCCGCACGG GCTGACGTTC AACAGCTACT CGCTCGTGAT TCAAGCGGCG CTGATGAATC AGGGTGTCGC GCTCGGCTGG GCGCCGCTCG TCGACACGCC GATCGCGGCC GGCCAGCTCG TGCGGCTCGT CGACGCGCCC GTCGTCACGC CGCGCGGCTA CTACCTCGTT CTGCCGCCCG CGCGGCCGGA GGCGCGCGCG GTGCCGCTCT TTCGCCGCTG GCTGCTCGGC GCATGCGCAT GA
|
Protein sequence | MGAHMESSDR NGGATGSRRR PDLTIAGRNA AVKASNAYEP SEVLMTRPDR LPPMQTLSAF EAAARLASFT AAARELGSTQ PAVSQRVVQL EEDLGTPLFE RGRRGVTLTE DGTRLFAAVR QSLDALRAAT ADIRSRRANG TFTLVTDFGF ATYWLMPRLD DLKRAMPGVD VRVVTSQDID PQREHADVAI LFGAGDWPGC TSTRLFQEHV TPVCSPAFRT AHADIARPAD LLRAPLLHVQ PTRPERWLAW RDWFDAHGLA APPEPHGLTF NSYSLVIQAA LMNQGVALGW APLVDTPIAA GQLVRLVDAP VVTPRGYYLV LPPARPEARA VPLFRRWLLG ACA
|
| |