Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_1377 |
Symbol | |
ID | 4678396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008784 |
Strand | - |
Start bp | 1361354 |
End bp | 1362379 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639843893 |
Product | AraC family transcriptional regulator |
Protein accession | YP_990973 |
Protein GI | 121597164 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00832504 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCCCG GCCGACGAAC GATGTTCAGG ACGCCGGCGC TTTGTGCGAG GCAGACGATC ATGCCCCCGA CCCCATTCGA TCCGCTCGCG TTGCGCGAGC ACCGGCTGTT CGAATCGCGC GACCTCGACG AGACGCGCGA GCGGATCTCG CGCGTGATGC AGCCGCATGC GCTGTTGCCG GACGGCTCGC GGCACGGGCC GTCGCACATG GATTACGTGC GCCTCGGCGG GCTCGGCATC GGAACCATCG CGTTCGGCGA CGCGATGCGG GTGCATCTCG ATGCGGTGGA CGGCTATCAT CTGCTGATGT TTTGTCTGAC GGGTTCCGCG CAGGTCCGCA CGATGGGCCG CGCGTTCGAC GTCGACGCGC ACACGGGCGT GCTGTGCGCG CCGGGCGAGC CGTTCGACGC GCACCTGTCG CGCGATTGCG AGCAGTTCGT CCTCCGTATC GATGCGGCGA CCCTCGCCGC GCACGCGGGC GACGCGGCGG CGGCGCTCGA TCCCGTGATC GGCATCGACG ATTCGGCGCT GAGCGCGTGG ATGCAGCAAC TGCAGCTCGT CGCGCGCTCG CCGGAACTGC TCGCGAGCGC AAGCGCGAAC CCGCGCGTCG CAACGCGGCT CGAACAGTTG CTGCTCGATC TGCTGATCGA CGGGCATCCG CCCGCCGCGC CGCCCGCGCG GCGCGCCGAT CCGGCGCCAG GCTTCGTGCG GCGCGCGCAG GAGTTCATCG GCGCGCAGCT CGCCCAGCCG CTGCAGCTCG CCGACATCGC GCAGGCCGCG GGCGTACCCG AGCGCACGCT GCGCGACGGC TTCCTGCAGT TTCGCGGGAC GAGCCCGATG CAATACCTGC GCCAGCGGCG CCTCGAGCGC GCGCGCGAGC TGCTGCGCAC GGCCGCGCCC GAGCGCCGGA TCGCCGAGAT CGCGCTCGAT TGCGGTTTCG CGCACTTCGG CCGCTTCGCG ATCGCCTACC GCGAACGGTT CGGCGAGCTG CCGTCCGCGA CGCTCGCCGA TCGGCGCGAC GCCTGA
|
Protein sequence | MPPGRRTMFR TPALCARQTI MPPTPFDPLA LREHRLFESR DLDETRERIS RVMQPHALLP DGSRHGPSHM DYVRLGGLGI GTIAFGDAMR VHLDAVDGYH LLMFCLTGSA QVRTMGRAFD VDAHTGVLCA PGEPFDAHLS RDCEQFVLRI DAATLAAHAG DAAAALDPVI GIDDSALSAW MQQLQLVARS PELLASASAN PRVATRLEQL LLDLLIDGHP PAAPPARRAD PAPGFVRRAQ EFIGAQLAQP LQLADIAQAA GVPERTLRDG FLQFRGTSPM QYLRQRRLER ARELLRTAAP ERRIAEIALD CGFAHFGRFA IAYRERFGEL PSATLADRRD A
|
| |