Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA0023 |
Symbol | |
ID | 3090899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006348 |
Strand | + |
Start bp | 26942 |
End bp | 29092 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637560905 |
Product | hypothetical protein |
Protein accession | YP_101876 |
Protein GI | 53724913 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGATC GCCTGACGAT CGCGCAGACG TTCGCGGCGC TCGCCGCGCG GTTTTCTCAA TGGGAGGTGC TCGCCGCGCT CGATCAACTG GTGCGGCGCG GCTACGTGCG CGCCGATGCA CCCGGCGAGC GCGACGCCGA GCTTGCCTTC CATGAACGCG CGGGCGTCGA CGGCGATGCG GCGAGCGGCG TCGCGTCGCG GCTTACGGTC GCCGTCGAGG CGTTCGGCGT GGACCCGCGC GCGCAGCTCG ACGCGTTCGC CGCATGCGGG ATCGGCGTTG CGCCCGACGC GCCGCTGACG GTCGCGCTCA CCGACGGCTA CGATCGCGCC GAATTGATCG TGGCCGCCGA ACGCGCGGCG GCGCGCGGCG GCGCATTGCT CGTTGTCGTG GCCGATCGCG TGGAGCCGCT GATCGGCCCG TTGCTCGGCG CTGCGGCGGG CTTGGCGGCA TCGACGGCGC CGACGGCACC GTCGACATCG CCCGCGCCGC AGGACGCCGC CGACGCGCCG CCCTGCATCG AATGCGTGCG CTACTGGACG GCGCTGAACC ATCCCGTCGA GACGCTGCTC GCGCGCCTGC ACGGCGGCGA CGCGGCGCGC CTGCCGCCCG CGCGCAGCCG TGCGAGCGCC GCCGCCGTCG CGGCCGTCGT CGCGTCGTTC GTCGAGCAGA TCGCGGTGAA CGCGCAGCGC CGCCGTCATG CGGGCTCGCA TATCGTGTCG CTGCGCGTCG ATACGCTGGC CACCGCCGCG CATCGCGTCG TCAGGCGGCC GCAATGTCCG CGCTGCGCTC ACGCGGGATG GATGCGCGAG CAGGCCGAGC GCCCGGTGAC GCTCGCGTCG GCGGATGCAG GCGCGCGCCG CGACGGCGGC TATCGGACGC TTGCCGCCGC CGAGCTATTC AAACGCTACG GACATCTGAT TTCCCCGGTG AGCGGGCCGA TCGCCTATCT GCATCCGATG CCGGGGCGCA ACGCCGGCAT GCGGCACATG TACGTCGCGG GCTACCTGGT GTGCCCGCCG AGCGCGCCGC GCGAGAACCG TTTCGACAAG CTGTGCTCGG GCAAGGGCGC GAGCGACGCG CAGGCGCGCG CCAGCGCGCT CGCCGAGGCG CTGGAGCGCT TCAGCGGCGT CTATCAGGGC GACGAGGCGG CGCTGCGCGG CAGTCTCGCG CAGTTGTCGG CACACGCGCC GCCGGGCAGC GGGCCGATCG ACGTCAACGC GCTACAGCAG TACAGCGATC GCCAGTTCGA GCGGCGCGAG CGCCACAACG CGACGACCGA CGATCCGCGC AAGCAGGTGC CGCGGCGCTT CACGCGCGAC AGCGTGATCG ACTGGACACC CGCGTGGTCG ATCGCGACGG GCGCGCGGCG GCTCGTGCCG CTCGCGTACT GCTATGCGGA AACGCCCGCG TCGAGCGGCG CCGACTATTG CGTGCACAAC CCGAACGGCT GCGCGGCAGG CGCATGCATC GAGGAAGCGA TCCTGCAGGG CCTGCTCGAG CTCGTCGAGC GCGACGCGGT GGCGATCTGG TGGTACAACA TGCTGCGCCG GCCGGCCGTG GACATCGAGA GCTTCGGCGA TCCGTACTTC GACGCGCTCG CCGCCGACTA TGCGTCGCTC GGCTGGCGCT TGTGGGCGCT CGACATCACG CACGACCTGC GCATTCCGGT GTTCGTCGCG CTCGCGCGCG AAACGGCGAC GGGGCGCTTC TCGATCGGCT TCGGCTGCCA TCCGGACAGC CGGATCGCGC TGCAGCGCGC TCTCACCGAA GTGAATCAAC TGCTCGACGT CGGCGCGTCG GCGCCGCCGC CGTGGGACGT CGACAAGCTG CCGGACGACG CGTTTCTTCA TCCGGATCCC GCGCTGCCGC CGGTGCGCGC GCCGGCGCGC GCGCCGCACG GCCGCTGCGA TCTGAAGCGT GACATCGAGG ATTGCGTCGC GCGCTTGGCC GCGGCGGGCA TCGATACGCT CGTCGTCGAC AAGACGCGGC CCGACATCGG CCTGCCGGTC GTGCAGGTGA TCGCACCCGG CCTGTGTCAT TTCTGGCCGC GCTTCGGCGC GCCGCGACTG TATTCGGTGC CCGTCGCGCA GCGCTGGCGC GAGCGGCCGC GCGACGAGGA CGCGCTCAAT CGCGCGCTGC TGTTCCTGTA G
|
Protein sequence | MRDRLTIAQT FAALAARFSQ WEVLAALDQL VRRGYVRADA PGERDAELAF HERAGVDGDA ASGVASRLTV AVEAFGVDPR AQLDAFAACG IGVAPDAPLT VALTDGYDRA ELIVAAERAA ARGGALLVVV ADRVEPLIGP LLGAAAGLAA STAPTAPSTS PAPQDAADAP PCIECVRYWT ALNHPVETLL ARLHGGDAAR LPPARSRASA AAVAAVVASF VEQIAVNAQR RRHAGSHIVS LRVDTLATAA HRVVRRPQCP RCAHAGWMRE QAERPVTLAS ADAGARRDGG YRTLAAAELF KRYGHLISPV SGPIAYLHPM PGRNAGMRHM YVAGYLVCPP SAPRENRFDK LCSGKGASDA QARASALAEA LERFSGVYQG DEAALRGSLA QLSAHAPPGS GPIDVNALQQ YSDRQFERRE RHNATTDDPR KQVPRRFTRD SVIDWTPAWS IATGARRLVP LAYCYAETPA SSGADYCVHN PNGCAAGACI EEAILQGLLE LVERDAVAIW WYNMLRRPAV DIESFGDPYF DALAADYASL GWRLWALDIT HDLRIPVFVA LARETATGRF SIGFGCHPDS RIALQRALTE VNQLLDVGAS APPPWDVDKL PDDAFLHPDP ALPPVRAPAR APHGRCDLKR DIEDCVARLA AAGIDTLVVD KTRPDIGLPV VQVIAPGLCH FWPRFGAPRL YSVPVAQRWR ERPRDEDALN RALLFL
|
| |