Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_A2992 |
Symbol | |
ID | 4791189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | - |
Start bp | 3018388 |
End bp | 3021342 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | fimbrial usher protein |
Protein accession | YP_001028936 |
Protein GI | 124384239 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGACAG TCCGCCTCGC GCACCTCTCG ACGGACCTTG GGCGACATCG ACGGTTCGTC GGCCTGACGA CCACGCTCGC CGCCGCGACG TTCGCCGCGT CGGCGGCGAG CGGCGACGCC CGCGCGCAAC GCGCGCCGGC CCAAGCCGGC GACGCGAGCG CCGACGCCGG GGCGCGAACG GCGCCGGCAC GCCTGCCCGC GCTCACGGCG ATGTCCGCGC GTATCCCGGG CGAGCACGTG CTCATCGGCG CGACCGCGCC CGCGACATCG GGCGGCAGGC CGGCGGCCCC GGCCGATGCA GCCGTTGCGC GCGCAACGAC CGCGCAGGCA ATGGCGAGAG AAAGGCGCGA CGGGGCGGCG GCGCCCATGC CCGGCGGGCC GGTGTCGATC GCGGCGACGT GGCCGTCGGT ACCGCTATCG CCGTCGTTCG CCTCCGTACC CTCGCTGCCT TCGTCGCCTG CCTCGTCCGC CTCGTCCGTA TCGTCCGTGC CGTCCGTGTC AGCCATATCA GCCGTATCAA CCGCATCGTC CGCGTCACCC GCGCTCCCCG TGTCGCCGCG CCTATTCGCA TCGCCGTCAC TCCCGGCCTC TCCCACGCCG CCGCCCGCCC CCGGCGCATC GCCCCGCTCC GCCGCCTCGT CAGCGGCCCG GACTATGCTC GCCGAGGCGG CCGCGCCGCC CGCGCGGTCC GCATTGCCCG GCCCGACGAC GAGCCTGCCG GTGCCGGGCG CCGACGCGAC CGTCCCCGCG AGCGACCTGT ATCTCGGCGT CTCGCTGAAC GGCCAGCCGA CGCGCCTGAT CGTGCACTTC GTCGTCGCCG ACGGGCGCTT CTACGCGAGC CAGGACGATC TGAACGACAT CGGCGTCGCG ACGTCACGGC TGCGACAGCC GGCGAACGCG CTCATCGCGC TCGATGCGCT CGACGGCCTG CGCTACCGCT ACGACGCCGC GCGCCAGACG ATCGATCTCG ACGCGCCCGA TTCGCTGCGC ATCCCGCACA CGTTCGACAC ACGCGCGCTC GCGCCGACGG TCCCCGCGAG CGCGGGCCGC GGCGTCGTGC TGAATTACGA CCTCTACGCG CAGACGGCCG ATCGCGCGAG CGCGGCGCTC TGGCACGAGG CGCGCTACTT CGATCCGGCC GGCGTCTTCA GCAGCACGGG CGTCGCGTAT CTTCAGCACG GCGGCCAGCG CTACACGCGC TACGACACCT CGTGGAGCAT GTCCGATCCG AAATCGCTGA CGACGACGCA GTTCGGCGAC ACGATCTCGT CGTCGCTCGC CTGGACGCGC TCGCTGCGCG TCGCCGACCT GCAATGGCGC AGCAACTTCG CGCTGCGCCC GGACCTCGTG ACGTTCCCGG TGCCGGCGCT CGCGGGCACG GCCGTCGTGC CGTCGACCGT CGATCTGTAC GTGAACGGCG TGCGCCAGTT CAGCGGCGAC GTGCCGAGCG GCCCGTTCGT CATCAACAGC GTGCCGAGCA TCACGGGCGC GGGCAACGCG ACCGTGGTCA CGCGCGACGC GCTCGGGCGC ACGATCGCGA CGTCGCTGCC GCTCTACATC GACACGCGGA TGCTCGCGCC CGGCCTCGCG AGCTATTCGG TCGAGGCGGG TTTCCTGCGC CGCGCGTGGG GGCTGCGCTC GTTCGACTAC GCGCCCCGCC CGGCCGTGAG CGCGACCGCG CGCTACGGCG TGAGCGAGCG CCTGACCGTC GAGGCGCACG CGGAGGCGAC GCCCGGCCTC TACAACGCGG GCGCGGGCGC GCTCGTGCGG CTCGGCGGCG CGGGGGTCGC GAGCGCGTCG GCCGCGCAAA GCGCCGGACG CCTCGCGGGC ACGCAGGCCG GCCTCGGCTA CCAGCTCGTG CTGCCGCGCT TTTCGATCGA CGCGCAAACG CTGCGCGCGT TCGGCCAATA CGGCGACCTC GCCGCACGCG ACGGCACGCC GGTGGCGAGC GCGACCGATC GCGTCACGCT GTCGCTGCCG TTCATCCGCT CGCAGACGTT CGCGATCAGC TACATCGGCC TCAGGTATCC GGGCCTGCAA ACCGCGCGGA TCGGCTCCGT GTCGTACTCG GTCAACGTCG GCAACCTTGC GTCGATCAAC GTCAGCGCGT TCCAGGACTT CCACCAGCAC GACTCGCGCG GCGTGTTCGT GAGCCTGAAC GTCGCGCTCG GCAACCGGAC GTCGGTCAAC GCGAACGTCG GCCAGCAGAA CGGCAAGACG GTCTACAACG TGAACGCGAT GCGCGCGCCC GACTACGGCG GCGGCTTCGG CTGGAGCGCG CAGACGGGCG ACGCGGGCGG CGTGCGCTAC GGCCAGGCGC AGGCGCGATA TCTCGGCCGC TCGGGCGAGG TGGCGGCGCT CGCGCAGACG ATCGCCGGAC ATCAGAACGC GGCGCTCGAC GTGGCGGGCG CCGTCGTGCT GATGGACGGC CGCGCGCTGC TCACGCGGCG CATCGACGAC GGCTTCGCGC TCGTGTCGAC CGACGCTTCG CCGGGCGTGC CGGTGCTGCA CGAGAATCGC CTGATCGGCA CGACCGACCG CAACGGCTAC CTGCTGATTC CGGATCTGAA CGCGTACCAG AACAACCGGA TCGGCATCGA CACGTTGAAG CTGCCGCTCG ACGCGCGCGT ATCCGACACG ATTCGCAACG TCGTTCCGCA GTCGCGCTCG GGCGTGCTCG CGCATTTCGC GATCGCGCGC GAACAGTCGG CGTCGATCGT CCTCGAGGAT GCGTCCGGCG CGCCGCTGCC GGCCGGGCTG TCGGTCTCGC ATCGCGAGAG CGGCGCGAGC ACGATCGTCG GCTACGACGG GCTCACGTTC GTCACGGGCC TCGCGGCCGC CAATCACCTG GAGATCACGG GCCACGGCAA GCGCTGCGCG GTCGCGTTCG ACTACGTGCG CCCGGCCGAC GGCACGCCGC CGACGATCGG GCCGCTCGTC TGCGACCTGA AGTGA
|
Protein sequence | MPTVRLAHLS TDLGRHRRFV GLTTTLAAAT FAASAASGDA RAQRAPAQAG DASADAGART APARLPALTA MSARIPGEHV LIGATAPATS GGRPAAPADA AVARATTAQA MARERRDGAA APMPGGPVSI AATWPSVPLS PSFASVPSLP SSPASSASSV SSVPSVSAIS AVSTASSASP ALPVSPRLFA SPSLPASPTP PPAPGASPRS AASSAARTML AEAAAPPARS ALPGPTTSLP VPGADATVPA SDLYLGVSLN GQPTRLIVHF VVADGRFYAS QDDLNDIGVA TSRLRQPANA LIALDALDGL RYRYDAARQT IDLDAPDSLR IPHTFDTRAL APTVPASAGR GVVLNYDLYA QTADRASAAL WHEARYFDPA GVFSSTGVAY LQHGGQRYTR YDTSWSMSDP KSLTTTQFGD TISSSLAWTR SLRVADLQWR SNFALRPDLV TFPVPALAGT AVVPSTVDLY VNGVRQFSGD VPSGPFVINS VPSITGAGNA TVVTRDALGR TIATSLPLYI DTRMLAPGLA SYSVEAGFLR RAWGLRSFDY APRPAVSATA RYGVSERLTV EAHAEATPGL YNAGAGALVR LGGAGVASAS AAQSAGRLAG TQAGLGYQLV LPRFSIDAQT LRAFGQYGDL AARDGTPVAS ATDRVTLSLP FIRSQTFAIS YIGLRYPGLQ TARIGSVSYS VNVGNLASIN VSAFQDFHQH DSRGVFVSLN VALGNRTSVN ANVGQQNGKT VYNVNAMRAP DYGGGFGWSA QTGDAGGVRY GQAQARYLGR SGEVAALAQT IAGHQNAALD VAGAVVLMDG RALLTRRIDD GFALVSTDAS PGVPVLHENR LIGTTDRNGY LLIPDLNAYQ NNRIGIDTLK LPLDARVSDT IRNVVPQSRS GVLAHFAIAR EQSASIVLED ASGAPLPAGL SVSHRESGAS TIVGYDGLTF VTGLAAANHL EITGHGKRCA VAFDYVRPAD GTPPTIGPLV CDLK
|
| |