Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_1986 |
Symbol | |
ID | 4789202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | + |
Start bp | 2043174 |
End bp | 2044901 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | type IV prepilin |
Protein accession | YP_001025782 |
Protein GI | 124382814 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.610006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTCG TGCTTTCGCG CCGCCGCGCG CGCGGCTTCG CGCTGATCGA GATGCTCGGC GCGCTCGCGA TCGCCGCGCT GCTGCTTGCC GGCATCGCGG CGATGATGGA CAGCTCGCTC GACGACGTGC GCGCGCAGCA GGCCGCGCAA TACCAGGCGC AGGTGACGGC CGCGGCCACG CGTGCGCTCA AGCGTGACTA CGACGCATGG CTGCAGCGCG CGAACGCGCA GACGCCCGTC GTGATGACGC TTGCCGATTT GCAGGCGACG AACGATCTGC CCGCCGCGCT ACAGACACGC AACGCGTACG GCCAGCACAC GTGCGTGCTC GTCAAGCGCA CCGCGAACGG CGTCGGACTC GACGCGCTCG TCGTGACGAC GGGCGGCGAG GCGATCGGCG ACAAGGAGCT CGGGCTCGTC GCCGCGAGCG CGGGGCCGGG CGGCGGCTCG ATCGCCACGA GCGCGCCCGC GCTCGCGCGC GGCGCGTTCG ACGCGTGGCG CATGCCGCTC GGCGCCTACC TCGGCGGCAG CTCGCCGACG TGCGATCCGG CCGACGCCGC GCCGCCGAAC GCCGGCCATC TCGCCAACGA GATCTTCTTC AACGGGCCGA GCCAGCAGAT CAACAGCGAT TACCTGTACC GCGTCGGCGT CGGCGGCCAT CCGGAGGCGA ACGCGATGCA GGTGCCGATC TGGCTCACGC ACACGTTCGT CGAAGGCGCC GCCGACGCGG CGAACTGCGG CGCGGCCGGC AGCTATGCGA ACGGCAAGCT CGGCGCGGAC GCGGCCGGAC AGTTGCTGAG CTGCAGGAAC GGCGTGTGGC GCGGCGCCGG CGGTCACTGG AAGGACCCGG TCAGGACGGC CGACGATCTG CCCACCGACG CATCGAACGA AACCGGCGAC GTGCGCCTCA CGCTCGACAC GTTCCGCGCG TTCGCGTGGA CGGGCAACGC GTGGCAGGCG CTCGCCGTGG ACCAGAACGG CAACATGATC GTGCCGGGCG TCGTCTCCGC GAACCAGTAC GAGATCACCG GGCGCGTCGT CGTCAACACG CCGTGCGCGC CGGAGCCGAG CCGGCCGAAC GCGGGGCTCG TGTCGATGGG CCAGGACGGG CAGGTGCTGT CCTGCCAGGG CGGCAAGTGG CTGCCGCAAT CGGGGATCAA GATCGGCGGC ACCGAAACGG CGTGCGAGAT CCTGATGGAG ACGCCCGGCG CGACGGATTT CTCGTGCGGG TACACCTACC GCGGCCCCTA TCCGAATCCG CCGCTCATCA CCTACGAGCC CGACGGCACG TACACGTACA CGATCAACCG GCCGGTGAAG CTCGACAACA ACGGGCTCAT CGCGGTGAGC GCGTACATGC ACATGAGCTA CGCGACGTGC GCGCTGAAAG GGCGGGAAGG ACAGATGCGT CTCGTCGTCG ACGTGATCGA CGTTCAGAGC AACCAGGTGA TCGCGCACAG CGAGGCGCAG TCGACGAAGC TGATCGAGGA CGCCGCGACG ATCAACGTCA CGCTGAATCA GGCCGCCGAG CCGCGCAGCG GCTACACGGT CAGGCTGTCG AGCAAGTGGG CGACGTACGA CAGCTATGCG GGCACGCCGT GGACGTCGAG CTATTGCAGC GGCGGCAAGA CGTTTCTCCA GACGCCGCTC GTGACCGGCT GGACGATCTT CGTTCTATTG AACGGCGCTT CGCCGCGCGT GGCCGCCGCC GCGCGGCGCT GCGGTTAA
|
Protein sequence | MRFVLSRRRA RGFALIEMLG ALAIAALLLA GIAAMMDSSL DDVRAQQAAQ YQAQVTAAAT RALKRDYDAW LQRANAQTPV VMTLADLQAT NDLPAALQTR NAYGQHTCVL VKRTANGVGL DALVVTTGGE AIGDKELGLV AASAGPGGGS IATSAPALAR GAFDAWRMPL GAYLGGSSPT CDPADAAPPN AGHLANEIFF NGPSQQINSD YLYRVGVGGH PEANAMQVPI WLTHTFVEGA ADAANCGAAG SYANGKLGAD AAGQLLSCRN GVWRGAGGHW KDPVRTADDL PTDASNETGD VRLTLDTFRA FAWTGNAWQA LAVDQNGNMI VPGVVSANQY EITGRVVVNT PCAPEPSRPN AGLVSMGQDG QVLSCQGGKW LPQSGIKIGG TETACEILME TPGATDFSCG YTYRGPYPNP PLITYEPDGT YTYTINRPVK LDNNGLIAVS AYMHMSYATC ALKGREGQMR LVVDVIDVQS NQVIAHSEAQ STKLIEDAAT INVTLNQAAE PRSGYTVRLS SKWATYDSYA GTPWTSSYCS GGKTFLQTPL VTGWTIFVLL NGASPRVAAA ARRCG
|
| |