Gene BMA10229_1986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_1986 
Symbol 
ID4789202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008835 
Strand
Start bp2043174 
End bp2044901 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content70% 
IMG OID 
Producttype IV prepilin 
Protein accessionYP_001025782 
Protein GI124382814 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.610006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTCG TGCTTTCGCG CCGCCGCGCG CGCGGCTTCG CGCTGATCGA GATGCTCGGC 
GCGCTCGCGA TCGCCGCGCT GCTGCTTGCC GGCATCGCGG CGATGATGGA CAGCTCGCTC
GACGACGTGC GCGCGCAGCA GGCCGCGCAA TACCAGGCGC AGGTGACGGC CGCGGCCACG
CGTGCGCTCA AGCGTGACTA CGACGCATGG CTGCAGCGCG CGAACGCGCA GACGCCCGTC
GTGATGACGC TTGCCGATTT GCAGGCGACG AACGATCTGC CCGCCGCGCT ACAGACACGC
AACGCGTACG GCCAGCACAC GTGCGTGCTC GTCAAGCGCA CCGCGAACGG CGTCGGACTC
GACGCGCTCG TCGTGACGAC GGGCGGCGAG GCGATCGGCG ACAAGGAGCT CGGGCTCGTC
GCCGCGAGCG CGGGGCCGGG CGGCGGCTCG ATCGCCACGA GCGCGCCCGC GCTCGCGCGC
GGCGCGTTCG ACGCGTGGCG CATGCCGCTC GGCGCCTACC TCGGCGGCAG CTCGCCGACG
TGCGATCCGG CCGACGCCGC GCCGCCGAAC GCCGGCCATC TCGCCAACGA GATCTTCTTC
AACGGGCCGA GCCAGCAGAT CAACAGCGAT TACCTGTACC GCGTCGGCGT CGGCGGCCAT
CCGGAGGCGA ACGCGATGCA GGTGCCGATC TGGCTCACGC ACACGTTCGT CGAAGGCGCC
GCCGACGCGG CGAACTGCGG CGCGGCCGGC AGCTATGCGA ACGGCAAGCT CGGCGCGGAC
GCGGCCGGAC AGTTGCTGAG CTGCAGGAAC GGCGTGTGGC GCGGCGCCGG CGGTCACTGG
AAGGACCCGG TCAGGACGGC CGACGATCTG CCCACCGACG CATCGAACGA AACCGGCGAC
GTGCGCCTCA CGCTCGACAC GTTCCGCGCG TTCGCGTGGA CGGGCAACGC GTGGCAGGCG
CTCGCCGTGG ACCAGAACGG CAACATGATC GTGCCGGGCG TCGTCTCCGC GAACCAGTAC
GAGATCACCG GGCGCGTCGT CGTCAACACG CCGTGCGCGC CGGAGCCGAG CCGGCCGAAC
GCGGGGCTCG TGTCGATGGG CCAGGACGGG CAGGTGCTGT CCTGCCAGGG CGGCAAGTGG
CTGCCGCAAT CGGGGATCAA GATCGGCGGC ACCGAAACGG CGTGCGAGAT CCTGATGGAG
ACGCCCGGCG CGACGGATTT CTCGTGCGGG TACACCTACC GCGGCCCCTA TCCGAATCCG
CCGCTCATCA CCTACGAGCC CGACGGCACG TACACGTACA CGATCAACCG GCCGGTGAAG
CTCGACAACA ACGGGCTCAT CGCGGTGAGC GCGTACATGC ACATGAGCTA CGCGACGTGC
GCGCTGAAAG GGCGGGAAGG ACAGATGCGT CTCGTCGTCG ACGTGATCGA CGTTCAGAGC
AACCAGGTGA TCGCGCACAG CGAGGCGCAG TCGACGAAGC TGATCGAGGA CGCCGCGACG
ATCAACGTCA CGCTGAATCA GGCCGCCGAG CCGCGCAGCG GCTACACGGT CAGGCTGTCG
AGCAAGTGGG CGACGTACGA CAGCTATGCG GGCACGCCGT GGACGTCGAG CTATTGCAGC
GGCGGCAAGA CGTTTCTCCA GACGCCGCTC GTGACCGGCT GGACGATCTT CGTTCTATTG
AACGGCGCTT CGCCGCGCGT GGCCGCCGCC GCGCGGCGCT GCGGTTAA
 
Protein sequence
MRFVLSRRRA RGFALIEMLG ALAIAALLLA GIAAMMDSSL DDVRAQQAAQ YQAQVTAAAT 
RALKRDYDAW LQRANAQTPV VMTLADLQAT NDLPAALQTR NAYGQHTCVL VKRTANGVGL
DALVVTTGGE AIGDKELGLV AASAGPGGGS IATSAPALAR GAFDAWRMPL GAYLGGSSPT
CDPADAAPPN AGHLANEIFF NGPSQQINSD YLYRVGVGGH PEANAMQVPI WLTHTFVEGA
ADAANCGAAG SYANGKLGAD AAGQLLSCRN GVWRGAGGHW KDPVRTADDL PTDASNETGD
VRLTLDTFRA FAWTGNAWQA LAVDQNGNMI VPGVVSANQY EITGRVVVNT PCAPEPSRPN
AGLVSMGQDG QVLSCQGGKW LPQSGIKIGG TETACEILME TPGATDFSCG YTYRGPYPNP
PLITYEPDGT YTYTINRPVK LDNNGLIAVS AYMHMSYATC ALKGREGQMR LVVDVIDVQS
NQVIAHSEAQ STKLIEDAAT INVTLNQAAE PRSGYTVRLS SKWATYDSYA GTPWTSSYCS
GGKTFLQTPL VTGWTIFVLL NGASPRVAAA ARRCG