Gene BMAA1603 details

Gene Information

Locus tag: BMAA1603
Symbol:
ID: 3087197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei ATCC 23344
Kingdom: Bacteria
Replicon accession: NC_006349
Strand:
Start bp: 1740988
End bp: 1742715
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 70%
IMG OID: 637565487
Product: type IV prepilin
Protein accession: YP_106183
Protein GI: 53716217
COG category:
COG ID:
TIGRFAM ID: [TIGR02532] prepilin-type N-terminal cleavage/methylation domain
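
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1740988
end_bp = 1742715
gene_length_bp = 1728
protein_length_aa = 575

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not appear in the protein.
codons = gene_length_bp // 3
residues = codons - 1

print(span, codons, residues)  # prints: 1728 576 575
```

Under these assumptions the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.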


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0116074
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTTCG TGCTTTCGCA CCGCCGCGCG CGCGGCTTCG CGCTGATCGA GATGCTCGGC 
GCGCTCGCGA TCGCCGCGCT GCTGCTTGCC GGCATCGCGG CGATGATGGA CAGCTCGCTC
GACGACGTGC GCGCGCAGCA GGCCGCGCAA TACCAGGCGC AGGTGACGGC CGCGGCCACG
CGTGCGCTCA AGCGTGACTA CGACGCATGG CTGCAGCGCG CGAACGCGCA GACGCCCGTC
GTGATGACGC TTGCCGATTT GCAGGCGACG AACGATCTGC CCGCCGCGCT ACAGACACGC
AACGCGTACG GCCAGCACAC GTGCGTGCTC GTCAAGCGCA CCGCGAACGG CGTCGGACTC
GACGCGCTCG TCGTGACGAC GGGCGGCGAG GCGATCGGCG ACAAGGAGCT CGGGCTCGTC
GCCGCGAGCG CGGGGCCGGG CGGCGGCTCG ATCGCCACGA GCGCGCCCGC GCTCGCGCGC
GGCGCGTTCG ACGCGTGGCG CATGCCGCTC GGCGCCTACC TCGGCGGCAG CTCGCCGACG
TGCGATCCGG CCGACGCCGC GCCGCCGAAC GCCGGCCATC TCGCCAACGA GATCTTCTTC
AACGGGCCGA GCCAGCAGAT CAACAGCGAT TACCTGTACC GCGTCGGCGT CGGCGGCCAT
CCGGAGGCGA ACGCGATGCA GGTGCCGATC TGGCTCACGC ACACGTTCGT CGAAGGCGCC
GCCGACGCGG CGAACTGCGG CGCGGCCGGC AGCTATGCGA ACGGCAAGCT CGGCGCGGAC
GCGGCCGGAC AGTTGCTGAG CTGCAGGAAC GGCGTGTGGC GCGGCGCCGG CGGTCACTGG
AAGGACCCGG TCAGGACGGC CGACGATCTG CCCACCGACG CATCGAACGA AACCGGCGAC
GTGCGCCTCA CGCTCGACAC GTTCCGCGCG TTCGCGTGGA CGGGCAACGC GTGGCAGGCG
CTCGCCGTGG ACCAGAACGG CAACATGATC GTGCCGGGCG TCGTCTCCGC GAACCAGTAC
GAGATCACCG GGCGCGTCGT CGTCAACACG CCGTGCGCGC CGGAGCCGAG CCGGCCGAAC
GCGGGGCTCG TGTCGATGGG CCAGGACGGG CAGGTGCTGT CCTGCCAGGG CGGCAAGTGG
CTGCCGCAAT CGGGGATCAA GATCGGCGGC ACCGAAACGG CGTGCGAGAT CCTGATGGAG
ACGCCCGGCG CGACGGATTT CTCGTGCGGG TACACCTACC GCGGCCCCTA TCCGAATCCG
CCGCTCATCA CCTACGAGCC CGACGGCACG TACACGTACA CGATCAACCG GCCGGTGAAG
CTCGACAACA ACGGGCTCAT CGCGGTGAGC GCGTACATGC ACATGAGCTA CGCGACGTGC
GCGCTGAAAG GGCGGGAAGG ACAGATGCGT CTCGTCGTCG ACGTGATCGA CGTTCAGAGC
AACCAGGTGA TCGCGCACAG CGAGGCGCAG TCGACGAAGC TGATCGAGGA CGCCGCGACG
ATCAACGTCA CGCTGAATCA GGCCGCCGAG CCGCGCAGCG GCTACACGGT CAGGCTGTCG
AGCAAGTGGG CGACATACGA CAGCTATGCG GGCACGCCGT GGACGTCGAG CTATTGCAGC
GGCGGCAAGA CGTTTCTCCA GACGCCGCTC GTGACCGGCT GGACGATCTT CGTTCTATTG
AACGGCGCTT CGCCGCGCGT GGCCGCCGCC GCGCGGCGCT GCGGTTAA
 
Protein sequence
MRFVLSHRRA RGFALIEMLG ALAIAALLLA GIAAMMDSSL DDVRAQQAAQ YQAQVTAAAT 
RALKRDYDAW LQRANAQTPV VMTLADLQAT NDLPAALQTR NAYGQHTCVL VKRTANGVGL
DALVVTTGGE AIGDKELGLV AASAGPGGGS IATSAPALAR GAFDAWRMPL GAYLGGSSPT
CDPADAAPPN AGHLANEIFF NGPSQQINSD YLYRVGVGGH PEANAMQVPI WLTHTFVEGA
ADAANCGAAG SYANGKLGAD AAGQLLSCRN GVWRGAGGHW KDPVRTADDL PTDASNETGD
VRLTLDTFRA FAWTGNAWQA LAVDQNGNMI VPGVVSANQY EITGRVVVNT PCAPEPSRPN
AGLVSMGQDG QVLSCQGGKW LPQSGIKIGG TETACEILME TPGATDFSCG YTYRGPYPNP
PLITYEPDGT YTYTINRPVK LDNNGLIAVS AYMHMSYATC ALKGREGQMR LVVDVIDVQS
NQVIAHSEAQ STKLIEDAAT INVTLNQAAE PRSGYTVRLS SKWATYDSYA GTPWTSSYCS
GGKTFLQTPL VTGWTIFVLL NGASPRVAAA ARRCG
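
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. The sketch below is not the tool that produced this record. It builds a codon table from the standard amino-acid assignments (which translation table 11 shares with the standard code for internal codons) and translates the first and last lines of the gene sequence. The function name `translate` and the variable names are hypothetical, and alternative start codons, which table 11 permits, are not handled.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
# Assumption: these are the standard amino-acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
# Each full line holds 60 bases, so every line starts in frame.
first_line = "ATGCGTTTCG TGCTTTCGCA CCGCCGCGCG CGCGGCTTCG CGCTGATCGA GATGCTCGGC"
last_line = "AACGGCGCTT CGCCGCGCGT GGCCGCCGCC GCGCGGCGCT GCGGTTAA"

print(translate(first_line))  # prints: MRFVLSHRRARGFALIEMLG
print(translate(last_line))   # prints: NGASPRVAAAARRCG
```

The two outputs match the first 20 and last 15 residues of the protein sequence listed above, and the final `TAA` codon is the stop that ends translation.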