Gene BURPS1106A_A2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2166 
Symbol 
ID4904584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2120305 
End bp2121987 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content69% 
IMG OID640145271 
Productputative type IV pilus protein 
Protein accessionYP_001076199 
Protein GI126458207 
COG category 
COG ID 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0422322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTCG TGCTTTCGCG CCGCCGCGCG CGCGGCTTCG CGCTGATCGA GATGCTCGGC 
GCGCTCGCGA TCGCCGCGCT GCTGCTTGCC GGCATCGCGG CGATGATGGA CAGCTCGCTC
GACGACGTGC GCGCGCAGCA GGCCGCGCAA TACCAGGCGC AGGTGACGGC CGCGGCCACG
CGTGCGCTCA AGCGTGACTA CGACGCATGG CTGCAGCGCG CGAACGCGCA GACGCCCGTC
GTGATGACGC TTGCCGATTT GCAGGCGACG AACGATCTGC CCGCCGCGCT ACAGACGCGC
AACGCGTACG GCCAGCACAC GTGCGTGCTC GTCAAGCGCA CCGCGAACGG CGTCGGACTC
GACGCGCTCG TCGTGACGAC GGGCGGCGAG GCGATCGGCG ACAAGGAGCT CGGGCTCGTC
GCCGCGAGCG CGGGGCCGGG CGGCGGCTCG ATCGCCACGA GCGCGCCCGC GCTCGCGCGC
GGCGCGTTCG ACGCGTGGCG CATGCCGCTC GGCGCCTACC TCGGCGGCAG CTCGCCGACG
TGCGATCCGG CCGACGCCGC GCCGCCGAAC GCCGGCCATC TCGCCAACGA GATCTTCTTC
AACGGGCCGG GCCAGCAGAT CAACAGCGAT TACCTGTACC GCGTCGGCGT CGGCGGCCAT
CCGGAGGCGA ACGCGATGCA GGTGCCGATC TGGCTCACGC ACACGTTCGT CGAAGGCGCC
GCCGACGCGG CGAACTGCGG CGCGGCCGGC AGCTATGCGA ACGGCAAGCT CGGCGCGGAC
GCGGCCGGAC AGTTGCTGAG CTGCAGGAAC GGCGTGTGGC GCGGCGCCGG CGGTCACTGG
AAGGACCCGG TCAGGACGGC CGACGATCTG CCCACCGACG CATCGAACGA AACCGGCGAC
GTGCGCCTCA CGCTCGACAC GTTCCGCGCG TTCGCGTGGA CGGGCAACGC GTGGCAGGCG
CTCGCCGTGG ACCAGAACGG CAACATGATC GTGCCGGGCG TCGTCTCCGC GAACCAGTAC
GAGATCACCG GGCGCGTCGT CGTCAACACG CCGTGCGCGC CGGAGCCGAG CCGGCCGAAC
GCGGGGCTCG TGTCGATGGG CCAGGACGGG CAGGTGCTGT CGTGCCAGGG CGGCAAGTGG
CTGCCGCAAT CGGGGATCAA GATCGGCGGC ACCGAAACGG CGTGCGAGAT CCTGATGGAG
ACGCCCGGCG CGACGGATTT CTCGTGCGGG TACACCTACC GCGGCCCCTA TCCGAATCCG
CCGCTCATCA CCTACGAGCC CGACGGCACG TACACGTACA CGATCAACCG GCCGGTGAAG
CTCGACAACA ACGGGCTCAT CGCGGTGAGC GCGTACATGC ACATGAGCTA CGCGACGTGC
GCGCTGAAAG GGCGGGAAGG ACAGATGCGT CTCGTCGTCG ACGTGATCGA CGTTCAGAGC
AACCAGGTGA TCGCGCACAG CGAGGCGCAG TCGACGAAGC TGATCGAGGA CGCCGCGACG
ATCAACGTCA CGCTGAATCA GGCCGCCGAG CCGCGCAGCG GCTACACGGT CAGGCTGTCG
AGCAAGTGGG CGACGTACGA CAGCTATGCG GGCACGCCGT GGACGTCGAG CTATTGCAGC
GGCGGCAAGA CGTTTCTCCA GACGCCGCTC GTGACCGGCT GGACGATCAA TTCGTTCTAT
TGA
 
Protein sequence
MRFVLSRRRA RGFALIEMLG ALAIAALLLA GIAAMMDSSL DDVRAQQAAQ YQAQVTAAAT 
RALKRDYDAW LQRANAQTPV VMTLADLQAT NDLPAALQTR NAYGQHTCVL VKRTANGVGL
DALVVTTGGE AIGDKELGLV AASAGPGGGS IATSAPALAR GAFDAWRMPL GAYLGGSSPT
CDPADAAPPN AGHLANEIFF NGPGQQINSD YLYRVGVGGH PEANAMQVPI WLTHTFVEGA
ADAANCGAAG SYANGKLGAD AAGQLLSCRN GVWRGAGGHW KDPVRTADDL PTDASNETGD
VRLTLDTFRA FAWTGNAWQA LAVDQNGNMI VPGVVSANQY EITGRVVVNT PCAPEPSRPN
AGLVSMGQDG QVLSCQGGKW LPQSGIKIGG TETACEILME TPGATDFSCG YTYRGPYPNP
PLITYEPDGT YTYTINRPVK LDNNGLIAVS AYMHMSYATC ALKGREGQMR LVVDVIDVQS
NQVIAHSEAQ STKLIEDAAT INVTLNQAAE PRSGYTVRLS SKWATYDSYA GTPWTSSYCS
GGKTFLQTPL VTGWTINSFY