Gene BURPS668_A2250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2250 
Symbol 
ID4887734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2181337 
End bp2183019 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content69% 
IMG OID640132187 
Productputative type IV pilus protein 
Protein accessionYP_001063244 
Protein GI126445536 
COG category 
COG ID 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTCG TGCTTTCGCG CCGCCGCGCG CGCGGCTTCG CGCTGATCGA GATGCTCGGC 
GCGCTCGCGA TCGCCGCGCT GCTGCTCGCC GGCATCGCGG CGATGATGGA CAGCTCGCTC
GACGACGTGC GCGCGCAGCA GGCCGCGCAA TACCAGGCGC AGGTGACGGC CGCGGCCACG
CGTGCGCTCA AGCGTGACTA CGACGCATGG CTGCAGCGCG CGAACGCGCA GACGCCCGTC
GTGATGACGC TTGCCGATTT GCAGGCGACG AACGATCTGC CCGCCGCGCT ACAGACGCGC
AACGCGTACG GCCAGCACAC GTGCGTGCTC GTCAAGCGCA CCGCGAACGG CGTCGGACTC
GACGCGCTCG TCGTGACGAC GGGCGGCGAG GCGATCGGCG ACAAGGAGCT CGGGCTCGTC
GCCGCGAGCG CGGGGCCGGG CGGCGGCTCG ATCGCCACGA GCGCGCCCGC GCTCGCGCGC
GGCGCGTTCG ACGCGTGGCG CATGCCGCTC GGCGCCTACC TCGGCGGCAG CTCGCCGACG
TGCGATCCGG CCGACGCCGC GCCGCCGAAC GCCGGCCATC TCGCCAACGA GATCTTCTTC
AACGGGCCGG GCCAGCAGAT CAACAGCGAT TACCTGTACC GCGTCGGCGT CGGCGGCCAT
CCGGAGGCGA ACGCGATGCA GGTGCCGATC TGGCTCACGC ACACGTTCGT CGAAGGCGCC
GCCGACGCGG CGAACTGCGG TGCGGCCGGC AGCTATGCGA ACGGCAAGCT CGGCGCGGAC
GCGGCCGGAC AGTTGCTGAG CTGCAGGAAC GGCGTGTGGC GTGGCGCCGG CGGTCACTGG
AAGGACCCGG TCAGGACGGC CGACGATCTG CCCACCGACG CATCGAACGA AACCGGCGAC
GTGCGCCTCA CGCTCGACAC GTTCCGCGCG TTCGCGTGGA CGGGCAACGC GTGGCAGGCG
CTCGCCGTGG ACCAGAACGG CAACATGATC GTGCCGGGCG TCGTCTCCGC GAACCAGTAC
GAGATCACCG GGCGCGTCGT CGTCAACACG CCGTGCGCGC CGGAGCCGAG CCGGCCGAAC
GCGGGGCTCG TGTCGATGGG CCAGGACGGG CAGGTGCTGT CGTGCCAGGG CGGCAAGTGG
CTGCCGCAAT CGGGGATCAA GATCGGCGGC ACCGAAACGG CGTGCGAGAT CCTGATGGAG
ACGCCCGGCG CGACGGATTT CTCGTGCGGG TACACCTACC GCGGCCCCTA TCCGAATCCG
CCGCTCATCA CCTACGAGCC CGACGGCACG TACACGTACA CGATCAACCG GCCGGTGAAG
CTCGACAACA ACGGGCTCAT CGCGGTGAGC GCGTACATGC ACATGAGCTA CGCGACGTGC
GCGCTGAAAG GGCGGGAAGG ACAGATGCGT CTCGTCGTCG ACGTGATCGA CGTTCAGAGC
AACCAGGTGA TCGCGCACAG CGAGGCGCAG TCGACGAAGC TGATCGAGGA CGCCGCGACG
ATCAACGTCA CGCTGAATCA GGCCGCCGAG CCGCGCAGCG GCTACACGGT CAGGCTGTCG
AGCAAGTGGG CGACGTACGA CAGCTACGCG GGCACGCCGT GGACGTCGAG CTATTGCAGC
GGCGGCAAGA CGTTTCTCCA GACGCCGCTC GTGACCGGCT GGACGATCAA TTCGTTCTAT
TGA
 
Protein sequence
MRFVLSRRRA RGFALIEMLG ALAIAALLLA GIAAMMDSSL DDVRAQQAAQ YQAQVTAAAT 
RALKRDYDAW LQRANAQTPV VMTLADLQAT NDLPAALQTR NAYGQHTCVL VKRTANGVGL
DALVVTTGGE AIGDKELGLV AASAGPGGGS IATSAPALAR GAFDAWRMPL GAYLGGSSPT
CDPADAAPPN AGHLANEIFF NGPGQQINSD YLYRVGVGGH PEANAMQVPI WLTHTFVEGA
ADAANCGAAG SYANGKLGAD AAGQLLSCRN GVWRGAGGHW KDPVRTADDL PTDASNETGD
VRLTLDTFRA FAWTGNAWQA LAVDQNGNMI VPGVVSANQY EITGRVVVNT PCAPEPSRPN
AGLVSMGQDG QVLSCQGGKW LPQSGIKIGG TETACEILME TPGATDFSCG YTYRGPYPNP
PLITYEPDGT YTYTINRPVK LDNNGLIAVS AYMHMSYATC ALKGREGQMR LVVDVIDVQS
NQVIAHSEAQ STKLIEDAAT INVTLNQAAE PRSGYTVRLS SKWATYDSYA GTPWTSSYCS
GGKTFLQTPL VTGWTINSFY