Gene BURPS668_1062 details


Gene Information

Locus tag: BURPS668_1062
Symbol:
ID: 4883189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1038324
End bp: 1040624
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 72%
IMG OID: 640126990
Product: P pilus assembly protein, porin PapC
Protein accession: YP_001058112
Protein GI: 126442012
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
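
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the Start and End coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated on this page, and the function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not counted as a residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = expected_lengths(1038324, 1040624)
print(gene_len, protein_len)  # 2301 766
```

Under those assumptions the result matches the Gene Length (2301 bp) and Protein Length (766 aa) fields listed in the record.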


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.819207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGCCG AGGCGGCCGC GCCGCCCGCG CGGTCCGCAT TGCCCGGGCC GACGACGAGC 
CTGCCGGTGC CGGGCGCCGA CGCGACCGTC CCCGCGAGCG ACCTGTATCT CGGCGTCTCG
CTGAACGGCC AGCCGACGCG CCTGATCGTG CACTTCGTCG TCGCCGACGG GCGCTTCTAC
GCGAGCCAGG ACGATCTGAA CGACATCGGC GTCGCGACGT CGCGGCTGCG ACAGCCGGCG
AACGCGCTCA TCGCGCTCGA TGCGCTCGAC GGCCTGCGCT ACCGCTACGA CGCCGCGCGC
CAGACGATCG ATCTCGACGC GCCCGATTCG CTGCGCATCC CGCACACGTT CGACACACGC
GCGCTCGCGC CGACGGTCCC CGCGAGCGCG GGCCGCGGCG TCGTGCTGAA TTACGACCTC
TATGCGCAGA CGGCCGATCG CGCGAGCGCG GCGCTCTGGC ACGAGGCGCG CTACTTCGAT
CCGGCCGGCG TCTTCAGCAG CACGGGCGTC GCGTATTTTC AGCACGGCGG CCAGCGCTAC
ACGCGCTACG ACACCTCGTG GAGCATGTCC GATCCGAAAT CGCTGACGAC GACGCAGTTC
GGCGACACGA TCTCGTCGTC GCTCGCCTGG ACGCGCTCGC TGCGCGTCGC CGGCCTGCAA
TGGCGCAGCA ACTTCGCGCT GCGCCCGGAC CTCGTGACGT TCCCGGTGCC GGCGCTCGCG
GGCACGGCCG TCGTGCCGTC GACCGTCGAT CTGTACGTGA ACGGCGTGCG CCAGTTCAGC
GGCGACGTGC CGAGCGGCCC GTTCGTCATC AACAGCGTGC CGGGCATCAC GGGCGCGGGC
AACGCGACCG TGGTCACGCG CGACGCGCTC GGGCGCACGA TCGCGACGTC GCTGCCGCTC
TACATCGACA CGCGGATGCT CGCGCCCGGC CTCGCGAGCT ATTCGGTCGA GGCGGGTTTC
CTGCGCCGCG CGTGGGGGCT GCGCTCGTTC GACTACGCGC CCCGCCCGGC CGTGAGCGCG
ACCGCGCGCT ACGGCGTGAG CGAGCGCCTG ACCGTCGAGG CGCACGCGGA GGCGACGCCC
GGCCTCTACA ACGCGGGCGC GGGCGCGCTC GTGCGGCTCG GCGGCGCGGG GGTCGCGAGC
GCGTCGGCCG CGCAAAGCGC CGGGCGCCTC GCGGGCACGC AGGCCGGCCT CGGCTACCAG
CTCGTGCTGC CGCGCTTTTC GATCGACGCG CAAACGCTGC GCGCGTTCGG CCAATACGGC
GACCTCGCCG CGCGCGACGG CACGCCGGTG GCGAGCGCGA CCGATCGCGT CACGCTGTCG
CTGCCGTTCA TCCGCTCGCA GACGTTCGCG ATCAGCTACA TCGGCCTCAG GTATCCGGGC
CTGCAAACCG CGCGGATCGG CTCCGTGTCG TACTCGGTCA ACGTCGGCAA CCTTGCGTCG
ATCAACGTCA GCGCGTTCCA GGACTTCCAC CAGCACGACT CGCGCGGCGT GTTCGTGAGC
CTGAACGTCG CGCTCGGCAA CCGGACGTCG GTCAACGCGA ACGTCGGCCG GCAGAACGGC
AAGACCGTCT ACAACGTGAA CGCGATGCGC GCGCCCGACT ACGGCGGCGG CTTCGGCTGG
AGCGCGCAGA CGGGCGACGC GGGCGGCGTG CGCTACGGCC AGGCGCAGGC GCGATATCTC
GGCCGCTCGG GCGAGGTGGC GGCGCTCGCG CAGACGATCG CCGGACATCA GAACGCGGCG
CTCGACGTGG CGGGCGCCGT CGTGCTGATG GACGGCCGCG CGCTGCTCAC GCGGCGCATC
GACGACGGCT TCGCGCTCGT GTCGACCGAC GCTTCGCCGG GCGTGCCGGT GCTGCACGAG
AATCGCCTGA TCGGCACGAC CGACCGCAAC GGCTACCTGC TGATTCCGGA TCTGAACGCG
TACCAGAACA ACCGGATCGG CATCGACACG TTGAAGCTGC CGCTCGACGC GCGCGTATCC
GACACGATTC GCAACGTCGT TCCGCAGTCG CGCTCGGGCG TGCTCGCGCA TTTCGCGATC
GCGCGCGAAC AGTCGGCGTC GATCGTCCTC GAGGATGCGT CCGGCGCGCC GCTGCCGGCC
GGGCTGTCGG TCTCGCATCG CGAGAGCGGC GCGAGCACGA TCGTCGGCTA CGACGGGCTC
ACGTTCGTCA CGGGCCTCGC GGCCGCCAAT CACCTGGAGA TCACGGGCCA CGGCAAGCGC
TGCGCGGTCG CGTTCGACTA TGCGCGCCCG GCCGACGGCA CGCCGCCGAC GATCGGGCCG
CTCGTCTGCG ACCTGAAGTA A
 
Protein sequence
MLAEAAAPPA RSALPGPTTS LPVPGADATV PASDLYLGVS LNGQPTRLIV HFVVADGRFY 
ASQDDLNDIG VATSRLRQPA NALIALDALD GLRYRYDAAR QTIDLDAPDS LRIPHTFDTR
ALAPTVPASA GRGVVLNYDL YAQTADRASA ALWHEARYFD PAGVFSSTGV AYFQHGGQRY
TRYDTSWSMS DPKSLTTTQF GDTISSSLAW TRSLRVAGLQ WRSNFALRPD LVTFPVPALA
GTAVVPSTVD LYVNGVRQFS GDVPSGPFVI NSVPGITGAG NATVVTRDAL GRTIATSLPL
YIDTRMLAPG LASYSVEAGF LRRAWGLRSF DYAPRPAVSA TARYGVSERL TVEAHAEATP
GLYNAGAGAL VRLGGAGVAS ASAAQSAGRL AGTQAGLGYQ LVLPRFSIDA QTLRAFGQYG
DLAARDGTPV ASATDRVTLS LPFIRSQTFA ISYIGLRYPG LQTARIGSVS YSVNVGNLAS
INVSAFQDFH QHDSRGVFVS LNVALGNRTS VNANVGRQNG KTVYNVNAMR APDYGGGFGW
SAQTGDAGGV RYGQAQARYL GRSGEVAALA QTIAGHQNAA LDVAGAVVLM DGRALLTRRI
DDGFALVSTD ASPGVPVLHE NRLIGTTDRN GYLLIPDLNA YQNNRIGIDT LKLPLDARVS
DTIRNVVPQS RSGVLAHFAI AREQSASIVL EDASGAPLPA GLSVSHRESG ASTIVGYDGL
TFVTGLAAAN HLEITGHGKR CAVAFDYARP ADGTPPTIGP LVCDLK
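
The gene and protein sequences on this page are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal Python sketch of that cleanup, followed by a GC percentage calculation that could be compared against the GC content field once the full gene sequence is pasted in; the function name `gc_percent` is hypothetical, and how the page rounds its reported value is not stated.

```python
def gc_percent(blocked_seq):
    """Return the percentage of G and C bases in a whitespace-blocked DNA string."""
    # Remove the spaces and line breaks used for display.
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustration on the first two blocks of the gene sequence only;
# pass the complete sequence to get the value for the whole gene.
print(gc_percent("ATGCTCGCCG AGGCGGCCGC"))
```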