Gene BURPS1106A_A2170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2170 
Symbol 
ID4903744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2124159 
End bp2125769 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content71% 
IMG OID640145275 
Producttype IV pilus protein PilQ 
Protein accessionYP_001076203 
Protein GI126458355 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.284038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACG ACGCGCAACG ACCCGACACG GCCGCCGCGC GCGCGACGCC GAGCGCCGCC 
GACGCCGCCG CGCCGCCGCG CGGCCCGCGC GCGGTCAGCG AGGCGGGCGA TTTCGCCGCG
AGCGTCGAGG AGCGCAAGTT CGTCTGCCTG TTCGACGACG GGCGGCTGCT GATCGCCGAA
GGCCACGAGA TGAATCCGTT CGTGCTGTCG TATCGCGCGC GCCTCGACCG GATGGGCCGG
CCGTACCGGC CGACGCCGGC GACGCTGATG CAGGTGCGCG AGGCGTACCG CCAGCACGTG
GCGGGCGGCG GCGAGCGGCT CGACCACACG GTGATGCAGG TGCTCGCGAA GGAGCTGATC
GGCCGCGCGT GCCGCGAGCG CGCATCCGAC GTGCACATTC GCGTGCGCCG CTTCAGCACG
GAGGTCTACT TCCGGATCCA CAACGAACTC ATGCGCGTGA ACGAGCACAC GCGCGAGCAC
GGCGAGCGGC TGCTCGCGAC GCTGTATGGC GCGATGACGA CCGTCTCGGA CAACAGCTAC
CGGCCGAGCG AGCGGCAGGA CGCGAGCATC GGCGACCGCG ACAAGCTGCC GGACGACCTG
TACGGCGTGC GGATCGCGAC GACGCCGACG AGCGAGGGCA GCCTGATGGT GCTGCGGCTG
CTGTACAACG ATGCGGGCGA CGCGACCGAT CTCGCGGCGC TCGGCTTCGC GCCCGAGCAC
GTCGCCGCGT TCCGCATGCT GCGCGCGCAG CCGCACGGGA TGAACATCAT CAGCGGCCCG
ACGGGCTCCG GCAAGTCGAC GACGCTGCAG CGCATGCTCG CCGCGCAGAT CGACGAATCG
CACGGCAGCC TGCACGTGAT CACCGTCGAG GATCCGATCG AATATCCGAT CGACGGCGCG
GTGCAGACGC CGGTGGCGAA CGCGCCGACG GAGGACGCCC GCGCGCTCGC GTTCGCGGCG
GCGATCACCA ACGCGATGCG GCTCGATCCG GACACGATCA TGATCGGCGA GATCCGCGAT
CGCGCGTCCG GGCAGACGGC GCTGCGCGCG TCGATGACGG GCCATCAGGT GTGGACGACC
GTGCACGCGA ACAGTGCGCT CGCGATCGCC GATCGCCTGA TCGATCTCGG CCTGCACGCG
CGGATGATCA CCGATCACAC GGTGATCTCC GGGCTCATCA GCCAGCGCCT CGTGAAGCTG
CTGTGCCCGC ACTGCAAGGT GCGGCTCCTC GATCATGCGG AGCGGATCGA GCCGGGCCTG
CTCGCGCGGC TGCGGCTCGC GCTCGACACC CGGATGAGCG AGGTCTGCAT CACGGGCGAC
GGCTGCGAGC AGTGCGGCAT GGCCGGCACG ATCGGCCGCA CGGTGGTGGC CGAGGTGATC
CTGCCCGACG CGCGGCTCTT CGAGTTCCTG CGCGACGGCG ACAAGGTCGG CGCGCTCGAG
TACTGCACCC GCACGCTCGG CGCGATGACG CTCGCCGAGC ACGCACTGCG CAAGGTCGCG
GCGGGGCTCG TCGATCCGCG CGGCGTCGAG CGCGTGGTCG GCGCGCTCGC GCCGGTGACG
GGCGACGCGC GACAGCAGCT GTCGCTCGTC GGATTCAGCT ATGGCACTTG A
 
Protein sequence
MSHDAQRPDT AAARATPSAA DAAAPPRGPR AVSEAGDFAA SVEERKFVCL FDDGRLLIAE 
GHEMNPFVLS YRARLDRMGR PYRPTPATLM QVREAYRQHV AGGGERLDHT VMQVLAKELI
GRACRERASD VHIRVRRFST EVYFRIHNEL MRVNEHTREH GERLLATLYG AMTTVSDNSY
RPSERQDASI GDRDKLPDDL YGVRIATTPT SEGSLMVLRL LYNDAGDATD LAALGFAPEH
VAAFRMLRAQ PHGMNIISGP TGSGKSTTLQ RMLAAQIDES HGSLHVITVE DPIEYPIDGA
VQTPVANAPT EDARALAFAA AITNAMRLDP DTIMIGEIRD RASGQTALRA SMTGHQVWTT
VHANSALAIA DRLIDLGLHA RMITDHTVIS GLISQRLVKL LCPHCKVRLL DHAERIEPGL
LARLRLALDT RMSEVCITGD GCEQCGMAGT IGRTVVAEVI LPDARLFEFL RDGDKVGALE
YCTRTLGAMT LAEHALRKVA AGLVDPRGVE RVVGALAPVT GDARQQLSLV GFSYGT