Gene BURPS668_1567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1567 
Symbol 
ID4883252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1532328 
End bp1533857 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content70% 
IMG OID640127495 
ProductTPR repeat-containing protein 
Protein accessionYP_001058608 
Protein GI126439247 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.085115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCGG AACTCGACGC CCGTGTGCGC GCGCTCACGC TGCGCGCGCA GCAACTGTTC 
GACGACAGCC GCCCGGAGCA GGGCGCGCTG CTCGCCGCGC AGGCGCTCGC GCTCGCGCCG
GACGACGCGC TCGCGCTGAA GCTCGTCGGT GTCGCCGAGT GCATGCGCGG CGATCATGCG
GCGGGGCTCG TGTATCTCGA GCGGGCGTGC CTGTCGGCGC CCGGGGACGC GAATCTGCAC
TACAACGTGG CCGTCGCGCA CGAATGCACC GGCTCGCACG AGCGCGCGGC GTTGAGTTAC
CGCCACTGCC TGCGTCTGCA GCCCGATCAT GCTGACGCGC TGTGGAACTA CGGCGAATAC
CTGCGCCTGA ACGGCCATTT CGAGGCGGCC GCGCGCTGCT TCGAGGCGCT GCAGGCGCAG
GCGTGCCGCT ATCCGTCGAT GCATCACCGG ATGGCCGTCG TCTATACGCA TCTGCATCGC
TTCGACGACG CACGGCGGCA TTTCGCGCTC GCGATGGACG AGAACATCGA TACGCGCGTC
ACGCGCTGGG AGCGCGCGCA TTTGCGGCTC GGCACGCGCG ATTTCGCACG CGGCTGGCCG
GACTACGACA CGCGCTTCGA CATCGGACAT CTGATCAACG TCCACTGCCA TCCGTTTCCG
ATCCGGCTCT GGCAAGGCGA GCCGCTCGCG GGCAAGACGC TGCTCGTGCA CGGCGAACAG
GGGCTCGGCG ACGAGATCAT GTTCGCGTCG ATCGTGCCGG ACATTGTCCG GCAAGCGGCG
CGCGTCGTGC TCGCCTGCGC GCCGTCGCTC GTCTCGCTGT TCCAGCGCGC GTTTCCGTCC
GCGATCGTGC GCGCGCACCG CGCGGGCGTC GCGCCCGCGC GCGTGGACGA TCTCGGGGCG
ATCGACTATC AGTCGCCGAT CGGCAGCCTG CCGCGCTGGC TGCGCGCGAG CGAGGCATCG
TTCGGCACGG GCGCGCCGTA TCTGGCGGCG GACCCGGCGC GCGTCGCATG GTTCGGCGCG
CGGCTGCGCG CGTTGGCGCC GCGCGCCGAT CGCGCGCTGA AAGTCGGCTT GACATGGGGA
TCGAATCCCG CGGCGGCGGT GCCGTCCGCC GCGCGCCGCG CCACGCGCAA GAGCATGCCG
CTGCGGTTGC TCGCGCCGCT CGCGCGGGTG CCGGACGTGC AGTACGTGAG CGTTCAGAAC
GCCGAGCTGG GCGAGCAGGC CGCGACCGTG CCCGAGCTCG ATCTGATCGA TTTCAGCAGC
GCGCTTCGGG ATTTCGCCGA CACCGCGGCG CTCGTCGCGA ATCTTGACGT CGTCGTGAGT
GTCGATACGT CGGTCGCGCA TCTGGCGGGC GCGCTCGGCA AGACGGCCTA TACGCTGCTG
ATGCGCAATT GCGACTGGCG ATACGGATTC GAGGGCGAGC GCTGCGTCTG GTACGAATCG
ATGACGCTGC TGCGCCAGAC GACGCAGGAC GATTGGCTGC CGGTCGTCGA TCGGGTGATC
GACGCGCTGG CGCGGTATCG CAAGCAATAA
 
Protein sequence
MTAELDARVR ALTLRAQQLF DDSRPEQGAL LAAQALALAP DDALALKLVG VAECMRGDHA 
AGLVYLERAC LSAPGDANLH YNVAVAHECT GSHERAALSY RHCLRLQPDH ADALWNYGEY
LRLNGHFEAA ARCFEALQAQ ACRYPSMHHR MAVVYTHLHR FDDARRHFAL AMDENIDTRV
TRWERAHLRL GTRDFARGWP DYDTRFDIGH LINVHCHPFP IRLWQGEPLA GKTLLVHGEQ
GLGDEIMFAS IVPDIVRQAA RVVLACAPSL VSLFQRAFPS AIVRAHRAGV APARVDDLGA
IDYQSPIGSL PRWLRASEAS FGTGAPYLAA DPARVAWFGA RLRALAPRAD RALKVGLTWG
SNPAAAVPSA ARRATRKSMP LRLLAPLARV PDVQYVSVQN AELGEQAATV PELDLIDFSS
ALRDFADTAA LVANLDVVVS VDTSVAHLAG ALGKTAYTLL MRNCDWRYGF EGERCVWYES
MTLLRQTTQD DWLPVVDRVI DALARYRKQ