Gene BURPS1106A_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0234 
SymbolfliF 
ID4903064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp219248 
End bp221044 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content70% 
IMG OID640133464 
Productflagellar MS-ring protein 
Protein accessionYP_001064517 
Protein GI126453288 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1766] Flagellar biosynthesis/type III secretory pathway lipoprotein 
TIGRFAM ID[TIGR00206] flagellar basal-body M-ring protein/flagellar hook-basal body protein (fliF) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTCGC AGGCCAACTC GCTGATCAAC CCCGACGCCC GTTCGAGCCT TGCGGGCGCA 
TCGCCGCAAG CCGCGGCCGC GGCGGGCGCG CTGCCGGGCG CGGCGGCGGG CGGCGCGGAT
TTCGGCCTGG GCGGCTTCGC CGAACGCATC CCGGGCCTCT CGCGAATGAA GACGAACCCG
CGGCTGCCGT TCCTGATCGG CGCGGCGCTC GCCATCGCCG CGATCGTCGC GCTCGTGCTC
TGGAGCCGCG CGCCCGACTA CCGCGTGCTG TACAGCAACC TGTCCGACCG CGACGGCGGC
GCGATCATCG CCGCGCTCCA GCAGGCAAAC GTTCCCTATA AGTTCGCCGA CGCGGGCGGC
GCGATCCTCG TGCCCGCGAA CCAGGTGCAC GAGACGCGCC TGAAGCTCGC CGCGATGGGG
CTGCCCAAGG GCGGCTCGGT CGGCTTCGAG CTGATGGACA ACCAGAAATT CGGCATCAGC
CAGTTCGCCG AGCAGGTCAA CTACCAGCGC GCGCTCGAGG GCGAGCTGCA GCGCACCGTC
GAATCGATCA ACGCGGTGCG CGCCGCGCGC GTGCATCTGG CGATTCCGAA GCCTTCGGTA
TTCGTGCGCG ATCGCGAGGC GCCGTCGGCG TCGGTGCTCG TCGATCTGTA CCCGGGCCGC
GTGCTCGACG AAGGGCAGGT GCTCGCCGTC ACGCGCATGG TTTCGTCGAG CGTGCCCGAC
ATGCCCGCGA AGAACGTGAC GATCGTCGAC CAGGACGGCA ACCTGCTCAC GCAGACCGCG
TCCGCCACCG GCCTCGACGC GAGCCAGCTC AAGTACGTGC AGCAGATCGA GCGCAACACG
CAAAAGCGCA TCGACGCGAT CCTCGCGCCG ATCTTCGGCG CCGGCAACGC GCGCTCGCAG
GTGAGCGCCG ACGTCGACTT CTCGAAGATC GAGCAGACCT CGGAGAGCTA CGGCCCGAAC
GGCACGCCGC AGCAGAGCGC GATCCGCAGC CAGCAGACGA GCAGTTCGAC CGAGCTCGCG
CAAAGCGGCG CGTCGGGCGT GCCGGGCGCG CTGTCGAACA CGCCGCCGCA GCCCGCGTCC
GCGCCGATCG TCGCGAGCAA CGGCCAGCCG GCCGGCCCGG CCGCGACGCC CGTCAGCGAC
CGCAAGGATT CGACGACGAA CTACGAGCTC GACAAGACCG TGCGGCACGT CGAGCAATCG
ATGGGCACGA TCAAGCGGCT GTCGGTCGCG GTGGTCGTCA ACTATCAGCC GAGCACCGAC
GCGAAGGGCC GCGTGACGAT GCAGCCGCTC GCCGCGGACA AGCTCGCGCA GGTCCAGCAG
CTCGTGAAAG ACGCGATGGG CTACGACGAG AAGCGCGGCG ATTCGGTCAA CGTCGTCAAC
AGCGCGTTCT CGGCCGCGGC CGATCCGTTC GCGAACCTGC CGTGGTGGCG CCAGCCGGAC
ATGATCGAAC TCGGCAAGGA CATCGCGAAA TGGCTGGGCG TCGCCGCGGC GGCCGCCGCG
CTGTACTTCA TGTTCGTGCG CCCGGCGCTG CGCCGCGCGT TCCCGCCGCC CGCGGAGCCC
GCGGCGGCCG CCGTGCCGGC GCTCGACGGC CCGGACGACA TGCTCGCGCT CGACGGCCTG
CCGAGCCCCG ACAAGAAGCA GCTTGCCGAG GAGGACGAAG AGCATCCGGC GCTCCTCGCC
TTCGAAAACG AGAGGAACCG CTACGAACGC AATCTCGACT ACGCGCGCAC GATCGCGCGC
CAGGATCCGA AGATCGTCGC AACCGTCGTG AAGAACTGGG TGTCCGATGA ACGCTGA
 
Protein sequence
MDSQANSLIN PDARSSLAGA SPQAAAAAGA LPGAAAGGAD FGLGGFAERI PGLSRMKTNP 
RLPFLIGAAL AIAAIVALVL WSRAPDYRVL YSNLSDRDGG AIIAALQQAN VPYKFADAGG
AILVPANQVH ETRLKLAAMG LPKGGSVGFE LMDNQKFGIS QFAEQVNYQR ALEGELQRTV
ESINAVRAAR VHLAIPKPSV FVRDREAPSA SVLVDLYPGR VLDEGQVLAV TRMVSSSVPD
MPAKNVTIVD QDGNLLTQTA SATGLDASQL KYVQQIERNT QKRIDAILAP IFGAGNARSQ
VSADVDFSKI EQTSESYGPN GTPQQSAIRS QQTSSSTELA QSGASGVPGA LSNTPPQPAS
APIVASNGQP AGPAATPVSD RKDSTTNYEL DKTVRHVEQS MGTIKRLSVA VVVNYQPSTD
AKGRVTMQPL AADKLAQVQQ LVKDAMGYDE KRGDSVNVVN SAFSAAADPF ANLPWWRQPD
MIELGKDIAK WLGVAAAAAA LYFMFVRPAL RRAFPPPAEP AAAAVPALDG PDDMLALDGL
PSPDKKQLAE EDEEHPALLA FENERNRYER NLDYARTIAR QDPKIVATVV KNWVSDER