Gene BURPS668_A0240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0240 
Symbol 
ID4886830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp216364 
End bp217551 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content65% 
IMG OID640130181 
Productflagellar hook protein FlgE 
Protein accessionYP_001061246 
Protein GI126444045 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTCA ATATCGCCCT ATCGGGCATC AACGCCATCA ACAGCCAGCT GAACACGATC 
AGCCACAACA TCGCGAACGC CAACACCTAC GGCTTCAAGG CCGGGCGCGC GAACTTCGCG
ACAATGGTCG CGGGCACGCA GGCCAACGGC ACGTACATCG GATCGATCAC GCAGAGCGTC
GGCACGCGCG GCGGCTTTCT GCCGACCGGC CGCGCGCTGG ACGGCGCGAT CGACGGCAAC
GGCTTCTTCG TGACGAAGGA TAGCGACGGC TCGATGCTCT TCACGCGCTT CGGCTACTTC
CAGCGCGACG CGGACGGCTA CATCGTCGAC GGCTTCGGCA GGCGCGCGCA GGGTTACGGC
GCGAGCGGCG CCTATGGCGA CATTCGCGTG CCGACCGACA CGGAGCCGGC GAAGGCGAGC
GACAGCCTCG AATATGTGGG CAATCTGTCC GCCGACTGGG CGGCGCCGAA GAACCCGAAT
TTCGATAAGG ACGACGACAC TTCCTTCAAT CATTCGGTCG CTTCGACCGT CTACGACTCG
CTCGGTCGTC AGCACATCGT GACGCAGTAC TTCGTGAAGG GGCAGCCGCC GTCGACCGAC
GTCATCGCCT ATTACGCGAT GGACGGCGAG ATCGTCGGCG GCAATACGCC GGTCCAGACA
GTGTTGCAGT TCGACACGAA CGGCCAACTG ACCGCGCCGA ACTCGCCGGT GGATCTCGAT
CTCGGTACGC CCGCCGGCGC GTCCGCGCTC GCGATCAAGG TCAACTACGC CGGCACGACG
CAAGTCGCCG GCGAGACGAC GACGACGGTC AATCGCGACA ATGGCTACGC AGCAGGCGTG
CCGGGAGAGG TGATGCTCGA CGAGGACGGC GGCGTCGTCG TGCAGTACAG CAACGGCAAG
CAGCGCAAGG TCGGCTCGCT CGCGCTCGCG ACCTTCGCGA ACCAGGATGG CCTGAGCGCC
GTGGGCGATA CCGCCTGGCG CGCGTCGGCC GCGTCCGGCA ATCCGTTGAT CGGCTCGGCC
GGCTCCGGCG CGCTCGGCAA GGTGGTGGCC GGTTCGCTCG AGCTGTCGAA CGCGGACGTC
ACGCAGGAGC TCGTCGACAT GATGAGCGCG CAGCGCAACT ACCAGGCCAA CTCGAAGGTG
CTGTCGACCG AGAACCAGAT GATGCAGGCG TTGATGCAGG CGCTGTAA
 
Protein sequence
MSFNIALSGI NAINSQLNTI SHNIANANTY GFKAGRANFA TMVAGTQANG TYIGSITQSV 
GTRGGFLPTG RALDGAIDGN GFFVTKDSDG SMLFTRFGYF QRDADGYIVD GFGRRAQGYG
ASGAYGDIRV PTDTEPAKAS DSLEYVGNLS ADWAAPKNPN FDKDDDTSFN HSVASTVYDS
LGRQHIVTQY FVKGQPPSTD VIAYYAMDGE IVGGNTPVQT VLQFDTNGQL TAPNSPVDLD
LGTPAGASAL AIKVNYAGTT QVAGETTTTV NRDNGYAAGV PGEVMLDEDG GVVVQYSNGK
QRKVGSLALA TFANQDGLSA VGDTAWRASA ASGNPLIGSA GSGALGKVVA GSLELSNADV
TQELVDMMSA QRNYQANSKV LSTENQMMQA LMQAL