Gene BURPS1106A_A0237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0237 
Symbol 
ID4904859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp225273 
End bp226538 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content71% 
IMG OID640143344 
Producthypothetical protein 
Protein accessionYP_001074280 
Protein GI126457554 
COG category[N] Cell motility
[S] Function unknown 
COG ID[COG1360] Flagellar motor protein
[COG3455] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family
[TIGR03350] type VI secretion system OmpA/MotB family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCATCATC CGGACATATC GGCTCCGGCC CCGACCTTCG ATTCCGTCGC GGCGACGCTC 
GCGCGGCGCG AGCCGGCGCC TGCGCCGGCC GGCGAGCCGC CCGCCGCGCG CCTCGCCGCG
ATCAGGCTCG CCCGCAACCC GCTGCTCGAA GCCGCGCGCG TGCTGCTGCG GGCGCTCGCC
GACATGCCCG AGCGGCTCGA TCGCGACGAC ATTCCGCAAT TGCGACTGCT GCTGGAACAG
GAGGTGCGCC TGTTCCAGCG GCTCTGCGAA CAGGCGAACA TCCGGCGCGA CCACATGCTC
GGCGCGCGCT ACTGCCTGTG CACCGCGCTC GACGAGGCGG CGATGCAGAC GTCGTGGGCA
CAATCGGCGA GCGGCAATCT CGGCACGTGG ATCAGCGAGG GGCTCGCGAC GTCGTTCCAC
GAGGATCGCC AGGGCGGAGA CAAGGTCTAT CTGCTGATCG GCCGGCTGAT GAATTCGCCG
CACGAGCACA TCGACCTGCT CGAAGTCATC TATCGAATCT TGAGCCTCGG CTTCGAGGGC
CGCTACCGTT ACGAAGCCGA CGGCCAGCGC AAGCACGAGA CCGTGCGCCA GCGGCTCTAC
AACGAGATCG CATCGCAGCG CGGGCCGGTG TCGGTCGCGC TGTCGCCGCA CTGGCAGCCC
GGCCCCCGCA GCAGGAGCGC GCCGTTTCGC GATTTCCCCG CGTGGGTCAC GGCCGCCGTG
CTGTCGCTGA TCGCGCTCGG GCTGTTCGGC TGCTTCAAGT ACGCGCTGTC GACGCGCAGC
GCCGACGTGC AGCAGCGGAT CGCCGCGATC GCGCGGATGG CGCCGCCCGC CGCGCCGGCC
GAGCTGCGCC TCGCGACGCT GCTCGCCGGC GAGATCGCGG CGGGCACGCT CAGCGTCGAG
GAAAACGCGC GCCGCAGCTC GGTGACGTTC CGCGGCGACG CGATGTTCGC GCCGGGCGCG
GCCGGCGTGA ACCCGGCGAT GGGGCCGCTC ATCCGGAAAA TCGCGGCCGA GATCGCGAAG
GTGCCGGGCA AGGTGACGGT GCGCGGCTAC ACCGACAATC AGCCGATCAA AAGCCGCCAG
TTCGCGTCGA ACGAGGCGCT ATCCGAAGAG CGCGCGACGC AGGTCATGCA GATGCTCCAG
AGCGCGGGCG TGCCCGCGAG CCGCCTCGAG GCGCTCGGCA AGGGCGGCGC CGAGCCGATC
GGCGACAACC GGACCCCGCA GGGCCGCGCG CTGAACCGCC GCGTCGAAAT CACGGTCGCG
CGCTGA
 
Protein sequence
MHHPDISAPA PTFDSVAATL ARREPAPAPA GEPPAARLAA IRLARNPLLE AARVLLRALA 
DMPERLDRDD IPQLRLLLEQ EVRLFQRLCE QANIRRDHML GARYCLCTAL DEAAMQTSWA
QSASGNLGTW ISEGLATSFH EDRQGGDKVY LLIGRLMNSP HEHIDLLEVI YRILSLGFEG
RYRYEADGQR KHETVRQRLY NEIASQRGPV SVALSPHWQP GPRSRSAPFR DFPAWVTAAV
LSLIALGLFG CFKYALSTRS ADVQQRIAAI ARMAPPAAPA ELRLATLLAG EIAAGTLSVE
ENARRSSVTF RGDAMFAPGA AGVNPAMGPL IRKIAAEIAK VPGKVTVRGY TDNQPIKSRQ
FASNEALSEE RATQVMQMLQ SAGVPASRLE ALGKGGAEPI GDNRTPQGRA LNRRVEITVA
R