Gene BURPS668_A0162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0162 
Symbol 
ID4888347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp142344 
End bp143645 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content66% 
IMG OID640130103 
Producthypothetical protein 
Protein accessionYP_001061168 
Protein GI126444974 
COG category[N] Cell motility
[S] Function unknown 
COG ID[COG1360] Flagellar motor protein
[COG3455] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family
[TIGR03350] type VI secretion system OmpA/MotB family protein 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCGG GCGCCTTTGA CGAAGCCGCG TTCGATTTTC TCGTATCGGA CGAAGGCGGA 
GCGAACGTCG CGGCCGATCG CGCGCGATCG GCCGCGAAGG CCGCGCGCGC GGAGCCGGCG
CGCGATCGGA TCGACGCGTC GGTCAACGTT CAGCATCGCC TCGACGAAGT TCGCGCGGCG
GCCAATCCGC TGCTCGAAGC CGCGACGCCT TTGCTGCGGA TGCTGGCGGA CATGCCTGCG
ACGCTAGACA CCTCCGAAGC CGTCGCGAGC CTTCGCACGC TGCTCGTGCG CGAGGTCGCG
CTGTTCCAGA ACTTGTGCGA GAAGGCCGAT TTGCCTTGGA AGCACATGGC GGTCGTGCGC
TATTGCCTGT GCACCGCACT GGACGAGGCG GCCAATCGCA CGCGCTGGGG CGGCGGCGGC
GTATGGGCTT CGCAGAGCTT GCTGATTACA TACGAAGGCG AGGTCGACGG AGGCGAGAAG
TTCTTCCTGT TGATTGGACG CATGGCGACG GATCCTCAGG AGTACGTCGA TATTCTCGAG
ATTCTCTATC GCGTGCTGGG TCTCGGGTTC GAAGGGCGCT ACAGCGTCGT TGCGGACGGC
CGGCGGCATC TCGAGCAGAT TCGCCAGCGT TTGTGGACGC TCATCACCGG CGCGCGCGAC
GCGATTCAGC CGGAATTGTC GCTGCGCTGG CGCGGAGCGG AGCCGGGCAG GCTTCCGCTG
TTGCGCAGCG TGCCCGCATG GGCGAGCGGT GCGCTCGTCT TCCTGCTGTT GTTCGGTCTC
TTCGCGTTCT ATCAGTATCG GTTGCTCACC GAACGCAATG CGCTCGAGGC GCGCATCCTG
GCGATCGCGA AGGAGGAGCC GGCGACGCCA CCGCAGCGTT TGCGCCTGTC GATCCTGCTC
AAGAACGAAA TTGCGCGCGG TCTACTGACG GTCGACGAAG ACGATAGGCG CAGCGTGGTC
GTGTTCCGCG GCGATGCGAT GTTCCGCTCC GGCGAGAGCC GCGTGCTTCC CGAGATCGAG
CCGGTGCTCG ACAAGGTGGC GCGGGAAGTG GCCCGCGTCG GCGGCCGGGT CACGGTCACG
GGGCACTCCG ACAACCAGCC GATTCGGCGC GCCGACATCC CGAACAATCT CGTGTTGTCG
GAAAAACGCG CGGCTTACGT CGCGCAAATC CTGGTCCAGC GAGGCACTCC GGCCGATCGT
ATTCGCTCGG AGGGCCGGGG CGACGCGCAG CCCGTCGCGG AAAACGCGAC GCCTGCTGGC
CGATCCCGCA ACCGCCGCGT CGAAATCGCG GTCACGCTGT AA
 
Protein sequence
MNSGAFDEAA FDFLVSDEGG ANVAADRARS AAKAARAEPA RDRIDASVNV QHRLDEVRAA 
ANPLLEAATP LLRMLADMPA TLDTSEAVAS LRTLLVREVA LFQNLCEKAD LPWKHMAVVR
YCLCTALDEA ANRTRWGGGG VWASQSLLIT YEGEVDGGEK FFLLIGRMAT DPQEYVDILE
ILYRVLGLGF EGRYSVVADG RRHLEQIRQR LWTLITGARD AIQPELSLRW RGAEPGRLPL
LRSVPAWASG ALVFLLLFGL FAFYQYRLLT ERNALEARIL AIAKEEPATP PQRLRLSILL
KNEIARGLLT VDEDDRRSVV VFRGDAMFRS GESRVLPEIE PVLDKVAREV ARVGGRVTVT
GHSDNQPIRR ADIPNNLVLS EKRAAYVAQI LVQRGTPADR IRSEGRGDAQ PVAENATPAG
RSRNRRVEIA VTL