Gene BURPS668_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1983 
Symbol 
ID4883919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1960041 
End bp1961783 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content70% 
IMG OID640127911 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001059018 
Protein GI126438395 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTTC AAAACATGAC AGTGAGCACG AAGCTGACCC TTGCGTTCGG TGCGCTGGTG 
GGACTCGTGC TGCTCGTGTC CGTCCTGGCG CTGCACGCGC TCGGCGATGC GAACGACCGT
TTCGCCAGCT ACGTGAGCGG CATCAGCGCG CGCGCGGAAG CGGCCGAGCA GGTACGCACG
GCCGTGGACC GGCGCGCGAT CGCCGCGCGC AATCTGGTGC TCGTGACGAA GCCCGCCGAC
GTCGAGCTCG AAAAGGCCGC CGTGACGCAG GCGGAGGACG ACGTGCAGGC GCATCTGCGC
CGGCTGAAGG AACTGCTCTC GAGCGCGTCG GACGGGAACG ACAAGGCGCG CGGCCTCGTC
GCCGACATCG ACCGCGTCGA GGCACAATAC GGCCCGGTCG CACTCGCGAT CGTCAACGCC
GCGCTGAACA ATCGGCACGA CGAAGCGATC ACGATGATGA ACGACCAGTG CCGCCCGCTG
CTCGCTCAGC TCGTCAAGGC GACGAACGCC TACAGCGAAT ACACGCGCGG CCGCGCGCAG
GAAATGGTCC GCGAATCGGC CGACCACTAT GCGAGCCAGC GCCTGTTGCT GCTCGGCCTG
TGCGCGGCGG CGATCGGCGC GGCGGTGATC GCGGCGATCC TGATCGCACG GGGCCTGATG
CGCGCGCTCG GCGCCGAACC CGCGACGCTC GGCGACGTCA CGCGGCGCGT CGCGAACGGC
GATCTGAGCC CGGTCGCGGG CGCGCAGACG GCGCCGTCGG GCAGCGTGCT CGCATCGATG
GGCGAGATGC AGGCGAGCCT CGTGCGGCTG ATCGGGCAGG TGAGCACCGC CGCGGACAGC
ATTGCGACGG GTTCGAGCCA GATCGCGTCG GGCAACCAGG ATCTGTCGTC GCGCACCGAG
CATCAGGCTT CGTCGCTGCA GGAAACGGCC TCCAGCATGG AGGAGTTGAC GTCGACCGTG
CGCCAGAACG CGGAGAACGC GCAGCAGGCG AGCTCGCTCG CGGCGAACGC GTCGGAAGTC
GCTCAAAAGG GCAGTACGGT GGTCGGGCAG GTCGTCGACA CGATGACCGA CATCAGCCAG
AGCTCGGAGA AAGTCGCGGA AATCACCGGG ATCATCGAGA GTATCGCGTT CCAGACCAAC
ATCCTCGCGC TGAATGCGGC CGTCGAGGCG GCCCGCGCGG GCGAGCAGGG GCGCGGCTTC
GCGGTCGTCG CGAGCGAAGT GCGCAGCCTC GCGCAGCGCT CGTCGAGCGC GGCGAAGGAG
ATCAAGGATC TGATCAACGC GTCGGTGCAG AAGATCCATG ACGGCTCGGC GCTCGCGGGC
GAGGCGGGCA AGACGATGAC CGAAGTCACG CAGGCGGTCG CGCGCGTGAC CGACATCATG
GGCGAGATCG CCGCGGCGTC CGGCGAGCAG AGCCGCGGCA TCGAGCAGGT GAACCAGGCG
ATCGCACAGA TGGACGAAGT CACGCAGCAG AACGCCGCGC TCGTCGAGGA GGCGGCGGCC
GCGTCGAAGT CGCTCGAAGA GCAGGGGCGC CATCTGACGC AGGCCGTGTC GTTCTTCCGC
GCGAGCGCCG CAAGCGCGGC GCCGCAAGCG CGGCACGCGG CGCCAGCCAA GCCGAAGGCG
AAGCGCGGCG TGGCGGCTCC CGCCCCCGCA CCGCGCGCGG CGCACGCCGC ACCGACGTTC
AACAAACCGG CGCCGGCTCT CGCCGCCGCC GCGACCGCAA GCGACGACTG GCAGACCTTC
TGA
 
Protein sequence
MNFQNMTVST KLTLAFGALV GLVLLVSVLA LHALGDANDR FASYVSGISA RAEAAEQVRT 
AVDRRAIAAR NLVLVTKPAD VELEKAAVTQ AEDDVQAHLR RLKELLSSAS DGNDKARGLV
ADIDRVEAQY GPVALAIVNA ALNNRHDEAI TMMNDQCRPL LAQLVKATNA YSEYTRGRAQ
EMVRESADHY ASQRLLLLGL CAAAIGAAVI AAILIARGLM RALGAEPATL GDVTRRVANG
DLSPVAGAQT APSGSVLASM GEMQASLVRL IGQVSTAADS IATGSSQIAS GNQDLSSRTE
HQASSLQETA SSMEELTSTV RQNAENAQQA SSLAANASEV AQKGSTVVGQ VVDTMTDISQ
SSEKVAEITG IIESIAFQTN ILALNAAVEA ARAGEQGRGF AVVASEVRSL AQRSSSAAKE
IKDLINASVQ KIHDGSALAG EAGKTMTEVT QAVARVTDIM GEIAAASGEQ SRGIEQVNQA
IAQMDEVTQQ NAALVEEAAA ASKSLEEQGR HLTQAVSFFR ASAASAAPQA RHAAPAKPKA
KRGVAAPAPA PRAAHAAPTF NKPAPALAAA ATASDDWQTF