Gene BURPS668_1794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1794 
Symbol 
ID4881888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1769856 
End bp1771790 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content66% 
IMG OID640127722 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001058833 
Protein GI126441494 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.061359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAAGC GGTTCGTTGG CGTTCCGAAC GAATTCGCTC GCGCAACGGC TCAGGCCGGC 
AGCGCGTGGG CAAGCCGCCC ATCGCGCAAA AGCCGAACGC TCGCCGCGCA GGAAACGAGC
GGTTGTTACG GAATTGCCGC ATTAGGAGCG CCACTTATTC AGCGTGCTTC CTCGCGGCCG
ATATATAGCG GACGACGCGT CGCACTCAAA CCGGAGAACA CCGTGGGCTT TTTGAACACC
TTTTCGTTGC GCAGATACGC CTCCCTGCTG CTGGGCCTGC TCGCCGTAGC GGCCGTTGCG
ATAGGCCTAT GCCAGTGGCG GCTCGATGCC GCGAATCACC GCGTCGCGCA GGCCTACCAG
CAGCGCTATG TGTCGACCCA GCTCGCGAAC GAGCTGCGCC GCAGCTCCGA CGATTTGACC
CGGCTTGCGC GCACCTATGT CGCGACGGGC GACGCGAAAT GGGAGCAGCA ATACAACGAG
ATACTCGCGA TTCGCTCGGG CAAGGCGCCG CGCCCGCTCG ATTACGATCG TATTTACTGG
GACTTTCGCG CGGCCGACGA ACCGCTGCGC GCGGAGCAGG GCGAGACGAT CTCGCTGCAG
GAGATGATGA AGCGCGCCGG CTTCACCGAC GAGGAGTTCG CGAAGCTGCA CGACGCCGAG
CAGAACTCGA ATGATCTCGT GAAGACCGAG ACGGTCGCGA TGAATCTCGT CAAAGGGTTG
ACGCCCGACG ACGCAGGCCA TTTCACGAAG CAAGGCCCGC CCGATCTCGA GAAGGCGCGG
GCCCTGATGT TCGACGCGAA CTACCACCGG TTCAAGGCGA AGATCATGCA TCCCATCGAC
GATTTCCTGA AGCTGCTCGA CGCGCGCACC GAAGGCGCGA TCGCCCGCGC GCAGGCAAGC
GCGCAAACGT GGAAGATCAT TTCGGCCGCA GTGGCGATCG GCATCCTCGC TTTCTTCGCG
CTGATGCTGC ACATGATGTT CAAGCGCGTG CTCGCCGGGC TCGAGGCGGC CGCCTCGACC
GCGAGCCGCG TCGCCGCGGG CGATCTGACC TCTCATTTCG ACGCGCACCG CGTCGACCCG
CAGGCGAAGG ACGAGATTTC CCGCGTGATG CGCGCGCTGC AGACGATGAA CGACGGGCTC
GTGCGCATCG TGACCGACGT GCGCTCGGGC ACCGATACGA TCGCGACGGC GTCCCACGAA
ATCATGGCCG GCAACAACGA TCTGTCCGCG CGAACCGAGC AGCAGGCCGC GTCGCTGCAG
GAAACGGCCG CGAGCATGAG CGAGCTCACC GCCACCGTGA AGCTGAACCT CGAGAACGCG
CGGCAGGCGA ACATGATCGG CTCCAACGCG GTGTCGACGG TCGAGAAGGG CTCGGTCTCG
GTCGAGCAAC TCGTCACCAC GGTCAACGCG ATCAGCACGA ACTCGGGCAA GATCGCCGAC
ATCATCTCGC TGATCGAGGG CATTGCGTTC CAGACGAACA TCCTTGCGCT GAACGCCGCC
GTCGAGGCGG CGCGCGCGGG CGAGCAGGGC CGCGGCTTCG CCGTCGTCGC GAGCGAAGTG
CGCAGTCTCG CCCAGCGCTC GTCGTCGGCG GCCAAGGAGA TCAAGGATCT CATCGAGACG
TCGATCGATA CGGTTCGGGA CGGCGTGTCG AAGGCCGACG AGGTCGGGCA GCACATCGTG
GAGGTGAAGC AGGCGATCCG GCGCGTCGCC GATCTCGTCG GCGAGATCAC CGCGGCGTCG
GAAGAGCAAA CCCGCGGCAT CGAGCAGGTC GATGCCGCCG TCAGCCAGAT GGACCGCGTG
ACGCAGCAAA ACGCGGCCCT CGTCGAGCAG GCCGCGGCCG CGTCGAAGGC GATGGACGAT
CAGGCCGGCA ACCTGCGCGC GGCGGCGTCG ATCTTCAAGC TGCCCGGGCG CGCGAGCCTG
TTCGCGCACG CGTAG
 
Protein sequence
MRKRFVGVPN EFARATAQAG SAWASRPSRK SRTLAAQETS GCYGIAALGA PLIQRASSRP 
IYSGRRVALK PENTVGFLNT FSLRRYASLL LGLLAVAAVA IGLCQWRLDA ANHRVAQAYQ
QRYVSTQLAN ELRRSSDDLT RLARTYVATG DAKWEQQYNE ILAIRSGKAP RPLDYDRIYW
DFRAADEPLR AEQGETISLQ EMMKRAGFTD EEFAKLHDAE QNSNDLVKTE TVAMNLVKGL
TPDDAGHFTK QGPPDLEKAR ALMFDANYHR FKAKIMHPID DFLKLLDART EGAIARAQAS
AQTWKIISAA VAIGILAFFA LMLHMMFKRV LAGLEAAAST ASRVAAGDLT SHFDAHRVDP
QAKDEISRVM RALQTMNDGL VRIVTDVRSG TDTIATASHE IMAGNNDLSA RTEQQAASLQ
ETAASMSELT ATVKLNLENA RQANMIGSNA VSTVEKGSVS VEQLVTTVNA ISTNSGKIAD
IISLIEGIAF QTNILALNAA VEAARAGEQG RGFAVVASEV RSLAQRSSSA AKEIKDLIET
SIDTVRDGVS KADEVGQHIV EVKQAIRRVA DLVGEITAAS EEQTRGIEQV DAAVSQMDRV
TQQNAALVEQ AAAASKAMDD QAGNLRAAAS IFKLPGRASL FAHA