Gene BURPS668_A0379 details

Gene Information

Locus tag: BURPS668_A0379
Symbol:
ID: 4888769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 342785
End bp: 344413
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 71%
IMG OID: 640130320
Product: methyl-accepting chemotaxis protein
Protein accession: YP_001061385
Protein GI: 126444166
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
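
The start and end coordinates, gene length, and protein length in this record agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(342785, 344413)
print(length_bp)                  # 1629
print(protein_length(length_bp))  # 542
```

Both printed values match the Gene Length and Protein Length fields listed in the record.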


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.366227
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAGCTTC GCCGCAAAAT TCCCCTCGCC TTCGCGACAG CACTCGTCCT GACGTCCGCG 
AGCGCGTTCT ACGGCATCCA CGCGCTCAAC CGCTCGCTCG ACACATACGG CACGACGGTT
CGGCAGAACG TTGCGAACGA GCGGATGGTG TCCGCGACGC TCGTCGCGTT CAAGCTGCAA
GTGCAGGAAT GGAAGGACAC GCTGCTGCGC GGCAAGGACC CCGCGAAACT CGATAAATAC
TGGCGCGCGT TCCAGCAGCG CGAGCAAACC GTCGACGCGC TCGCCGCCGA GCTGAAAGCG
AAGCTGCCCG ACGGCGAGAG CCGCGGGCTG ATCGAGCAGT TCGCCTCCGC GCATGCGGAA
ATGGGGCAGG GCTATCGTAA GGGATTCGAA GCATTCAGGG CCTCGGGGTT CGACCCGTCC
GCGGGCGACC AGGCCGTCGC GGGCGTCGAT CGCGCGCCCG CCGTGCTGCT CGAGAAAGCC
GCGCGGGACA TCGCCGCCGA CAGCGCGCGC GTATCGGCCG ACGCCGCGAG CGACGCCGCG
CACGCGACGG CGATCAGCAT CGCCGCGACG CTCGCCTTGT TCGCGCTCTC GCTCGCCGGC
GGCGTGTGGT TCGGCGGCAC CGTCACGCGG CCGCTCGAGC GCGCGCTCGC ATGCGTGCGG
CGAGTGGCCA CGGGCGACTT GTCGACGCCG ATCGACGCGC GCGGCCGCGA CGAGATCGCC
GAGCTGCTCG CCGCGCTGAA AGACATGCAG GCAAGCCTGT CGCACGTCGT GCGCGACGTG
CGGCACAACG CCGACGGCGT CGCCACCGCG AGCGCGCAGA TCGCGTCGGG CAATCTCGAT
CTGTCGTCGC GCACCGAGGA ACAGGCGGCA TCGCTCGAGG AAACGGCGGC GAGCATGGAC
GAGCTCACGT CGACCGTGCG CCGCAACGCC GAGCATGCGC AGCATGCGTG CGCGGTGGCC
GCCGGCGCAT CGACGAAGGC GGCGCGCGGC GGCGACGTGA TGCGTCAGGT CGTCGATACG
ATGCGCGGCA TCGCGGACAG CTCGGGCAAG GTCGCCGAGA TCATCGCGGT GATCGACGGC
ATCGCGTTCC AGACCAACAT CCTCGCGTTG AATGCGGCCG TCGAGGCCGC GCGCGCGGGC
GAACAGGGGC GCGGCTTCGC GGTGGTCGCG GGCGAGGTGC GCACGCTCGC GCAGCGCAGC
GCGACGGCGG CGCGCGAAAT CAAGACGCTG ATCGAGCAGT CGACCGAGCG CGTCGGCGCG
GGCTCCGCGC TCGTCGACGA TGCGGGGCGG ATCATCGGCG AGATCGTCGA TTCGGTGCGG
CAGGTGACGG GCATCGTCAG CGAGATCGCA GCGGCATCGA ACGAGCAGAG CGTCGGCATC
GAGCAGGTCA ATCGCGCGGT CGCGCAAATG GACAACGTCA CGCAGCAGAA CGCGGCGCTC
GTCGAGGAAG CGTCCGCGGC GGCGCATGCG CTCGCCGAAC AGGCGCACGC GCTGCATGGC
GCGGTTGCGG TGTTCTCGCT GCATGGCGAG CGAGGGGGCG AGCGCGGCTG TGCGCGTGCC
GGGCAGCCGG CCGTCGAAGC GGCGCACGAT TCGCCGCGCA CGCCGCTGGC GGTCGTCGCG
CCGGCCTGA
 
Protein sequence
MKLRRKIPLA FATALVLTSA SAFYGIHALN RSLDTYGTTV RQNVANERMV SATLVAFKLQ 
VQEWKDTLLR GKDPAKLDKY WRAFQQREQT VDALAAELKA KLPDGESRGL IEQFASAHAE
MGQGYRKGFE AFRASGFDPS AGDQAVAGVD RAPAVLLEKA ARDIAADSAR VSADAASDAA
HATAISIAAT LALFALSLAG GVWFGGTVTR PLERALACVR RVATGDLSTP IDARGRDEIA
ELLAALKDMQ ASLSHVVRDV RHNADGVATA SAQIASGNLD LSSRTEEQAA SLEETAASMD
ELTSTVRRNA EHAQHACAVA AGASTKAARG GDVMRQVVDT MRGIADSSGK VAEIIAVIDG
IAFQTNILAL NAAVEAARAG EQGRGFAVVA GEVRTLAQRS ATAAREIKTL IEQSTERVGA
GSALVDDAGR IIGEIVDSVR QVTGIVSEIA AASNEQSVGI EQVNRAVAQM DNVTQQNAAL
VEEASAAAHA LAEQAHALHG AVAVFSLHGE RGGERGCARA GQPAVEAAHD SPRTPLAVVA
PA
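
The sequence blocks above are laid out in space-separated groups of ten, so they need whitespace stripped before any computation. The sketch below shows one way to compute GC content and to translate the gene sequence for comparison with the protein sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and it does not model alternative start codons. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any library.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: translation table 11 shares these codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence from this record.
first_row = "ATGAAGCTTC GCCGCAAAAT TCCCCTCGCC TTCGCGACAG CACTCGTCCT GACGTCCGCG"
print(translate(first_row))  # MKLRRKIPLAFATALVLTSA
```

The printed residues match the first two groups of the protein sequence (MKLRRKIPLA FATALVLTSA). Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.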