Gene BURPS668_A1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1080 
Symbol 
ID4887746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1036940 
End bp1039003 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content73% 
IMG OID640131020 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001062079 
Protein GI126443794 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.469209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGTGC CATTCCTGCG GCGCGCGAGC GTCGGCGCAC GCCTCGCGGC GCTGTCGAGC 
GCGCTCTTCG CCGTTCTCTT CGCCGCGTTC GTCTGGGCGC TGACCCACGC CGCCGGCAAC
CAGGTCGCCG ATCAGGTCCA CGCGCGCATC GACGAGAAGG ATCGCTCGAT CGCCGCGATG
ATCGCGCTCT TCGACGAAGC GCTCACCGCC GAGGCGAGCC GCGCGATGAC GCTCTTCGCG
AGCTTCCTGC CGGCCGGCTA CGCGCTCGAC GCCGCGCGCA CGATCGACGT CGCGGGCGTC
GCGACGCCCG CGCTCACCGC GGGCGGCCAG ACGCTGAACC TCGATTTCTC GATTCCCGAC
CAGTTCCTGC AAAAGAGCGG CGCGATCGCG ACGATCTTCG CGCGCCGCGG CGACGATTTC
GTCCGCGTGA CGACCTCGCT GAAGAAGCAG GACGGCAGCC GCGCGATCGG CACGCTGCTC
GACCGCAAGG GCCCCGCGTA CGCGCCGCTC GTCGCCGGCC GGACCTACAC CGGTCTCGCG
ACGCTGTTCG GCAAGCGTTA CATCACCCAA TACAAGCCGA TCGCCGACGC GAGCGGCGCG
ATCGTCGGCG CGCTGTTCGT CGGCATCGAC ATCGGCGCGG AAATGCGGCT CGTCGAAAAC
GGCATCCGCC AGTTGAAGAT CGGCGAGCAC GGCTACTACT TCGTGCTCGA CGCATCGGAC
GGCCCCGCGC GCGGCACCCT GCTCGTCCAT CCGGCGCGCG CGGGCCAGCG CGCCGACGAC
GCGGCCGCGC CCTACGCGCA AATGCTCGCC GCGAGGGAAG GCCAGTTGTC CTACACGTCG
ACCGACGCCG CCGCCGGCGA CGACGGCCCG CGCGCGAAGT TCGTGTCGTT CGTGACGGTT
CCGCAGTGGC AATGGCTCGT CGGCGGCATC GCGATCGACG ACGAAGTGAT GGCCGACATG
CGCGCGACCC GCAACCGCTT CGCGGCGATC GGCTGCGCGT TCGTGCTCGC GTTCGCGGCG
CTGTTCGTCG CGGTTGTCAA GCGCGTCGTC AGCCGGCCGC TCGATGCGGC GGCGCACGCG
TCCGAGCGCT TCGCCGCGGG CGACCTCAGC GTGCGGATCG GCGCGCACGA CAAGCACGAC
AAGCACGACA AGCACGACAA GCACGGCAAG CACGATGCAC GCGACGGCGC GGCGCGGCCG
CCCGCGAGCG GCCGCAGCGA CGAGATCGGC CGGCTCGTGC GCGCGGTCGA CGGCATCGGC
GACGGCCTCG CGCGCATCGT CGCGCAGGTG CGGCGCGGCG CGGCCGACAT CGCGCACGGC
ACCGTGACGA TCGCGGCCGG CAGCAGCGAC ATGGCCGCGC GGATCGCGAC GCAGGCAAGC
AGCGTCGAGC AGACCGCGGC GAGCATGGAG CAGATCACGG CGGCCGTTCA GCAGAACGCC
GATCACGCGG CGCAGGCGAG CGCGCTCGCG ACCGGCGCGT CGAGCGCGGC GACGACGGGC
GGCGCGGCCG TGCAGCGCGT CGTCGCGACG ATGGGCGACA TCCAGGGCGT CGCGCGCAGG
ATCGCCGAGA TCACCGGCGT GATCGAAGGC ATCGCATTCC AGACCAACAT CCTCGCGCTG
AACGCGGCCG TCGAGGCCGC GCGCGCGGGC GAGCACGGCC GCGGCTTCGC GGTCGTCGCC
TCCGAGGTGC GCGCGCTCGC GCAGCGCAGC GCGGCGGCGG CCAAGGAGAT CGACGCGCTC
GTCGGCGAAT CGGCGACGAC GGCCGAGCAC GGCTTCCGGA TCGCCGAGGA CGCGCGCGCG
GCGATGCAGG ACATCGTCGC GCGCGTCGAT CAGGTGCGCG CGATCATCGC CGAGATCAGC
GCGGCGTCGC GCGAACAGTC GAGCGGCATC GAGCAGGTGA ATCTGGCCGT CACGCAGATC
GGCGCGGCGA CGCAGCAGAA CGCGACGCTG ATCGCCGACG CCGAGCGCGC GGCTGCCGCG
CTGCGCGACG AGGCCGCGCA GCTCGCGCAC GCGGTCAGCG TGTTCAGGCT CGCGGCGGAC
GAACCCACGC TCGACGCGCG TTGA
 
Protein sequence
MHVPFLRRAS VGARLAALSS ALFAVLFAAF VWALTHAAGN QVADQVHARI DEKDRSIAAM 
IALFDEALTA EASRAMTLFA SFLPAGYALD AARTIDVAGV ATPALTAGGQ TLNLDFSIPD
QFLQKSGAIA TIFARRGDDF VRVTTSLKKQ DGSRAIGTLL DRKGPAYAPL VAGRTYTGLA
TLFGKRYITQ YKPIADASGA IVGALFVGID IGAEMRLVEN GIRQLKIGEH GYYFVLDASD
GPARGTLLVH PARAGQRADD AAAPYAQMLA AREGQLSYTS TDAAAGDDGP RAKFVSFVTV
PQWQWLVGGI AIDDEVMADM RATRNRFAAI GCAFVLAFAA LFVAVVKRVV SRPLDAAAHA
SERFAAGDLS VRIGAHDKHD KHDKHDKHGK HDARDGAARP PASGRSDEIG RLVRAVDGIG
DGLARIVAQV RRGAADIAHG TVTIAAGSSD MAARIATQAS SVEQTAASME QITAAVQQNA
DHAAQASALA TGASSAATTG GAAVQRVVAT MGDIQGVARR IAEITGVIEG IAFQTNILAL
NAAVEAARAG EHGRGFAVVA SEVRALAQRS AAAAKEIDAL VGESATTAEH GFRIAEDARA
AMQDIVARVD QVRAIIAEIS AASREQSSGI EQVNLAVTQI GAATQQNATL IADAERAAAA
LRDEAAQLAH AVSVFRLAAD EPTLDAR