Gene BURPS1106A_A2620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2620 
Symbol 
ID4905869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2561626 
End bp2563314 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content69% 
IMG OID640145723 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001076650 
Protein GI126457959 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTCG CACACATGAA GGTCGCCACC CGCCTCGGGA TCGGCTTCGC GCTGGTCGCC 
TGTCTGCTTG CGGTGATGGT CGCGTTCGCA CTGGACAGGA TGGCGAAGTT CGAAGGCTGG
ATGGTCGAGA TCACCGAAGT AAACAGCGTG GAGGCGAAGC TGGCCGCGAA GCTCGAACTG
AGCATTACCG AGCGCGCGCT CGCGCTGCGC AACCTGATCC TGCTCGATCG GCAGGATGAA
ATGCAGATCG AGCAGGATCG GATCGACGCG AAGGCGAAGC TCTACAGGCA ATCGCGAGAC
AGGCTCGCGA CGATGTTCGC GACGCTCGAC GGCACGCCGC AGGAGCGCGC GCTGCTCGAA
CAGATAGGCC AGCAGGGCGA CGCCGCCGAC GGCTTCATCG CACGTGCGAG GACGATGATT
CTCGCGGGGC AGAAGGACGA TGCGTACAAG CTGCTGCGCT TCGAATTCCG CCCGGTGCAG
GCGAAATGGT GGGCGCTCAC GCGCGAGCTG AAGGCGCTGG AGGAGAAGCA GAACGAGGAG
GCGACCCTGC ACGCGAAGGC GGCCTACGAA GAAGGCCGCA CGTGGATGCT CGTGCTCGGC
GCGCTCGCGC TGGTGTCGAG CGTGGTGTCC GCGTGGCTGA TCACGCGCGG CATCGTGCGC
CAGCTCGGCG GCGAGCCGTG CGACGCCGCT CACGCGGCGA ACATGATCGC CGCGGGCGAC
CTGAGCGTGG CGATCGGCGT GCGCGACGGT GACGAAGCGA GCCTCATGCA CGCGATGAAA
TCGATGCGCG ACAGCCTCGC GGAGATCGTG AGCCAGGTGC ACGTGAGCGC CGACACGATC
GCGACCGCAT CCGGGCAGAT CGCGAGCGGC AATCTCGACC TGTCCGCGCG CACCGAGCAA
CAGGCGGCGT CGCTCGAGGA AACCGCGGCG TCGATGGAGC AACTGACCGC GACCGTTCAG
CAGAACACCG ACAACTCGCG CCAGGCGGAC ACGCTCGCGG CGTCCGCGTC GCACGTCGCG
GAGAAGGGCG GCGCGGCGGT GACGCAGGTC GTCGATGCGA TGGGCTCGAT TCATGCGACG
GCGCAGAAGA TCGTCGAGAT CATCGGCGTG ATCGACGGCA TCGCGTTCCA GACCAACATC
CTCGCGCTGA ACGCGGCCGT CGAGGCGGCG CGCGCGGGCG AGCAGGGCCG CGGCTTCGCC
GTTGTCGCAT CGGAGGTGCG CAGCCTCGCG CAGCGCTCGG CGACAGCCGC GCGCGAAATC
AAGGAGCTGA TCGGCGGCTC GGTCGTGCAG GTCGAGGCCG GCGACCGTCT CGCGAAGGAA
GCGGGCGCGA CGATGCACGA AGTCGTCGAG AGTATCCGGC GCGTGACGCT GATCATGGCC
GAGATCACGC GCGCGAGCGA GCAGCAGACG AGCGGCATCG TCGAGATCGA TCGCGCGATC
ACGCAGATGG ATCAGGTCAC GCAGCAGAAC GCCTCGCTCG TCGAGGAGGC GGCCGCCGCG
GCCGAATCGA TGCGCGAGCA GTCCGCCGCG CTCGTGCGCG CGGTGCGCGT GTTCAAGCTG
GATGCGTATC CGGCCGTAGC GTCGAGCCGC GCCGCACCGC CCGCGCGGCC CGCGCGCGTC
GCGAGCTTCG ACGCCGGGGC GAGCGTGGCC GCGCATGCCG CGCCGCGACT GGCGCGCATC
GCGCCCTGA
 
Protein sequence
MGFAHMKVAT RLGIGFALVA CLLAVMVAFA LDRMAKFEGW MVEITEVNSV EAKLAAKLEL 
SITERALALR NLILLDRQDE MQIEQDRIDA KAKLYRQSRD RLATMFATLD GTPQERALLE
QIGQQGDAAD GFIARARTMI LAGQKDDAYK LLRFEFRPVQ AKWWALTREL KALEEKQNEE
ATLHAKAAYE EGRTWMLVLG ALALVSSVVS AWLITRGIVR QLGGEPCDAA HAANMIAAGD
LSVAIGVRDG DEASLMHAMK SMRDSLAEIV SQVHVSADTI ATASGQIASG NLDLSARTEQ
QAASLEETAA SMEQLTATVQ QNTDNSRQAD TLAASASHVA EKGGAAVTQV VDAMGSIHAT
AQKIVEIIGV IDGIAFQTNI LALNAAVEAA RAGEQGRGFA VVASEVRSLA QRSATAAREI
KELIGGSVVQ VEAGDRLAKE AGATMHEVVE SIRRVTLIMA EITRASEQQT SGIVEIDRAI
TQMDQVTQQN ASLVEEAAAA AESMREQSAA LVRAVRVFKL DAYPAVASSR AAPPARPARV
ASFDAGASVA AHAAPRLARI AP