Gene BURPS1106A_A1623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1623 
Symbol 
ID4905813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1592561 
End bp1594114 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content68% 
IMG OID640144729 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001075657 
Protein GI126457543 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTCTC TTTCTCTGAA CCAAAAGCTC ACGTCGATGA TCGTCATCCT CTGGATCGGC 
CTGCTGCTGA TCGGCGCGAC CGGCGCATGG CAGAACCGCT CGTCGATGAT CGCGGACCGG
CGCGAGCAGC TCGCCTATCT GACCGCGCAG GCGTACAACG TGAGCGAGCA TTTCTACAAA
CTGTCGCAGC AGAGCGCGAT GCCCGAGGCC GACGCGAAGC GCGCCGCGCT CGAAGCGATC
GCCGCGATGC GCTACGGCAA GGACGGCTAT CTGTCGGTCA ACGATTCGAA GCCCGTCGTC
GTCATGCATC CGATCAAGCC GGAGCTGAAC GGCAAGGACG TGTCGGGCTT CACCGATCCG
GGCGGCAAGC ATCTGTTCGT CGAGATCGTC AAGGCGGGCA ACGCGGAAGG CGGCCTCGGC
TTCGTCGACT ACATGTGGCC GAAGCCGGGC GCCGACAAGC CGATCGACAA GACGAGCGCC
GTTCGCCATT TCGCGCCGTG GGACTGGTAC CTCGTGACCG GCATGTACAT GGACGACCTG
CAGGCCGCCG TGCTCGCGAG CACGGGCCGC TGGCTCGCGA TGACGGCCGT GCTCGGCGCC
GCCGCGACGA TCCTGATGGT GCTCGTGCTG CGCAGCGTGC GCAGCAGCCT CGGCGGCGAT
CTCGAAGCGG CCGTCGCGAC CGCGCAGCGC ATCGCGCAGG GCGACCTGAC CGCGCACGTC
GACATTCGCG GCGGCGATCG CAAGAGCCTC CTGCACGCGC TGCACACGAT GCAGACGGCG
CTCATCGACA TGGTCTCGCG CGTGCGACTC GGCACCGAGA ACATCAACGT CGGCGCATCG
GAGATCGCGG CCGGCAACAC GGATCTGTCG CAGCGCACCG AGCAGCAGGC GGCCGCGCTC
GTCGAGACCG CGTCGAGCAT GGACGAGATG ACGACGAACG TGAAGCAGAA CGCGGACAGC
GCGTCGCAGG CGGCCGAGCT CGCCGACCAG GCGGCGCAGG TCGCCACGCG CGGCAGCTCG
GTCGTCGACG ACGTCGTGCG CACGATGGAC GAGATCACCG ACGCGTCGCG CAAGATCGGC
GACATCATCG GCGTGATCGA CGGCATCGCG TTCCAGACCA ACATCCTTGC GCTCAACGCG
GCCGTCGAGG CGGCGCGCGC GGGCGAGCAG GGCCGCGGCT TCGCGGTCGT CGCGGGCGAG
GTGCGCACGC TCGCGCAGCG CTCGGCCACG GCCGCGAAGG AAATCAAGTC GCTGATCGAG
ACGTCGAACA CGACGGTCGA GCACGGCGCG ACGCTCGTCA CGCGCGCGGG CACGACGATG
ACGGAAATCG TGCAATCGGT GAGGCGCGTG AACGAGATCC TCGAGGAAAT CAGCCATGCG
TCGCGCGAAC AGAGCGCGGG CATCGACCAG GTGAATCGCG CGGTCGGCGA GATGGACCAG
GTCACGCAGC AGAACGCCGC GCTCGTCGAA CAGGCGGCCG CCGCCGCGCA TTCGCTGAAG
GATCAGGCGG ATGCGCTGCG CTCGGCGGTC GCGCAGTTCG CGCTGCCGGG TTGA
 
Protein sequence
MRSLSLNQKL TSMIVILWIG LLLIGATGAW QNRSSMIADR REQLAYLTAQ AYNVSEHFYK 
LSQQSAMPEA DAKRAALEAI AAMRYGKDGY LSVNDSKPVV VMHPIKPELN GKDVSGFTDP
GGKHLFVEIV KAGNAEGGLG FVDYMWPKPG ADKPIDKTSA VRHFAPWDWY LVTGMYMDDL
QAAVLASTGR WLAMTAVLGA AATILMVLVL RSVRSSLGGD LEAAVATAQR IAQGDLTAHV
DIRGGDRKSL LHALHTMQTA LIDMVSRVRL GTENINVGAS EIAAGNTDLS QRTEQQAAAL
VETASSMDEM TTNVKQNADS ASQAAELADQ AAQVATRGSS VVDDVVRTMD EITDASRKIG
DIIGVIDGIA FQTNILALNA AVEAARAGEQ GRGFAVVAGE VRTLAQRSAT AAKEIKSLIE
TSNTTVEHGA TLVTRAGTTM TEIVQSVRRV NEILEEISHA SREQSAGIDQ VNRAVGEMDQ
VTQQNAALVE QAAAAAHSLK DQADALRSAV AQFALPG