Gene BURPS668_A2687 details

Gene Information

Locus tag: BURPS668_A2687
Symbol:
ID: 4887976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2574360
End bp: 2576000
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 69%
IMG OID: 640132623
Product: methyl-accepting chemotaxis protein
Protein accession: YP_001063679
Protein GI: 126443034
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
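
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 2574360
END_BP = 2576000
GENE_LENGTH_BP = 1641
PROTEIN_LENGTH_AA = 546


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1641
print(expected_protein_length(GENE_LENGTH_BP))  # 546
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.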


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGCAAAT CCGCCGCCGG CAGCCGGCTC ACGCTCGGCA ACCGGATCCT GTTCAGCTTC 
GGCGTGCTGT TCGTGCTGAT GCTGTTCATG GCGGCGCTGT CCTACCAGCG CCTGCGCGCG
ATCAACGACG AGGCGATCAG CATCGAGCGC GATTCGCTGC CGGGCGTCTA TCTCGCGTCG
TCGCTGCGCG CGTCGGCGAA CGAGTCGTAC ATCGTGCTGC AGCGCGCGGT GTTCGTCGAT
GCCGAAGCCG AAGCGGTGCA GCGCGATCTC GCGAAGGTGC CGGGCCTGCT CGAGGAGTTC
GACAAGCTGT CGTCCGCGTA CCAGGCGTCG ACGTTCCGCA GCGACGATCA GGAGCGCTTC
AACGCGTTTC GCGCGGCGTA CGAGCGCTAC CTGCCGCTGC TGAACGACGC CGTGCAGAAG
GCGCGCGGCG CGAGGCCCGA CGCGCTCGCT GCCTACGCGA GGGTGACGCC TGCGTGGGAA
GAGGTGATTC GCCATGCGAA CGTGCTCGTG CAGGAGAACC GGCGCTTCGC CGATCAGTCG
GCCACGCTGA TCCGCGAATC GGTGCACGGC ACCGAGATCA CGCTCGCGGT GGCGCTCGGC
GTCGTGCTCG TGGTCGCGCT GATGCTCGGC TATCTGCTGC ATCGCGCGGT GACCGTGCCG
ATGGCGCGGC TCATCGACGT GCACGACGTG ATGCGCACCG GCAACCTGAC GCAGCGCCTG
AACCTCGGCC GCAGCGACGA GTTCGGCACG CTCGAGAGCG GCTTCAACCG GATGGCCGAC
GAGCTGACCG CGCTCGTCGC GCAGGCGCAG CAGTCGTCGC TGCAGGTGAC GACGTCGGTG
GCCGAGATCG CGGCGACCTC GCGCGAGCAG CAGGCGACCG CGAACGAAAC GGCGGCGACG
ACGACCGAGA TCGGCGCGAC CTCGCGCGAG ATCTTCGCGA CCTCGCGCGA CCTGCTGCGC
ACGATGAACG AAGTGTCGGC GGTGGCCGAG CAGTCGGCGA CGCTCGCGGG CGTCGGCCAG
AGCGGCCTCA CGCGGATGGG CGAGACGATG CGCAGCGTGA TGGACGCGGC GGGCTCGGTG
AACGCGAAGC TCGCGATCCT CAACGAGAAG GCGATCAACA TCAACCAGGT CGTCGCGACG
ATCACGAAGG TCGCCGACCA GACCAACCTG CTGTCGCTGA ACGCGGCGAT CGAGGCGGAG
AAGGCGGGCG AGTACGGCCG CGGCTTCGCG GTCGTCGCGA CCGAGATCCG CCGCCTCGCC
GATCAGACGG CCGTCGCGAC GTACGACATC GAGCAGACGG TGAAGGAGAT CCAGTCGGCG
GTGTCGGCGG GCGTGATGGG CATGGACAAG TTCTCGGAGG AAGTGCGCCG CGGGATGCGC
GACGTGCAGC AGGTGGGCGG ACAGTTGTCG CAGATCATCG CCGAGGTGCA GACGCTCGCG
CCGCGCTTCC AGATGGTCAA CGAAGGGATG CAGACGCAGG CGAGCGGCGC CGAGCAGATC
ACGCAGGCGC TCGCGCAGTT GTCCGAGGCC GCGCAGCAGA CGGCGGAATC GCTGCGGCAG
TCGTCGCAGG CGATCGACGA TCTGACGCTC GTCGCGAACC AGCTGCGCAC CGGCGTGTCG
CGCTTCAAGG TCGACGCGTG A
 
Protein sequence
MRKSAAGSRL TLGNRILFSF GVLFVLMLFM AALSYQRLRA INDEAISIER DSLPGVYLAS 
SLRASANESY IVLQRAVFVD AEAEAVQRDL AKVPGLLEEF DKLSSAYQAS TFRSDDQERF
NAFRAAYERY LPLLNDAVQK ARGARPDALA AYARVTPAWE EVIRHANVLV QENRRFADQS
ATLIRESVHG TEITLAVALG VVLVVALMLG YLLHRAVTVP MARLIDVHDV MRTGNLTQRL
NLGRSDEFGT LESGFNRMAD ELTALVAQAQ QSSLQVTTSV AEIAATSREQ QATANETAAT
TTEIGATSRE IFATSRDLLR TMNEVSAVAE QSATLAGVGQ SGLTRMGETM RSVMDAAGSV
NAKLAILNEK AININQVVAT ITKVADQTNL LSLNAAIEAE KAGEYGRGFA VVATEIRRLA
DQTAVATYDI EQTVKEIQSA VSAGVMGMDK FSEEVRRGMR DVQQVGGQLS QIIAEVQTLA
PRFQMVNEGM QTQASGAEQI TQALAQLSEA AQQTAESLRQ SSQAIDDLTL VANQLRTGVS
RFKVDA
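
The sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the block formatting could be stripped and a GC fraction computed. It uses only the first line of the gene sequence as input; pasting the full gene sequence in its place would give a value to compare with the GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "TTGCGCAAAT CCGCCGCCGG CAGCCGGCTC ACGCTCGGCA ACCGGATCCT GTTCAGCTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```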