Gene BURPS668_2174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2174 
Symbol 
ID4881887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2166170 
End bp2168494 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content69% 
IMG OID640128102 
Productputative tex protein 
Protein accessionYP_001059209 
Protein GI126438941 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAAA CCGTAGCACT CAAGATCGTA CAGCGCATCG CCGACGAACT CTCGGTCCAG 
CCGCGGCAGG TCGCCGCGGC GGTGCAACTC CTCGACGAAG GCTCCACCGT TCCGTTCATC
GCCCGCTACC GGAAGGAAGT CACGGGCAAT CTGGACGACA CGCAGTTGCG CCAGCTCGAA
GAGCGCCTGC TGTATCTGCG CGAGCTCGAG GAACGCCGCG CGACGATCAT CGCGAGCATC
GACGAGCAGG GCAAGCTGAC GGACGAACTG CGCGCGGCGA TCGACGCGGC CGACAGCAAG
CAGACGCTCG AGGATCTGTA CCTGCCGTAC AAGCCGAAGC GCCGCACGCG CGCGCAGATC
GCCCGCGAAG CCGGGCTCGA GCCGCTCGCG CAGGCGCTCC TCGCGAATCC GCTGCTCGAT
CCCCAGGCGG AAGCGGCCGC GTACGTGAAC ACGGATCGCG GCGTCGCCGA CGTGAAGGCG
GCGCTCGACG GCGCGCGCGA CATCCTGTCC GAGCAATTCG GCGAGACGGC CGAACTGCTC
GGCAAGCTGC GCGACTATCT GTTCGAGCGC GGCGTCGTGT CGTCGGCCGT CGTCGACGGC
AAGCAAGGCG AGGAAGGCGA GAAATTCCGC GACTACTACG ACTACTCGGA AACGATCAAG
ACCGTGCCGT CGCACCGCGC GCTCGCGCTG TTCCGCGGCC GCAACGCCGG CGTGCTGACC
GTGAAGCTCG GCCTCGGCGA AGAGCTCGAT GCGCAGGTGC CGCACCCGGG CGAGGCGATG
ATCGCGCGCC ATTTCGGGAT CGCGAACCAG AACCGGCCGG CCGACAAGTG GCTGTCCGAC
GTGTGCCGCT GGTGCTGGCG CGTGAAGGTG CAGCCGCACA TCGAAACCGA ATTGCTCACA
CAATTGCGCG AGACGGCCGA GCATGAGGCG ATCCGCGTGT TCGCGCGCAA CCTGAAGGAC
CTGCTGCTCG CCGCGCCCGC GGGCCCGAAG GCCGTGATCG GTCTCGACCC CGGCCTGCGC
ACGGGCGTGA AGGTCGCCGT CGTCGACCGC ACGGGCAAGC TGCTCGCGAC CGACACGATC
TATCCGCACG AGCCGCGCCG CGACTGGGAC GGCTCGCTCG CGAAGCTCGC GCGCCTCGCC
GCGCAGACGC AGGCCGAGCT CGTCAGCATC GGCAACGGCA CCGCGTCGCG CGAAACCGAC
AAGCTCGCGA GCGAGCTGAT CGCCAAGCAT CCCGAGCTCA AGCTGCAGAA GATCGTCGTG
TCGGAGGCGG GCGCGTCCGT CTACTCGGCG TCGGAGCTCG CCGCGAAGGA ATTTCCCGAG
CTCGACGTGT CGCTGCGCGG CGCGGTATCG ATCGCGCGCC GGCTGCAGGA TCCGCTCGCG
GAGCTCGTGA AGATCGAGCC GAAGGCGATC GGCGTCGGCC AGTATCAGCA CGACGTGAAC
CAGCGCGAGC TCGCGCGCTC GCTCGACGCG GTCGTCGAGG ATTGCGTGAA CGCGGTCGGC
GTCGACGCGA ACACCGCGTC TGCCGCCCTC CTCGCGCGCG TGTCGGGCCT GAACTCGACG
CTCGCGCGCA ACATCGTCGA CTATCGCGAC GCGAACGGCC CGTTCCCCTC GCGCGAGCAC
CTGCGCCGCG TGCCGCGCCT CGGCGACAAG ACGTTCGAGC AGGCGGCGGG CTTCCTGCGC
ATCAACGGCG GCGAGAATCC GCTCGACCGC TCGTCGGTGC ACCCGGAGGC ATACCCCGTC
GTCGAGCGGA TGCTCGCGAA GATCAGCAAG CGCATCGACG ACGTGCTAGG CAACCGCGAC
GCGCTCGCGG GCCTGTCGCC CGCCGAATTC GTCGATGAAC GTTTCGGTTT GCCGACCGTG
CGCGACATCC TGTCCGAGCT CGAGAAGCCC GGCCGCGATC CGCGCCCCGA ATTCAAGACC
GCGACATTCC GCGAAGGTGT CGAGAAAGTG TCGGATCTCG CGCCGGGGAT GGTGCTCGAA
GGCGTCGTGA CGAACGTGGC GGCATTCGGC GCGTTCGTCG ACATCGGCGT GCATCAGGAC
GGGCTCGTCC ACGTATCCGC GATGTCGACG AAATTCATCA AGGATCCTCA CGAAATCGTG
AAGGCCGGCC AGGTCGTCAA GGTGAAGGTG CTCGACGTCG ATGTGAAGCG CCAGCGGATT
TCGCTGACGA TGCGGCTCGA CGACGACGCG GCGCCCAGCG CGCCCGGCAA TCGCGGCGGC
GCCGAGCGCG GCGCAATGCG CGGCGGCGCC CGGGCGCAGC GCTCGCGCGA GCCGGAACCG
GCGGGCGCGA TGGCCGCCGC GTTCGCAAAG CTCAAGCAGC GTTGA
 
Protein sequence
MTETVALKIV QRIADELSVQ PRQVAAAVQL LDEGSTVPFI ARYRKEVTGN LDDTQLRQLE 
ERLLYLRELE ERRATIIASI DEQGKLTDEL RAAIDAADSK QTLEDLYLPY KPKRRTRAQI
AREAGLEPLA QALLANPLLD PQAEAAAYVN TDRGVADVKA ALDGARDILS EQFGETAELL
GKLRDYLFER GVVSSAVVDG KQGEEGEKFR DYYDYSETIK TVPSHRALAL FRGRNAGVLT
VKLGLGEELD AQVPHPGEAM IARHFGIANQ NRPADKWLSD VCRWCWRVKV QPHIETELLT
QLRETAEHEA IRVFARNLKD LLLAAPAGPK AVIGLDPGLR TGVKVAVVDR TGKLLATDTI
YPHEPRRDWD GSLAKLARLA AQTQAELVSI GNGTASRETD KLASELIAKH PELKLQKIVV
SEAGASVYSA SELAAKEFPE LDVSLRGAVS IARRLQDPLA ELVKIEPKAI GVGQYQHDVN
QRELARSLDA VVEDCVNAVG VDANTASAAL LARVSGLNST LARNIVDYRD ANGPFPSREH
LRRVPRLGDK TFEQAAGFLR INGGENPLDR SSVHPEAYPV VERMLAKISK RIDDVLGNRD
ALAGLSPAEF VDERFGLPTV RDILSELEKP GRDPRPEFKT ATFREGVEKV SDLAPGMVLE
GVVTNVAAFG AFVDIGVHQD GLVHVSAMST KFIKDPHEIV KAGQVVKVKV LDVDVKRQRI
SLTMRLDDDA APSAPGNRGG AERGAMRGGA RAQRSREPEP AGAMAAAFAK LKQR