Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2212 |
Symbol | |
ID | 4902854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 2199723 |
End bp | 2202047 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640135440 |
Product | putative tex protein |
Protein accession | YP_001066475 |
Protein GI | 126452035 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAA CCGTAGCACT CAAGATCGTA CAGCGCATCG CCGACGAACT CTCGGTCCAG CCGCGGCAGG TCGCCGCGGC GGTGCAACTC CTCGACGAAG GCTCCACCGT TCCGTTCATC GCCCGCTACC GGAAGGAAGT CACGGGCAAT CTGGACGACA CGCAGTTGCG CCAGCTCGAA GAGCGCCTGC TGTATCTGCG CGAGCTCGAG GAACGCCGCG CGACGATCAT CGCGAGCATT GACGAGCAGG GCAAGCTGAC GGACGAACTG CGCGCGGCGA TCGACGCGGC CGACAGCAAG CAGACGCTCG AGGATCTGTA CCTGCCGTAC AAGCCGAAGC GCCGCACGCG CGCGCAGATC GCCCGCGAAG CCGGGCTCGA GCCGCTCGCG CAGGCGCTCC TCGCGAATCC GCTGCTCGAT CCCCAGGCGG AAGCGGCCGC GTACGTGAAC ACGGATCGCG GCGTCGCCGA CGTGAAGGCG GCGCTCGACG GCGCGCGCGA CATCCTGTCC GAGCAATTCG GCGAGACGGC CGAACTGCTC GGCAAGCTGC GCGACTATCT GTTCGAGCGC GGCGTCGTGT CGTCGGCCGT CGTCGACGGC AAGCAAGGCG AGGAAGGCGA GAAATTCCGC GACTACTACG ACTACTCGGA AACGATCAAG ACCGTGCCGT CGCACCGCGC GCTCGCGCTG TTCCGCGGCC GCAACGCCGG CGTGCTGACC GTGAAGCTCG GCCTCGGCGA AGAGCTCGAT GCGCAGGTGC CGCACCCGGG CGAGGCGATG ATCGCGCGCC ATTTCGGGAT CGCGAACCAG AACCGGCCGG CCGACAAGTG GCTGTCCGAC GTGTGCCGCT GGTGCTGGCG CGTGAAGGTG CAGCCGCACA TCGAAACCGA ATTGCTCACA CAATTGCGCG AGACGGCCGA GCATGAGGCG ATCCGCGTGT TCGCGCGCAA CCTGAAGGAC CTGCTGCTCG CCGCGCCCGC GGGCCCGAAG GCCGTGATCG GTCTCGACCC CGGCCTGCGC ACGGGCGTGA AGGTCGCCGT CGTCGACCGC ACGGGCAAGC TGCTCGCGAC CGACACGATC TATCCGCACG AGCCGCGCCG CGACTGGGAC GGCTCGCTCG CGAAGCTCGC GCGCCTCGCC GCACAGACGC AGGCCGAGCT CGTCAGCATC GGCAACGGCA CCGCGTCGCG CGAAACCGAC AAGCTCGCGA GCGAGCTGAT CGCCAAGCAT CCCGAGCTCA AGCTGCAGAA GATCGTCGTG TCGGAGGCGG GCGCGTCCGT CTACTCGGCG TCGGAGCTCG CCGCGAAGGA ATTCCCCGAG CTCGACGTGT CGCTGCGCGG CGCGGTATCG ATCGCGCGCC GGCTGCAGGA TCCGCTCGCG GAGCTCGTGA AGATCGAACC GAAGGCGATC GGCGTCGGCC AGTATCAGCA CGACGTGAAC CAGCGCGAGC TCGCGCGCTC GCTCGACGCG GTCGTCGAGG ATTGCGTGAA CGCGGTCGGC GTCGACGCGA ACACCGCGTC TGCCGCCCTC CTCGCGCGCG TGTCGGGCCT GAACTCGACG CTCGCGCGCA ACATCGTCGA CTATCGCGAC GCGAACGGCC CGTTCCCGTC GCGCGAGCAC CTGCGCCGCG TGCCGCGCCT CGGCGACAAG ACGTTCGAGC AGGCGGCGGG CTTCCTGCGC ATCAACGGCG GCGAGAATCC GCTCGACCGC TCGTCGGTGC ACCCGGAGGC ATACCCCGTC GTCGAGCGGA TGCTCGCGAA GATCAGCAAG CGCATCGACG ACGTGCTAGG CAACCGCGAC GCGCTCGCTG GCCTGTCGCC CGCCGAATTC GTCGATGAAC GTTTCGGTTT GCCGACCGTG CGCGACATCC TGTCCGAGCT CGAGAAGCCC GGCCGCGATC CGCGCCCCGA ATTCAAGACC GCGACATTCC GCGAAGGTGT CGAGAAAGTG TCGGATCTCG CGCCGGGGAT GGTGCTCGAA GGCGTCGTGA CGAACGTGGC GGCATTCGGC GCATTCGTCG ACATCGGCGT GCATCAGGAC GGGCTCGTCC ACGTATCCGC GATGTCGACG AAATTCATCA AGGATCCTCA CGAAATCGTG AAGGCCGGCC AGGTCGTCAA GGTGAAGGTG CTCGACGTCG ATGTGAAGCG CCAGCGGATT TCGCTGACGA TGCGGCTCGA CGACGACGCG GCGCCCAGCG CGCCCGGCAA TCGCGGCGGC GCCGAGCGCG GCGCAATGCG CGGCGGCGCC CGGGCGCAGC GCTCGCGCGA GCCGGAACCG GCGGGCGCGA TGGCCGCCGC GTTCGCAAAG CTCAAGCAGC GTTGA
|
Protein sequence | MTETVALKIV QRIADELSVQ PRQVAAAVQL LDEGSTVPFI ARYRKEVTGN LDDTQLRQLE ERLLYLRELE ERRATIIASI DEQGKLTDEL RAAIDAADSK QTLEDLYLPY KPKRRTRAQI AREAGLEPLA QALLANPLLD PQAEAAAYVN TDRGVADVKA ALDGARDILS EQFGETAELL GKLRDYLFER GVVSSAVVDG KQGEEGEKFR DYYDYSETIK TVPSHRALAL FRGRNAGVLT VKLGLGEELD AQVPHPGEAM IARHFGIANQ NRPADKWLSD VCRWCWRVKV QPHIETELLT QLRETAEHEA IRVFARNLKD LLLAAPAGPK AVIGLDPGLR TGVKVAVVDR TGKLLATDTI YPHEPRRDWD GSLAKLARLA AQTQAELVSI GNGTASRETD KLASELIAKH PELKLQKIVV SEAGASVYSA SELAAKEFPE LDVSLRGAVS IARRLQDPLA ELVKIEPKAI GVGQYQHDVN QRELARSLDA VVEDCVNAVG VDANTASAAL LARVSGLNST LARNIVDYRD ANGPFPSREH LRRVPRLGDK TFEQAAGFLR INGGENPLDR SSVHPEAYPV VERMLAKISK RIDDVLGNRD ALAGLSPAEF VDERFGLPTV RDILSELEKP GRDPRPEFKT ATFREGVEKV SDLAPGMVLE GVVTNVAAFG AFVDIGVHQD GLVHVSAMST KFIKDPHEIV KAGQVVKVKV LDVDVKRQRI SLTMRLDDDA APSAPGNRGG AERGAMRGGA RAQRSREPEP AGAMAAAFAK LKQR
|
| |