Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A5099 |
Symbol | |
ID | 3750307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 2138950 |
End bp | 2141274 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637763395 |
Product | RNA binding S1 |
Protein accession | YP_369337 |
Protein GI | 78066568 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.83854 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAAA CCGTAGCACT CAAGATCGTA CAGCGCATCG CCACCGAACT GGCCGTCCAG CCGCGCCAGG TCGCGGCGGC AGTGCAACTC CTCGACGAAG GCTCGACCGT TCCGTTCATT GCCCGGTACC GCAAGGAAGT GACCGGCAAC CTGGACGACA CGCAGTTGCG CCAGCTCGAG GAACGCCTCC TGTATCTGCG CGAACTCGAG GATCGCCGCG CGACGATCCT GTCGAGCATC GACGAACAGG GCAAGCTGAC CGACGAACTG CGCGCCGCGA TCGACGCGGC CGACAGCAAG CAGGTGCTCG AAGACCTTTA TCTGCCGTAC AAGCCGAAGC GCCGCACGCG TGCGCAGATC GCCCGCGAAG CCGGCCTCGA GCCGCTCGCC CAGGCGCTCC TCGCGAACCC GCTGCTCGAC CCGCAGGCCG AGGCCGCCGC GTACGTCGAC GCCGACAAGG GCGTCGCCGA CGTGAAGGCC GCGCTCGACG GTGCGCGCGA CATCCTGTCC GAACAGTTCG GCGAAACGGC CGAGCTGCTC GGCAAGCTGC GCGACTACCT GCACAACCAG GGCGTCGTGT CGTCGGCCGT CGTCGAGGGC AAGGAAAACG AGGAAGGCGA GAAATTCCGC GACTATTACG ACTACGCGGA AACGATCAAG ACCGTGCCGT CGCACCGCGC CCTCGCGCTG TTCCGCGGCC GCAACGCGGG CGTGCTGACG GTCAAGCTCG GCCTCGGCGA AGAGCTCGAC GCGCAGGTGC CGCATCCCGG CGAGGCGATG ATCGCGCGCC ATTTCGGTAT CGCGAACCAG AACCGCCCGG CCGACAAATG GCTGTCCGAC GTATGCCGCT GGTGCTGGCG CGTGAAGGTG CAGCCGCACA TCGAGAACGA GCTGCTCACG CAACTGCGCG AAACGGCCGA AACGGAAGCG ATCCGCGTGT TCGCGCGCAA CCTGAACGAC CTGCTGCTGG CCGCGCCGGC CGGCCCGAAG GCCGTGATCG GTCTCGACCC CGGCCTGCGC ACGGGCGTGA AGGTCGCAGT CGTCGACCGC ACCGGCAAGG TGCTCGCGAC CGACACGATC TACCCGCACG AGCCGCGCCG CGACTGGGAC GGCTCGATCG CGAAACTCGC ACGCATCGCC GCGCAGACGC AGGCCGAGCT GATCAGCATC GGCAACGGCA CCGCGTCGCG TGAAACCGAC AAGCTCGCGA GCGAACTGAT CGCGAAGCAC CCGGAGCTGC GCCTGCAGAA GATCGTCGTG TCGGAAGCCG GCGCGTCGGT CTATTCGGCA TCCGAGCTGG CCGCGAAGGA ATTCCCGGAA CTCGACGTGT CGCTGCGCGG CGCCGTGTCG ATTGCACGCC GCCTGCAGGA TCCGCTCGCC GAACTCGTGA AGATCGAGCC GAAGGCCATC GGCGTCGGCC AGTACCAGCA CGACGTGAAC CAGCGCGAAC TCGCCCGCTC GCTCGACGCG GTCGTCGAGG ACTGCGTGAA CGCGGTCGGT GTCGACGCGA ACACCGCGTC GGCCCCGCTG CTCGCCCGTG TATCGGGCCT GAACGCCACG CTCGCGCGCA ATATCGTCGA CTACCGCGAT GCGAACGGCC CGTTCCCTTC GCGCGAGCAC CTGCGCAAGG TGCCGCGCCT CGGCGACAAG ACCTTCGAAC AGGCCGCCGG CTTCCTGCGC ATCAACGGTG GCGAGAATCC GCTCGACCGC TCGTCGGTGC ACCCGGAGGC GTATCCGGTC GTCGAGCGGA TGCTCGCAAA GATCAGCAAG CGCATCGACG ACGTGCTCGG CAACCGCGAA GCGCTGTCGG GCCTTTCCCC GACGGAATTT GTTGACGAAC GTTTCGGCCT GCCGACGGTA CGCGACATCC TGTCCGAACT GGAGAAGCCG GGCCGCGATC CGCGCCCCGA ATTCAAGACC GCGACGTTCC GCGAAGGCGT CGAGAAGGTG TCGGATCTCG TGCCGGGCAT GACGCTCGAA GGCGTCGTGA CGAACGTCGC CGCGTTCGGC GCGTTCGTGG ACATCGGCGT CCACCAGGAC GGCCTCGTCC ACGTGTCCGC GATGTCGACG AAATTCATCA AGGATCCGCA CGAAGTCGTG AAGGCCGGCC AGGTCGTCAA GGTGAAGGTG ATCGACGTCG ACGTGAAGCG CCAGCGCATT GCACTGACGA TGCGCCTCGA CGACGACGCG GCAGCGCCCG GCATGTCGTC GCGCGGCGGC CAGGATCGCG GCAACGCGGC GCGCGGCGCG GCCCGCCCGC AGCGTTCGCG CGAGCCGGAA CCGGCCGGCG CAATGGCCGC GGCGTTCGCC AAGCTGAAGC GCTAA
|
Protein sequence | MTETVALKIV QRIATELAVQ PRQVAAAVQL LDEGSTVPFI ARYRKEVTGN LDDTQLRQLE ERLLYLRELE DRRATILSSI DEQGKLTDEL RAAIDAADSK QVLEDLYLPY KPKRRTRAQI AREAGLEPLA QALLANPLLD PQAEAAAYVD ADKGVADVKA ALDGARDILS EQFGETAELL GKLRDYLHNQ GVVSSAVVEG KENEEGEKFR DYYDYAETIK TVPSHRALAL FRGRNAGVLT VKLGLGEELD AQVPHPGEAM IARHFGIANQ NRPADKWLSD VCRWCWRVKV QPHIENELLT QLRETAETEA IRVFARNLND LLLAAPAGPK AVIGLDPGLR TGVKVAVVDR TGKVLATDTI YPHEPRRDWD GSIAKLARIA AQTQAELISI GNGTASRETD KLASELIAKH PELRLQKIVV SEAGASVYSA SELAAKEFPE LDVSLRGAVS IARRLQDPLA ELVKIEPKAI GVGQYQHDVN QRELARSLDA VVEDCVNAVG VDANTASAPL LARVSGLNAT LARNIVDYRD ANGPFPSREH LRKVPRLGDK TFEQAAGFLR INGGENPLDR SSVHPEAYPV VERMLAKISK RIDDVLGNRE ALSGLSPTEF VDERFGLPTV RDILSELEKP GRDPRPEFKT ATFREGVEKV SDLVPGMTLE GVVTNVAAFG AFVDIGVHQD GLVHVSAMST KFIKDPHEVV KAGQVVKVKV IDVDVKRQRI ALTMRLDDDA AAPGMSSRGG QDRGNAARGA ARPQRSREPE PAGAMAAAFA KLKR
|
| |