Gene Bcep18194_A5099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A5099 
Symbol 
ID3750307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp2138950 
End bp2141274 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content68% 
IMG OID637763395 
ProductRNA binding S1 
Protein accessionYP_369337 
Protein GI78066568 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.83854 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAA CCGTAGCACT CAAGATCGTA CAGCGCATCG CCACCGAACT GGCCGTCCAG 
CCGCGCCAGG TCGCGGCGGC AGTGCAACTC CTCGACGAAG GCTCGACCGT TCCGTTCATT
GCCCGGTACC GCAAGGAAGT GACCGGCAAC CTGGACGACA CGCAGTTGCG CCAGCTCGAG
GAACGCCTCC TGTATCTGCG CGAACTCGAG GATCGCCGCG CGACGATCCT GTCGAGCATC
GACGAACAGG GCAAGCTGAC CGACGAACTG CGCGCCGCGA TCGACGCGGC CGACAGCAAG
CAGGTGCTCG AAGACCTTTA TCTGCCGTAC AAGCCGAAGC GCCGCACGCG TGCGCAGATC
GCCCGCGAAG CCGGCCTCGA GCCGCTCGCC CAGGCGCTCC TCGCGAACCC GCTGCTCGAC
CCGCAGGCCG AGGCCGCCGC GTACGTCGAC GCCGACAAGG GCGTCGCCGA CGTGAAGGCC
GCGCTCGACG GTGCGCGCGA CATCCTGTCC GAACAGTTCG GCGAAACGGC CGAGCTGCTC
GGCAAGCTGC GCGACTACCT GCACAACCAG GGCGTCGTGT CGTCGGCCGT CGTCGAGGGC
AAGGAAAACG AGGAAGGCGA GAAATTCCGC GACTATTACG ACTACGCGGA AACGATCAAG
ACCGTGCCGT CGCACCGCGC CCTCGCGCTG TTCCGCGGCC GCAACGCGGG CGTGCTGACG
GTCAAGCTCG GCCTCGGCGA AGAGCTCGAC GCGCAGGTGC CGCATCCCGG CGAGGCGATG
ATCGCGCGCC ATTTCGGTAT CGCGAACCAG AACCGCCCGG CCGACAAATG GCTGTCCGAC
GTATGCCGCT GGTGCTGGCG CGTGAAGGTG CAGCCGCACA TCGAGAACGA GCTGCTCACG
CAACTGCGCG AAACGGCCGA AACGGAAGCG ATCCGCGTGT TCGCGCGCAA CCTGAACGAC
CTGCTGCTGG CCGCGCCGGC CGGCCCGAAG GCCGTGATCG GTCTCGACCC CGGCCTGCGC
ACGGGCGTGA AGGTCGCAGT CGTCGACCGC ACCGGCAAGG TGCTCGCGAC CGACACGATC
TACCCGCACG AGCCGCGCCG CGACTGGGAC GGCTCGATCG CGAAACTCGC ACGCATCGCC
GCGCAGACGC AGGCCGAGCT GATCAGCATC GGCAACGGCA CCGCGTCGCG TGAAACCGAC
AAGCTCGCGA GCGAACTGAT CGCGAAGCAC CCGGAGCTGC GCCTGCAGAA GATCGTCGTG
TCGGAAGCCG GCGCGTCGGT CTATTCGGCA TCCGAGCTGG CCGCGAAGGA ATTCCCGGAA
CTCGACGTGT CGCTGCGCGG CGCCGTGTCG ATTGCACGCC GCCTGCAGGA TCCGCTCGCC
GAACTCGTGA AGATCGAGCC GAAGGCCATC GGCGTCGGCC AGTACCAGCA CGACGTGAAC
CAGCGCGAAC TCGCCCGCTC GCTCGACGCG GTCGTCGAGG ACTGCGTGAA CGCGGTCGGT
GTCGACGCGA ACACCGCGTC GGCCCCGCTG CTCGCCCGTG TATCGGGCCT GAACGCCACG
CTCGCGCGCA ATATCGTCGA CTACCGCGAT GCGAACGGCC CGTTCCCTTC GCGCGAGCAC
CTGCGCAAGG TGCCGCGCCT CGGCGACAAG ACCTTCGAAC AGGCCGCCGG CTTCCTGCGC
ATCAACGGTG GCGAGAATCC GCTCGACCGC TCGTCGGTGC ACCCGGAGGC GTATCCGGTC
GTCGAGCGGA TGCTCGCAAA GATCAGCAAG CGCATCGACG ACGTGCTCGG CAACCGCGAA
GCGCTGTCGG GCCTTTCCCC GACGGAATTT GTTGACGAAC GTTTCGGCCT GCCGACGGTA
CGCGACATCC TGTCCGAACT GGAGAAGCCG GGCCGCGATC CGCGCCCCGA ATTCAAGACC
GCGACGTTCC GCGAAGGCGT CGAGAAGGTG TCGGATCTCG TGCCGGGCAT GACGCTCGAA
GGCGTCGTGA CGAACGTCGC CGCGTTCGGC GCGTTCGTGG ACATCGGCGT CCACCAGGAC
GGCCTCGTCC ACGTGTCCGC GATGTCGACG AAATTCATCA AGGATCCGCA CGAAGTCGTG
AAGGCCGGCC AGGTCGTCAA GGTGAAGGTG ATCGACGTCG ACGTGAAGCG CCAGCGCATT
GCACTGACGA TGCGCCTCGA CGACGACGCG GCAGCGCCCG GCATGTCGTC GCGCGGCGGC
CAGGATCGCG GCAACGCGGC GCGCGGCGCG GCCCGCCCGC AGCGTTCGCG CGAGCCGGAA
CCGGCCGGCG CAATGGCCGC GGCGTTCGCC AAGCTGAAGC GCTAA
 
Protein sequence
MTETVALKIV QRIATELAVQ PRQVAAAVQL LDEGSTVPFI ARYRKEVTGN LDDTQLRQLE 
ERLLYLRELE DRRATILSSI DEQGKLTDEL RAAIDAADSK QVLEDLYLPY KPKRRTRAQI
AREAGLEPLA QALLANPLLD PQAEAAAYVD ADKGVADVKA ALDGARDILS EQFGETAELL
GKLRDYLHNQ GVVSSAVVEG KENEEGEKFR DYYDYAETIK TVPSHRALAL FRGRNAGVLT
VKLGLGEELD AQVPHPGEAM IARHFGIANQ NRPADKWLSD VCRWCWRVKV QPHIENELLT
QLRETAETEA IRVFARNLND LLLAAPAGPK AVIGLDPGLR TGVKVAVVDR TGKVLATDTI
YPHEPRRDWD GSIAKLARIA AQTQAELISI GNGTASRETD KLASELIAKH PELRLQKIVV
SEAGASVYSA SELAAKEFPE LDVSLRGAVS IARRLQDPLA ELVKIEPKAI GVGQYQHDVN
QRELARSLDA VVEDCVNAVG VDANTASAPL LARVSGLNAT LARNIVDYRD ANGPFPSREH
LRKVPRLGDK TFEQAAGFLR INGGENPLDR SSVHPEAYPV VERMLAKISK RIDDVLGNRE
ALSGLSPTEF VDERFGLPTV RDILSELEKP GRDPRPEFKT ATFREGVEKV SDLVPGMTLE
GVVTNVAAFG AFVDIGVHQD GLVHVSAMST KFIKDPHEVV KAGQVVKVKV IDVDVKRQRI
ALTMRLDDDA AAPGMSSRGG QDRGNAARGA ARPQRSREPE PAGAMAAAFA KLKR