Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_3996 |
Symbol | |
ID | 8734454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4243410 |
End bp | 4246439 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 646504621 |
Product | transcriptional activator domain protein |
Protein accession | YP_003395788 |
Protein GI | 284045448 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.306526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCAGAA GGGTGGGGAG CGAGCACCCG CGGGTCGCGC TCATCCAGCG CAAGCTGGCG CCGCCGCCGC CGCCCGCGCC GCTCGTCGAG CGGCCGCGGT TGGAGCAGCT GCTGGAGGCG CTGTTCGAGC GCCATCGTGT CGTCGTCGTC TCGGCGACGG CCGGCGCGGG CAAGACGACG GCCGTGGCCG CGGCCGCGAG GAGGCTGACG TGGCCGCTGG CGTGGCTCTC GGTCGATCGC ACCGACGCTG CGCCCGGGCG CCTCGTCACC TACCTGGAGG CGGCGCTGGC GCGCACGCGA CCCGAGCTGG CCGGGCTCGC GAGCGAGGCG CTCGCCGCCG GCATCCCGCA CGCCGAGGCC GTCGGCCTGC TGATCGAGGC GGCTGGGGAC GCGCCACTGC TGATCGTGCT CGACGACCTC GAGCGGCTCG GCGACGCGCC GGAGGCGCTG GCCGTGATCG AGGCGCTGCT GCGCTACGCG CCGCCCGCGG TGCACGTCGC GCTGATCAGC CGCCGCGACG TGCCGAGCGT GCTGTCGGTC CCCTCCGGGC ACGTCAGCGC GACGATCAGC GAGGGCGATC TGGCGTTCAC CGTGACCGAG GCGGCCGAGG CGCTGGAGCT GCTGGGCGAG CGTGAGACGG ACGCCGCCGA GGTCGTGGCC GCGACGGGCG GCTGGGTCAC CGGCGTGCTG TTCGAGGCGT GGCGTTCGGC CGCGCACGTC GCCGGCCTCG GCGGTGAGGC CGACCCGCTC GACGGCTACC TCTCCTCCCA GCTGCTGGAG CAGTTGGACC CGGCGGCGCG CGACTTCCTC GTCGCGACAT CGGTGCTGGA GGAGGTCACG GTCGCGCGTG CGACCGCGCT CGGCCAGCCG GCTGCGGCAG AGCGGGTGCT CGCGCTCAGA GCCGCGCATC TGCCGGTCGT CTGGGGCGAG GGCGGCACGA GCATGCGCTG CCACCCGCGC TTCCGCGACT ACCTGCTGGC GCGCCTGGAG CGCCGCGGAC CGGACGCGGT GCGCGAGGTC CGCGTCGCGC ACGGACGGCT GCTCGCGCGG GAGGGCCACC ACGAGGAGGC GACGGAGGTG CTGTTGCGCG CCGGGGCCGA CGAGCAGGCG CTGACGAGCG CCGAACAGGC GATCTTCCCC GTCATCGAGC GGCTCGACGG CGCGGTCGCG GAGCGCTGGC TGACGGCGCT GGCCGACGTC GCGCCCAGCG GCGCGACGCA GCTGACGATC GCCGAGCTGG CGCTGTCGCT GACCGGCGAC GACTACCGCC GGGGCGTGCG CGTCGCCGAC CAGCTGGTCG AGCTGGACGA GCGCGAGCGG CTCGCGCACG CCTCGGACCG CGCGGCCGGG CTGATGGCCT CGCTGTACGC GGGCGTGGGC CGCCTCGACG ACGCGCACGC GGTGCTCGAC GCCGCCTCCG GCGGCCTGGG CGCGAGCACG GTCCGCTACA TGCTGCGGAT GTACGAGGGC GGCCCGGCGC AGCCGAGACC CGAGCCGGGA GGCGAGGAGG CGGACGGCGT GCTCAGCTAC GCCGACTACT TCTGCGGGCG CCTCGACAAG CTCGGCGAGG CGCCGCTGTC GCGCTGGGCC TACGCCGTCG CGCGGCCGTG GCAGATCGCC GCCCGCGCCG CGCTCGGCCA CACCGCCGAG GCGCTGGAGC TGTACGAGGC GGCGCAGGAC GCGGGCGTGA TGACGGTGTC GCTGGCGGTC TGCCCGGGCC CCGAGGTGCT GATGGACGCG GGCCGGCTGG AGCAGGCGCG CGCGCTGATC GCCCACGGAC GCAGGCTCGC GCGCGCGACT GGTGCGTTGG ACCTGGAGTG CGTCAACCGG TTGATGGAGG GCAAGCTCGC GTTGCGGTTG GAGCGCGACC CAGAGGCGGC GCGGGCGGTG CTGGAGCAGC TCGAACGCGA CCTGGAGCCG CATGCGTGGC CGTTCCTGAT CGGGCCGATC AACGCCTGGC TCGGCGCGGC GCTGCTGCTG CAGGGCGACG ACGTGGCGGC GCTGGCACGG CTCAGCCGCG CGGTCGAGGT GCTGGTGGAC GCCGACCTGA TCCTGGAGCT GCCGACGGCG GCCGGCTACC TCGCCGAGGC GCAGTGGCGG GCCGGAGACG AGGAGGCGGC CGATCGCGCC GCGGACCTCG CGCTCGACGC GTCGCACCGG CTGGGCTCCA ATCACCGGCT GCTGCAGGCG CTGACCGACT ACCCGGCGGT CGTGTCGCGC CGGATCGACG CCGAGCAGGG GGCCGACTCC GCGTGGCATG AGCTGGGCCG CGCGCTGATC GCCCAGGGCG CCGGCGGGCT GTCGGTGATC GGCAGCGTCG TCGAGCTGCG CGAGTTCGGC GAGCCGGCGC TCGCGATCGC GGGGGCGACG GTGCGGGCGC GGATCGCCAA GAGCTACGAG CTGGTCGCGT TCCTGATCGT GCGCGGGGGC GACGGCGTGG ACCGCGACGA GCTGCTCGAC GCGCTGTTCG ACGGCCGCGC CGACGATTCC GCGCGCTCCT ACCTGCGGCA GGCGATCTAC TGGGTGCGCC GCGCGCTGCC CGAGGGCGGG CTGCTCGTCG AGAAGCGGCG CGTGCGGCTG GGCGCCGAGC TGACCGTCTC CAGCGAGTCG ACCACGTTCG AGGCGCGGCT GGCGGAGGCC GCTCGGCTGC AGGGGGAGGA GCGGCTTGCC GCGACACGGG CCGCGCTGGA GCTGTACGAC CGCGGCGAGT ACCTCGCGGG GGCCGCCTCG GGCTGGGTCG CCGCGCGCCG CGAGCAGCTC GCCGAGCGCG CGCTCGACGC GCGCTACGAG GCCGCCGAGC TGGCGTTCGC GGCGGGCGCG TACGGCGACG GGCGCGCGCT CGCCGAGCAG GTGCTGGGCG CCGACCCGCT GCGCGAGGCC GCCTGGCGGC TGCTGATGCG GATCGCCAGC GCGCTCGGCG ACGAGGACCG CGTGATCGCG ACCTTCCGCG ACTGCGAGCG GGCGCTCGCG ACGATCGGCA CGGCACCGTC CGCGGTGACG CGCGAGCTGG TCGAGCGGCT GCGCCGCTGA
|
Protein sequence | MPRRVGSEHP RVALIQRKLA PPPPPAPLVE RPRLEQLLEA LFERHRVVVV SATAGAGKTT AVAAAARRLT WPLAWLSVDR TDAAPGRLVT YLEAALARTR PELAGLASEA LAAGIPHAEA VGLLIEAAGD APLLIVLDDL ERLGDAPEAL AVIEALLRYA PPAVHVALIS RRDVPSVLSV PSGHVSATIS EGDLAFTVTE AAEALELLGE RETDAAEVVA ATGGWVTGVL FEAWRSAAHV AGLGGEADPL DGYLSSQLLE QLDPAARDFL VATSVLEEVT VARATALGQP AAAERVLALR AAHLPVVWGE GGTSMRCHPR FRDYLLARLE RRGPDAVREV RVAHGRLLAR EGHHEEATEV LLRAGADEQA LTSAEQAIFP VIERLDGAVA ERWLTALADV APSGATQLTI AELALSLTGD DYRRGVRVAD QLVELDERER LAHASDRAAG LMASLYAGVG RLDDAHAVLD AASGGLGAST VRYMLRMYEG GPAQPRPEPG GEEADGVLSY ADYFCGRLDK LGEAPLSRWA YAVARPWQIA ARAALGHTAE ALELYEAAQD AGVMTVSLAV CPGPEVLMDA GRLEQARALI AHGRRLARAT GALDLECVNR LMEGKLALRL ERDPEAARAV LEQLERDLEP HAWPFLIGPI NAWLGAALLL QGDDVAALAR LSRAVEVLVD ADLILELPTA AGYLAEAQWR AGDEEAADRA ADLALDASHR LGSNHRLLQA LTDYPAVVSR RIDAEQGADS AWHELGRALI AQGAGGLSVI GSVVELREFG EPALAIAGAT VRARIAKSYE LVAFLIVRGG DGVDRDELLD ALFDGRADDS ARSYLRQAIY WVRRALPEGG LLVEKRRVRL GAELTVSSES TTFEARLAEA ARLQGEERLA ATRAALELYD RGEYLAGAAS GWVAARREQL AERALDARYE AAELAFAAGA YGDGRALAEQ VLGADPLREA AWRLLMRIAS ALGDEDRVIA TFRDCERALA TIGTAPSAVT RELVERLRR
|
| |