Gene Cwoe_3996 details

Gene Information

Locus tag: Cwoe_3996
Symbol:
ID: 8734454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4243410
End bp: 4246439
Gene Length: 3030 bp
Protein Length: 1009 aa
Translation table: 11
GC content: 76%
IMG OID: 646504621
Product: transcriptional activator domain protein
Protein accession: YP_003395788
Protein GI: 284045448
COG category: [K] Transcription
COG ID: [COG2909] ATP-dependent transcriptional regulator
TIGRFAM ID:
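
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4243410
END_BP = 4246439
GENE_LENGTH_BP = 3030
PROTEIN_LENGTH_AA = 1009


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.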


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.306526
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCAGAA GGGTGGGGAG CGAGCACCCG CGGGTCGCGC TCATCCAGCG CAAGCTGGCG 
CCGCCGCCGC CGCCCGCGCC GCTCGTCGAG CGGCCGCGGT TGGAGCAGCT GCTGGAGGCG
CTGTTCGAGC GCCATCGTGT CGTCGTCGTC TCGGCGACGG CCGGCGCGGG CAAGACGACG
GCCGTGGCCG CGGCCGCGAG GAGGCTGACG TGGCCGCTGG CGTGGCTCTC GGTCGATCGC
ACCGACGCTG CGCCCGGGCG CCTCGTCACC TACCTGGAGG CGGCGCTGGC GCGCACGCGA
CCCGAGCTGG CCGGGCTCGC GAGCGAGGCG CTCGCCGCCG GCATCCCGCA CGCCGAGGCC
GTCGGCCTGC TGATCGAGGC GGCTGGGGAC GCGCCACTGC TGATCGTGCT CGACGACCTC
GAGCGGCTCG GCGACGCGCC GGAGGCGCTG GCCGTGATCG AGGCGCTGCT GCGCTACGCG
CCGCCCGCGG TGCACGTCGC GCTGATCAGC CGCCGCGACG TGCCGAGCGT GCTGTCGGTC
CCCTCCGGGC ACGTCAGCGC GACGATCAGC GAGGGCGATC TGGCGTTCAC CGTGACCGAG
GCGGCCGAGG CGCTGGAGCT GCTGGGCGAG CGTGAGACGG ACGCCGCCGA GGTCGTGGCC
GCGACGGGCG GCTGGGTCAC CGGCGTGCTG TTCGAGGCGT GGCGTTCGGC CGCGCACGTC
GCCGGCCTCG GCGGTGAGGC CGACCCGCTC GACGGCTACC TCTCCTCCCA GCTGCTGGAG
CAGTTGGACC CGGCGGCGCG CGACTTCCTC GTCGCGACAT CGGTGCTGGA GGAGGTCACG
GTCGCGCGTG CGACCGCGCT CGGCCAGCCG GCTGCGGCAG AGCGGGTGCT CGCGCTCAGA
GCCGCGCATC TGCCGGTCGT CTGGGGCGAG GGCGGCACGA GCATGCGCTG CCACCCGCGC
TTCCGCGACT ACCTGCTGGC GCGCCTGGAG CGCCGCGGAC CGGACGCGGT GCGCGAGGTC
CGCGTCGCGC ACGGACGGCT GCTCGCGCGG GAGGGCCACC ACGAGGAGGC GACGGAGGTG
CTGTTGCGCG CCGGGGCCGA CGAGCAGGCG CTGACGAGCG CCGAACAGGC GATCTTCCCC
GTCATCGAGC GGCTCGACGG CGCGGTCGCG GAGCGCTGGC TGACGGCGCT GGCCGACGTC
GCGCCCAGCG GCGCGACGCA GCTGACGATC GCCGAGCTGG CGCTGTCGCT GACCGGCGAC
GACTACCGCC GGGGCGTGCG CGTCGCCGAC CAGCTGGTCG AGCTGGACGA GCGCGAGCGG
CTCGCGCACG CCTCGGACCG CGCGGCCGGG CTGATGGCCT CGCTGTACGC GGGCGTGGGC
CGCCTCGACG ACGCGCACGC GGTGCTCGAC GCCGCCTCCG GCGGCCTGGG CGCGAGCACG
GTCCGCTACA TGCTGCGGAT GTACGAGGGC GGCCCGGCGC AGCCGAGACC CGAGCCGGGA
GGCGAGGAGG CGGACGGCGT GCTCAGCTAC GCCGACTACT TCTGCGGGCG CCTCGACAAG
CTCGGCGAGG CGCCGCTGTC GCGCTGGGCC TACGCCGTCG CGCGGCCGTG GCAGATCGCC
GCCCGCGCCG CGCTCGGCCA CACCGCCGAG GCGCTGGAGC TGTACGAGGC GGCGCAGGAC
GCGGGCGTGA TGACGGTGTC GCTGGCGGTC TGCCCGGGCC CCGAGGTGCT GATGGACGCG
GGCCGGCTGG AGCAGGCGCG CGCGCTGATC GCCCACGGAC GCAGGCTCGC GCGCGCGACT
GGTGCGTTGG ACCTGGAGTG CGTCAACCGG TTGATGGAGG GCAAGCTCGC GTTGCGGTTG
GAGCGCGACC CAGAGGCGGC GCGGGCGGTG CTGGAGCAGC TCGAACGCGA CCTGGAGCCG
CATGCGTGGC CGTTCCTGAT CGGGCCGATC AACGCCTGGC TCGGCGCGGC GCTGCTGCTG
CAGGGCGACG ACGTGGCGGC GCTGGCACGG CTCAGCCGCG CGGTCGAGGT GCTGGTGGAC
GCCGACCTGA TCCTGGAGCT GCCGACGGCG GCCGGCTACC TCGCCGAGGC GCAGTGGCGG
GCCGGAGACG AGGAGGCGGC CGATCGCGCC GCGGACCTCG CGCTCGACGC GTCGCACCGG
CTGGGCTCCA ATCACCGGCT GCTGCAGGCG CTGACCGACT ACCCGGCGGT CGTGTCGCGC
CGGATCGACG CCGAGCAGGG GGCCGACTCC GCGTGGCATG AGCTGGGCCG CGCGCTGATC
GCCCAGGGCG CCGGCGGGCT GTCGGTGATC GGCAGCGTCG TCGAGCTGCG CGAGTTCGGC
GAGCCGGCGC TCGCGATCGC GGGGGCGACG GTGCGGGCGC GGATCGCCAA GAGCTACGAG
CTGGTCGCGT TCCTGATCGT GCGCGGGGGC GACGGCGTGG ACCGCGACGA GCTGCTCGAC
GCGCTGTTCG ACGGCCGCGC CGACGATTCC GCGCGCTCCT ACCTGCGGCA GGCGATCTAC
TGGGTGCGCC GCGCGCTGCC CGAGGGCGGG CTGCTCGTCG AGAAGCGGCG CGTGCGGCTG
GGCGCCGAGC TGACCGTCTC CAGCGAGTCG ACCACGTTCG AGGCGCGGCT GGCGGAGGCC
GCTCGGCTGC AGGGGGAGGA GCGGCTTGCC GCGACACGGG CCGCGCTGGA GCTGTACGAC
CGCGGCGAGT ACCTCGCGGG GGCCGCCTCG GGCTGGGTCG CCGCGCGCCG CGAGCAGCTC
GCCGAGCGCG CGCTCGACGC GCGCTACGAG GCCGCCGAGC TGGCGTTCGC GGCGGGCGCG
TACGGCGACG GGCGCGCGCT CGCCGAGCAG GTGCTGGGCG CCGACCCGCT GCGCGAGGCC
GCCTGGCGGC TGCTGATGCG GATCGCCAGC GCGCTCGGCG ACGAGGACCG CGTGATCGCG
ACCTTCCGCG ACTGCGAGCG GGCGCTCGCG ACGATCGGCA CGGCACCGTC CGCGGTGACG
CGCGAGCTGG TCGAGCGGCT GCGCCGCTGA
 
Protein sequence
MPRRVGSEHP RVALIQRKLA PPPPPAPLVE RPRLEQLLEA LFERHRVVVV SATAGAGKTT 
AVAAAARRLT WPLAWLSVDR TDAAPGRLVT YLEAALARTR PELAGLASEA LAAGIPHAEA
VGLLIEAAGD APLLIVLDDL ERLGDAPEAL AVIEALLRYA PPAVHVALIS RRDVPSVLSV
PSGHVSATIS EGDLAFTVTE AAEALELLGE RETDAAEVVA ATGGWVTGVL FEAWRSAAHV
AGLGGEADPL DGYLSSQLLE QLDPAARDFL VATSVLEEVT VARATALGQP AAAERVLALR
AAHLPVVWGE GGTSMRCHPR FRDYLLARLE RRGPDAVREV RVAHGRLLAR EGHHEEATEV
LLRAGADEQA LTSAEQAIFP VIERLDGAVA ERWLTALADV APSGATQLTI AELALSLTGD
DYRRGVRVAD QLVELDERER LAHASDRAAG LMASLYAGVG RLDDAHAVLD AASGGLGAST
VRYMLRMYEG GPAQPRPEPG GEEADGVLSY ADYFCGRLDK LGEAPLSRWA YAVARPWQIA
ARAALGHTAE ALELYEAAQD AGVMTVSLAV CPGPEVLMDA GRLEQARALI AHGRRLARAT
GALDLECVNR LMEGKLALRL ERDPEAARAV LEQLERDLEP HAWPFLIGPI NAWLGAALLL
QGDDVAALAR LSRAVEVLVD ADLILELPTA AGYLAEAQWR AGDEEAADRA ADLALDASHR
LGSNHRLLQA LTDYPAVVSR RIDAEQGADS AWHELGRALI AQGAGGLSVI GSVVELREFG
EPALAIAGAT VRARIAKSYE LVAFLIVRGG DGVDRDELLD ALFDGRADDS ARSYLRQAIY
WVRRALPEGG LLVEKRRVRL GAELTVSSES TTFEARLAEA ARLQGEERLA ATRAALELYD
RGEYLAGAAS GWVAARREQL AERALDARYE AAELAFAAGA YGDGRALAEQ VLGADPLREA
AWRLLMRIAS ALGDEDRVIA TFRDCERALA TIGTAPSAVT RELVERLRR
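
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is a minimal illustration, not the pipeline that produced this record: it assumes NCBI translation table 11 (the value listed under Translation table), which shares the standard codon-to-amino-acid assignments and lets alternative start codons such as GTG be read as methionine at the first position. The `translate` function and its constants are names chosen for this example.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon is read as methionine regardless of its usual meaning.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_row = "GTGCCCAGAA GGGTGGGGAG CGAGCACCCG CGGGTCGCGC TCATCCAGCG CAAGCTGGCG"
    print(translate(first_row))
```

Applied to the first row of the gene sequence, this yields the first 20 residues of the protein sequence, with the leading GTG read as M.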