Gene Cwoe_4022 details

Gene Information

Locus tag: Cwoe_4022
Symbol:
ID: 8734480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4271656
End bp: 4277625
Gene Length: 5970 bp
Protein Length: 1989 aa
Translation table: 11
GC content: 72%
IMG OID: 646504647
Product: conserved repeat domain protein
Protein accession: YP_003395814
Protein GI: 284045474
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
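
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4271656, 4277625

length = gene_length_bp(start_bp, end_bp)
print(length)                              # 5970, matching "Gene Length"
print(expected_protein_length_aa(length))  # 1989, matching "Protein Length"
```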


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTGC CCCGAAGCGC CGCGCGTCGC GCGGCACGCC CGCTGGCGAC CGCCCGGCGC 
GCGCTGCTCG CCGCCGCGGC GCTGACCGCG ACGATGACGA TCCTCGCGCT GCTGGCGCCG
GCCGCACAGG CCGTGCCGTT CGACTGCACG GGCGCGACGA TCTACTCGGC CATGCGCGGC
GCGACCAACT CGACGGCGTC CAACGGCACG ATCTTCGCGC TCGACGAGAG CACCGTGGGC
GGCGCGCAGG TCACCAGCAC CCTGGTCACG ACGATCCCGT CCGGCGGCTT CGCGAACGGG
ATCGGGATCA CCAGAGGCGG CACCGCGCTC TACGCCGTCG ACCAGGCCGC GACGGGGTCG
GCGGTCATCC GCGCGTATGA CGCGATCGCG GAGACGTGGA CGAGCTACAC CGGTTCCGCC
GGGACGGAGA GCTTCGTCGC CGGAGCGATC AACCCCGCCA ACGGGCTCTA CTACTACGCC
GCCTATGCGG CCGGCACGGC GACGACGGCC GGCACCGCGA CCGTCTACGC GTTCGACACG
ACCACGAACA CGCCGGTCAC GGGCAAGATC GCGACGATCA ACCTGCCGAT CGTCGGCGCC
GGCGGCCCCA ACGGCGACAT CGCGTTCGAC GCGCTCGGCA ACCTCTACCT GCTGGAGTCG
GTCGGGACGA CGGTCGCGAT CAGCCGCGTC AACGCCGCGT CGCTGCCGAC GACCGGCTCG
CCGACGGGCG CGACGGTGAC GAGCACCAGA GTCGCGGGCT TCACCAGCAG CGGGCCGCTC
TACAACGGGA TCTCGTTCGA CAACGCGGGC AACCTGTACG TCCTCAACGC CGGACCGAAC
CAGCTGACGA GAATCAACCC GAACACCGGC GTCGCGCTCG CCGGGCCGAC CAGCCTCGAC
GCCGCCGCGC AGGCGTTCGC GAACGTCGAC CTCGCCGCCT GCGCGACCAA CCCGACGCTG
TCGCTGCGCA AGGAGATCCT CGGCCGCTAC GCCCCCGCTG ACCAGTTCGG CCTGTCGATA
TCCGGCGGCG GATTGACCTC GGGCAACGTC GCGACGACGA GCGGCGGCGC GAGAGGGATC
CAGACGGCGG TCGCGGGTCC GATCATCGCG CAGTCGGGCA CCACCTACAC GCTGACGGAG
TCGGCGGCCA GAGGCGCGAG ACTGGCCAAC TACAGAACGA CGTACGACTG CGTCGACACG
GCCAACGGCA ATGCGCCGGT CTCCTCCGGC AGCGGCGCGT CGTTCACCGT GCCGTTCCCG
GCGCCGGTCA TCGGCAGAGC CAGCTCGAAC ATCCTCTGCA CGTTCCTCAA CTCCCCGCTC
GCGCCGTCGC TCACGCTCGA CAAGTCCGCC GACAGAAGAA GACTGATCGT CGGCGACACG
GTCACGTACT CGTTCCTCGT CACCAACACC GGCGACGTGA CGCTCGCGCC GGTCACCGTG
ACCGACACCT CCTTCAGCGG CTCCGGCACG CCGCCGGCGA TCTCGTGCCC GCCGGCCGCC
GCCTCGCTCG CGCCGAGCGC GTCGGTCACG TGCACGGCGA CGTACGTCGT CACGCAGGCC
GACGTCGACA CCGGCGTGGT CGCGAACACC GCGGTCGCGA CGGGCGACTC GCCCGCGGGC
GATCCGATCA GATCGCCGCC GTCCTCGACC TCGGTCCCGC AGGCGCCCGA ACCGGCGCTC
GACCTCGCCA AGAGCGCGTC CCCCGCGACG ATCAGCGCGG CAGGCGACAC GGTCACGTAC
AGATTCCTCG TCACCAACGT CGGCAACGTG ACGCTGGCGC CGGTGACGGT GAGAGAGACG
GCGTTCTCGG GCAGCGGCAC CGCGCCGGTG GTGAGATGCC CGGCGGGCGC GGCGTCGCTT
GCGCCCGGCG CGCAGGTGAC GTGCACGGCG ACGTACACCG CGACGCAGGC CGACGTCAAC
AGAGGCCAGA TCGACAACAC CGCGGTCGCG ACGGGCACGC CGCCGGTCGG TCCGCCGGTC
GACTCGCCGC CGTCGAGAGC GACGGTCACC GCGTCGGCGA CGCCGTCGCT GACGGTCGTG
AAGTCCAACG ACAGCGGCAC GTCGTTCGTG CTCGGGCAGG TGATCACATA CAGATACGTG
GTGACGAACA CGGGCAACGT CACGTTGGCG CCCGTGACCG TGAGAGAGAC GGCGTTCACC
GGATCCGGAA CCCCGTCGGC GATCAGCTGC CCGCCGGCGG CGGCGTCGCT GGCACCCGGT
GCGCAGGTCA CGTGCAGCTC GACCTACACC GTGACGCAGA CCGACGTCAA CAGAGGCCAG
ATCGACAACA CCGCGGTCGC GACTGGCACG CCGCCGACCG GTCCACCGGT CGACTCGCCG
CCGTCCAGAT CGACGTCGCC GAGCACGCCG GCGCCTGCGT TGACGATCGC CAAGAGCGCG
TCGCCCGCGA CCTTCAGAGC CGCCGGCGAC ACGATCACGT ACAGATTCGT CGTGACGAAC
ACCGGGAACG TGACGCTGGC GCCGGTGACC GTGAGAGAGA CGGCGTTCAC CGGCACCGGC
ACCGCGCCGG TGGTGAGATG CCCGGCGGGC GCGGCGTCGC TCGACCCCGG CGGGCAGGTG
ACGTGCACGG CGACGTACAC CGTGACCCAA GCTGACGCCG ACAGAGGCGA GATCGACAAC
ACCGCGGTCG CGACGGGCAC GCCGCCGACC GGTCCGCCGG TCGACTCGCC GCCGTCGAGA
GCGACGGTCA ACGGTCCGGC GTCGCCGTCG CTGACGGTCG TCAAGTCCGT CAGCCCGCCG
TCGTTGAGCG GGGCGGGCCA GGAGCTGACC TACTCGTTCG TCGTCACCAA CACCGGCAAC
GTGACGCTTG CTCCCGTGAC CGTGAGAGAG ACCGCGTTCA CCGGATCGGG GCCGTCGCCG
ACGATCTCCT GCCCACCGGG CGCGGCGTCC CTGGCGCCGG GCGCGCAGGT GATCTGCACC
GCCAGATACA CGGTCACGCA GGACGACTTC GACAGAGACA GCCTGGAGAA CACGGCCGTC
GCGACGGGCG TCCCGCCGAG AGGCCCCCCG GTCGACTCGC CGCCGTCCGA CGCCTCGGTC
CCGTTCACGC CCGAACCGAG ACTCGACATC GTCAAGACGG CGAACCCCAC CGCGGTCAGC
GCGGCGGGCG ACCTCGTCTC GTACAGCTTC CTCGTCACCA ACACGGGCAA CGTGACGGTC
GGCTCGGTCG CGGTCACGGA CACGTTCACG GCGCCCTCGA CCGGCACGCT CGGCCCGATC
AGCTGCCCGC AGACGAGACT GATTCCGGGG CAGTCGACGA CGTGCACGGC GCCGGCGTAC
GCGGCGACGC AGGCCGACAT CGACAACGGC ATCATCCGCA ACTCGGCGTT CGCCACCGGT
GAGGACCCGG GCGGCGACCC GGTTGTCTCC GGCAGATCGC CCGCGACCGT CGAGGTCCTC
GCGCAGCCGG GCATCACGAT CGTCAAGTCC GCGAACGTCA GAAGCTTCGC GAGACCCGGG
ACGTTGGTCA CGTTCAGCTT CGAGGTGAGA AACACCGGCA ACGTGACGCT CGACCCCGTC
GTCGTGAGCG ACCCGCTGCC CGGCCTCTCG CCGATCTCGT GCCCGCAGAC GAGACTCGCG
CCGGCGGCAT CCCAGACGTG CACCGCCACC TACACGACGA CCGGCGCCGA CGTCAACGCG
GGTGAGATCG ACAACACCGG CACCGTCACC GGCCAGCCGC CGACGCCGTT CGGCGGAACC
CCGCCGCCGC CCGTCACCGA CAGATCGAGC ACGACGGTGC CCGCGGACCA GGCGCCGGCG
CTGAGCATCG TCAAGACGGC GACGCCGACG TCGGTCACCG CCGCGGGTGA CGCGATCGCA
TACAGATTCC TCGTCACCAA CACCGGCAAC GTGACGCTCA CGGGCGTGGC GGTGAGAGAC
ACGTTCACGC CGCCTGCGAC CGGCCCCGGC GGGCCGATCA CCTGCCTCGT GACGACGCTC
GACCCCGGCG ACTCGACGAC GTGCACCGCG CCGCCGTATC TCGCGAGCCA GGCGGACGCC
GACAACGGCA GGATCGACAA CACGGCGATC GCGACCGGCA CGCCACCGAG AGGTCAGCTC
GTCGACTCGC CGCCGTCGGC GGCCGTCGTC ACGATCGCGC CTGACCCGGG GATCTCGCTC
GTGAAGTCCG CGAGCGTCAC CGAGTACAAG GTCGCCGGGA CGGTCGTGAC GTACAGCTAC
GCGGTGAGAA ACACCGGCAA CGTGACGCTC GATCCGGTCG TCGTGACCGA CCCGATGCCC
GGTCTGTCGG CGCTCTCCTG CCCGCAGACG AGACTCGCCC CGGGCGCCTC CGAGGTCTGC
ACGGCGAGAT ACACGACGAC CGCGGCCGAC GTCCTCGCCG GCGCGCTCGA CAACACCGGC
ACCGCGACGG GCTCGCCGCC GTCGACCCAG TCCAACCCGA ACCCGCCGCC GGTGAGAGCG
ACCTCGAGCG TCTCGGTGCC GGCGCGGCCC GAGGCCGACC TGTCGATCGT CAAGACGGCC
TCGCCCGGCG TCGCGACGCC GGGGCGGAGC CTGACGTACA CGCTCACGGT CAGAAACGAC
GGGCCGTCCG ACGCGATCGC GGTCGTCGTC TCCGATCCGC TGCCCGCCGG GCTCACGTTC
GTCTCCGCGA GCGCGGGCTG CAGCGCCGCC GGCCAGGACG TCACGTGCAC GCGCGCGTCG
CTGGCCGCGG GTGAGACGGC CACGTTCACC GTCACCGCGA ACGTCGCCGG CGACGTCGCC
CACGCGATCG ACAACACCGC GACGGTCAGA AGCGACACGC CGGATCCCGA CCCGACGGAC
AACAGATCCA GAGTCGAGGT GCCGGTCAGA GGGGAGACCG ACCTCTCGAT CGTCAAGACG
CCGTCGACGA CGACGCCCGG CCCGAGTGGG CAGGTGATCT ACACGCTCGT GGTCAGGAAC
GCCGGCCCGA GCGCCGCGAC CGGCGTGAAG GTCTCCGACC CGATGCCGGC GGGCCTGACC
GTGCAGAGCG CGACGCCGAG CCAGGGCAGC TGCTCGATCG CGGGCCGCAC CGTGTCGTGC
GACCTCGGCG GGATCGCCGC CGGCGGCGGC GTGCAGGTGC TGGTGGCGGC GAACGTCGCG
GCCGGCGCGA GCGGGGCGAT CGTCAACACC GCGACCGTCA CCGGCGACCA GGACGACCCG
AGACCCGGCG ACAACAGAAG CAGCACGACG GTGACGCCCG GGCAGACGCC GGCCCCGGCC
GCCGACCTCG TGGTGACGAA GACGACGAGC GCGAGAGAGG TCGTCGTCGG CAGACGCCTG
ACGTACGAGA TCGTCGTCAG AAACGTCTCC GCGCATCCGG CGTTCGCGGT CGCGCTGACC
GACACGTTCG GGCTTCCCGC GCGGATCGTC TCGGTCCGTG CGACCCAGGG CAGCTGCCTC
CCACGGGCGC CGCTGACGTG TGCGCTCGGC ACGATCGCGG CGGGCAGATC GGTCATGGTC
ACCGTCGTCG CGTACCCGCG CGCCACGGGC AGACTGCGCA ACGCCGCGAG CGCGACGTCC
CGCGCGCAGG ACCCGACGCC GCGCAACAAC GTCGCGGGCG TCTCGCGCAG CGTCGGCAGA
CCGCGGCTGC GGATCGCCAA GACGGCGGAC GTCCGCGTCG TGCGGGCCGG TGACACCGTC
GAGTACGCGA TACGCGTCAG CAACCCGTCC GCGGTCACGC TCCGCAACGT GCGCGTCTGC
GACACGCTCC CGCCCGGGCT CGTGCGCGAG GACGCGACGC CGGGCGCCAC GCTGCGGAGA
GGCGCCTACT GCTGGAGCGT GAGATCGCTG CCAGCCGGCG AGTCGCGGAC GTTCTCGATG
AGGGCTGGAG CGATCCGCGG CGCGCGCGGC AGCAAGGTCA ACACCGCGAC GGCGACCGCG
CCCGGCGCGC GTGGCGACCG CGCGCAGCGC ACCGTCCGCG TCGTCGCGGG CGCGGTCGCG
CCCGCGACGG GAGGAGGCGT GACCGGCTGA
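
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and only the first line of the sequence is used here for brevity. Applied to the full sequence, the result could be compared against the GC content listed in the Gene Information section.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAGCCTGC CCCGAAGCGC CGCGCGTCGC GCGGCACGCC CGCTGGCGAC CGCCCGGCGC"

seq = clean_sequence(first_line)
print(len(seq))                     # number of bases after joining the blocks
print(round(gc_fraction(seq), 2))   # GC fraction of this line only
```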
 
Protein sequence
MSLPRSAARR AARPLATARR ALLAAAALTA TMTILALLAP AAQAVPFDCT GATIYSAMRG 
ATNSTASNGT IFALDESTVG GAQVTSTLVT TIPSGGFANG IGITRGGTAL YAVDQAATGS
AVIRAYDAIA ETWTSYTGSA GTESFVAGAI NPANGLYYYA AYAAGTATTA GTATVYAFDT
TTNTPVTGKI ATINLPIVGA GGPNGDIAFD ALGNLYLLES VGTTVAISRV NAASLPTTGS
PTGATVTSTR VAGFTSSGPL YNGISFDNAG NLYVLNAGPN QLTRINPNTG VALAGPTSLD
AAAQAFANVD LAACATNPTL SLRKEILGRY APADQFGLSI SGGGLTSGNV ATTSGGARGI
QTAVAGPIIA QSGTTYTLTE SAARGARLAN YRTTYDCVDT ANGNAPVSSG SGASFTVPFP
APVIGRASSN ILCTFLNSPL APSLTLDKSA DRRRLIVGDT VTYSFLVTNT GDVTLAPVTV
TDTSFSGSGT PPAISCPPAA ASLAPSASVT CTATYVVTQA DVDTGVVANT AVATGDSPAG
DPIRSPPSST SVPQAPEPAL DLAKSASPAT ISAAGDTVTY RFLVTNVGNV TLAPVTVRET
AFSGSGTAPV VRCPAGAASL APGAQVTCTA TYTATQADVN RGQIDNTAVA TGTPPVGPPV
DSPPSRATVT ASATPSLTVV KSNDSGTSFV LGQVITYRYV VTNTGNVTLA PVTVRETAFT
GSGTPSAISC PPAAASLAPG AQVTCSSTYT VTQTDVNRGQ IDNTAVATGT PPTGPPVDSP
PSRSTSPSTP APALTIAKSA SPATFRAAGD TITYRFVVTN TGNVTLAPVT VRETAFTGTG
TAPVVRCPAG AASLDPGGQV TCTATYTVTQ ADADRGEIDN TAVATGTPPT GPPVDSPPSR
ATVNGPASPS LTVVKSVSPP SLSGAGQELT YSFVVTNTGN VTLAPVTVRE TAFTGSGPSP
TISCPPGAAS LAPGAQVICT ARYTVTQDDF DRDSLENTAV ATGVPPRGPP VDSPPSDASV
PFTPEPRLDI VKTANPTAVS AAGDLVSYSF LVTNTGNVTV GSVAVTDTFT APSTGTLGPI
SCPQTRLIPG QSTTCTAPAY AATQADIDNG IIRNSAFATG EDPGGDPVVS GRSPATVEVL
AQPGITIVKS ANVRSFARPG TLVTFSFEVR NTGNVTLDPV VVSDPLPGLS PISCPQTRLA
PAASQTCTAT YTTTGADVNA GEIDNTGTVT GQPPTPFGGT PPPPVTDRSS TTVPADQAPA
LSIVKTATPT SVTAAGDAIA YRFLVTNTGN VTLTGVAVRD TFTPPATGPG GPITCLVTTL
DPGDSTTCTA PPYLASQADA DNGRIDNTAI ATGTPPRGQL VDSPPSAAVV TIAPDPGISL
VKSASVTEYK VAGTVVTYSY AVRNTGNVTL DPVVVTDPMP GLSALSCPQT RLAPGASEVC
TARYTTTAAD VLAGALDNTG TATGSPPSTQ SNPNPPPVRA TSSVSVPARP EADLSIVKTA
SPGVATPGRS LTYTLTVRND GPSDAIAVVV SDPLPAGLTF VSASAGCSAA GQDVTCTRAS
LAAGETATFT VTANVAGDVA HAIDNTATVR SDTPDPDPTD NRSRVEVPVR GETDLSIVKT
PSTTTPGPSG QVIYTLVVRN AGPSAATGVK VSDPMPAGLT VQSATPSQGS CSIAGRTVSC
DLGGIAAGGG VQVLVAANVA AGASGAIVNT ATVTGDQDDP RPGDNRSSTT VTPGQTPAPA
ADLVVTKTTS AREVVVGRRL TYEIVVRNVS AHPAFAVALT DTFGLPARIV SVRATQGSCL
PRAPLTCALG TIAAGRSVMV TVVAYPRATG RLRNAASATS RAQDPTPRNN VAGVSRSVGR
PRLRIAKTAD VRVVRAGDTV EYAIRVSNPS AVTLRNVRVC DTLPPGLVRE DATPGATLRR
GAYCWSVRSL PAGESRTFSM RAGAIRGARG SKVNTATATA PGARGDRAQR TVRVVAGAVA
PATGGGVTG