Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_4022 |
Symbol | |
ID | 8734480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4271656 |
End bp | 4277625 |
Gene Length | 5970 bp |
Protein Length | 1989 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646504647 |
Product | conserved repeat domain protein |
Protein accession | YP_003395814 |
Protein GI | 284045474 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGC CCCGAAGCGC CGCGCGTCGC GCGGCACGCC CGCTGGCGAC CGCCCGGCGC GCGCTGCTCG CCGCCGCGGC GCTGACCGCG ACGATGACGA TCCTCGCGCT GCTGGCGCCG GCCGCACAGG CCGTGCCGTT CGACTGCACG GGCGCGACGA TCTACTCGGC CATGCGCGGC GCGACCAACT CGACGGCGTC CAACGGCACG ATCTTCGCGC TCGACGAGAG CACCGTGGGC GGCGCGCAGG TCACCAGCAC CCTGGTCACG ACGATCCCGT CCGGCGGCTT CGCGAACGGG ATCGGGATCA CCAGAGGCGG CACCGCGCTC TACGCCGTCG ACCAGGCCGC GACGGGGTCG GCGGTCATCC GCGCGTATGA CGCGATCGCG GAGACGTGGA CGAGCTACAC CGGTTCCGCC GGGACGGAGA GCTTCGTCGC CGGAGCGATC AACCCCGCCA ACGGGCTCTA CTACTACGCC GCCTATGCGG CCGGCACGGC GACGACGGCC GGCACCGCGA CCGTCTACGC GTTCGACACG ACCACGAACA CGCCGGTCAC GGGCAAGATC GCGACGATCA ACCTGCCGAT CGTCGGCGCC GGCGGCCCCA ACGGCGACAT CGCGTTCGAC GCGCTCGGCA ACCTCTACCT GCTGGAGTCG GTCGGGACGA CGGTCGCGAT CAGCCGCGTC AACGCCGCGT CGCTGCCGAC GACCGGCTCG CCGACGGGCG CGACGGTGAC GAGCACCAGA GTCGCGGGCT TCACCAGCAG CGGGCCGCTC TACAACGGGA TCTCGTTCGA CAACGCGGGC AACCTGTACG TCCTCAACGC CGGACCGAAC CAGCTGACGA GAATCAACCC GAACACCGGC GTCGCGCTCG CCGGGCCGAC CAGCCTCGAC GCCGCCGCGC AGGCGTTCGC GAACGTCGAC CTCGCCGCCT GCGCGACCAA CCCGACGCTG TCGCTGCGCA AGGAGATCCT CGGCCGCTAC GCCCCCGCTG ACCAGTTCGG CCTGTCGATA TCCGGCGGCG GATTGACCTC GGGCAACGTC GCGACGACGA GCGGCGGCGC GAGAGGGATC CAGACGGCGG TCGCGGGTCC GATCATCGCG CAGTCGGGCA CCACCTACAC GCTGACGGAG TCGGCGGCCA GAGGCGCGAG ACTGGCCAAC TACAGAACGA CGTACGACTG CGTCGACACG GCCAACGGCA ATGCGCCGGT CTCCTCCGGC AGCGGCGCGT CGTTCACCGT GCCGTTCCCG GCGCCGGTCA TCGGCAGAGC CAGCTCGAAC ATCCTCTGCA CGTTCCTCAA CTCCCCGCTC GCGCCGTCGC TCACGCTCGA CAAGTCCGCC GACAGAAGAA GACTGATCGT CGGCGACACG GTCACGTACT CGTTCCTCGT CACCAACACC GGCGACGTGA CGCTCGCGCC GGTCACCGTG ACCGACACCT CCTTCAGCGG CTCCGGCACG CCGCCGGCGA TCTCGTGCCC GCCGGCCGCC GCCTCGCTCG CGCCGAGCGC GTCGGTCACG TGCACGGCGA CGTACGTCGT CACGCAGGCC GACGTCGACA CCGGCGTGGT CGCGAACACC GCGGTCGCGA CGGGCGACTC GCCCGCGGGC GATCCGATCA GATCGCCGCC GTCCTCGACC TCGGTCCCGC AGGCGCCCGA ACCGGCGCTC GACCTCGCCA AGAGCGCGTC CCCCGCGACG ATCAGCGCGG CAGGCGACAC GGTCACGTAC AGATTCCTCG TCACCAACGT CGGCAACGTG ACGCTGGCGC CGGTGACGGT GAGAGAGACG GCGTTCTCGG GCAGCGGCAC CGCGCCGGTG GTGAGATGCC CGGCGGGCGC GGCGTCGCTT GCGCCCGGCG CGCAGGTGAC GTGCACGGCG ACGTACACCG CGACGCAGGC CGACGTCAAC AGAGGCCAGA TCGACAACAC CGCGGTCGCG ACGGGCACGC CGCCGGTCGG TCCGCCGGTC GACTCGCCGC CGTCGAGAGC GACGGTCACC GCGTCGGCGA CGCCGTCGCT GACGGTCGTG AAGTCCAACG ACAGCGGCAC GTCGTTCGTG CTCGGGCAGG TGATCACATA CAGATACGTG GTGACGAACA CGGGCAACGT CACGTTGGCG CCCGTGACCG TGAGAGAGAC GGCGTTCACC GGATCCGGAA CCCCGTCGGC GATCAGCTGC CCGCCGGCGG CGGCGTCGCT GGCACCCGGT GCGCAGGTCA CGTGCAGCTC GACCTACACC GTGACGCAGA CCGACGTCAA CAGAGGCCAG ATCGACAACA CCGCGGTCGC GACTGGCACG CCGCCGACCG GTCCACCGGT CGACTCGCCG CCGTCCAGAT CGACGTCGCC GAGCACGCCG GCGCCTGCGT TGACGATCGC CAAGAGCGCG TCGCCCGCGA CCTTCAGAGC CGCCGGCGAC ACGATCACGT ACAGATTCGT CGTGACGAAC ACCGGGAACG TGACGCTGGC GCCGGTGACC GTGAGAGAGA CGGCGTTCAC CGGCACCGGC ACCGCGCCGG TGGTGAGATG CCCGGCGGGC GCGGCGTCGC TCGACCCCGG CGGGCAGGTG ACGTGCACGG CGACGTACAC CGTGACCCAA GCTGACGCCG ACAGAGGCGA GATCGACAAC ACCGCGGTCG CGACGGGCAC GCCGCCGACC GGTCCGCCGG TCGACTCGCC GCCGTCGAGA GCGACGGTCA ACGGTCCGGC GTCGCCGTCG CTGACGGTCG TCAAGTCCGT CAGCCCGCCG TCGTTGAGCG GGGCGGGCCA GGAGCTGACC TACTCGTTCG TCGTCACCAA CACCGGCAAC GTGACGCTTG CTCCCGTGAC CGTGAGAGAG ACCGCGTTCA CCGGATCGGG GCCGTCGCCG ACGATCTCCT GCCCACCGGG CGCGGCGTCC CTGGCGCCGG GCGCGCAGGT GATCTGCACC GCCAGATACA CGGTCACGCA GGACGACTTC GACAGAGACA GCCTGGAGAA CACGGCCGTC GCGACGGGCG TCCCGCCGAG AGGCCCCCCG GTCGACTCGC CGCCGTCCGA CGCCTCGGTC CCGTTCACGC CCGAACCGAG ACTCGACATC GTCAAGACGG CGAACCCCAC CGCGGTCAGC GCGGCGGGCG ACCTCGTCTC GTACAGCTTC CTCGTCACCA ACACGGGCAA CGTGACGGTC GGCTCGGTCG CGGTCACGGA CACGTTCACG GCGCCCTCGA CCGGCACGCT CGGCCCGATC AGCTGCCCGC AGACGAGACT GATTCCGGGG CAGTCGACGA CGTGCACGGC GCCGGCGTAC GCGGCGACGC AGGCCGACAT CGACAACGGC ATCATCCGCA ACTCGGCGTT CGCCACCGGT GAGGACCCGG GCGGCGACCC GGTTGTCTCC GGCAGATCGC CCGCGACCGT CGAGGTCCTC GCGCAGCCGG GCATCACGAT CGTCAAGTCC GCGAACGTCA GAAGCTTCGC GAGACCCGGG ACGTTGGTCA CGTTCAGCTT CGAGGTGAGA AACACCGGCA ACGTGACGCT CGACCCCGTC GTCGTGAGCG ACCCGCTGCC CGGCCTCTCG CCGATCTCGT GCCCGCAGAC GAGACTCGCG CCGGCGGCAT CCCAGACGTG CACCGCCACC TACACGACGA CCGGCGCCGA CGTCAACGCG GGTGAGATCG ACAACACCGG CACCGTCACC GGCCAGCCGC CGACGCCGTT CGGCGGAACC CCGCCGCCGC CCGTCACCGA CAGATCGAGC ACGACGGTGC CCGCGGACCA GGCGCCGGCG CTGAGCATCG TCAAGACGGC GACGCCGACG TCGGTCACCG CCGCGGGTGA CGCGATCGCA TACAGATTCC TCGTCACCAA CACCGGCAAC GTGACGCTCA CGGGCGTGGC GGTGAGAGAC ACGTTCACGC CGCCTGCGAC CGGCCCCGGC GGGCCGATCA CCTGCCTCGT GACGACGCTC GACCCCGGCG ACTCGACGAC GTGCACCGCG CCGCCGTATC TCGCGAGCCA GGCGGACGCC GACAACGGCA GGATCGACAA CACGGCGATC GCGACCGGCA CGCCACCGAG AGGTCAGCTC GTCGACTCGC CGCCGTCGGC GGCCGTCGTC ACGATCGCGC CTGACCCGGG GATCTCGCTC GTGAAGTCCG CGAGCGTCAC CGAGTACAAG GTCGCCGGGA CGGTCGTGAC GTACAGCTAC GCGGTGAGAA ACACCGGCAA CGTGACGCTC GATCCGGTCG TCGTGACCGA CCCGATGCCC GGTCTGTCGG CGCTCTCCTG CCCGCAGACG AGACTCGCCC CGGGCGCCTC CGAGGTCTGC ACGGCGAGAT ACACGACGAC CGCGGCCGAC GTCCTCGCCG GCGCGCTCGA CAACACCGGC ACCGCGACGG GCTCGCCGCC GTCGACCCAG TCCAACCCGA ACCCGCCGCC GGTGAGAGCG ACCTCGAGCG TCTCGGTGCC GGCGCGGCCC GAGGCCGACC TGTCGATCGT CAAGACGGCC TCGCCCGGCG TCGCGACGCC GGGGCGGAGC CTGACGTACA CGCTCACGGT CAGAAACGAC GGGCCGTCCG ACGCGATCGC GGTCGTCGTC TCCGATCCGC TGCCCGCCGG GCTCACGTTC GTCTCCGCGA GCGCGGGCTG CAGCGCCGCC GGCCAGGACG TCACGTGCAC GCGCGCGTCG CTGGCCGCGG GTGAGACGGC CACGTTCACC GTCACCGCGA ACGTCGCCGG CGACGTCGCC CACGCGATCG ACAACACCGC GACGGTCAGA AGCGACACGC CGGATCCCGA CCCGACGGAC AACAGATCCA GAGTCGAGGT GCCGGTCAGA GGGGAGACCG ACCTCTCGAT CGTCAAGACG CCGTCGACGA CGACGCCCGG CCCGAGTGGG CAGGTGATCT ACACGCTCGT GGTCAGGAAC GCCGGCCCGA GCGCCGCGAC CGGCGTGAAG GTCTCCGACC CGATGCCGGC GGGCCTGACC GTGCAGAGCG CGACGCCGAG CCAGGGCAGC TGCTCGATCG CGGGCCGCAC CGTGTCGTGC GACCTCGGCG GGATCGCCGC CGGCGGCGGC GTGCAGGTGC TGGTGGCGGC GAACGTCGCG GCCGGCGCGA GCGGGGCGAT CGTCAACACC GCGACCGTCA CCGGCGACCA GGACGACCCG AGACCCGGCG ACAACAGAAG CAGCACGACG GTGACGCCCG GGCAGACGCC GGCCCCGGCC GCCGACCTCG TGGTGACGAA GACGACGAGC GCGAGAGAGG TCGTCGTCGG CAGACGCCTG ACGTACGAGA TCGTCGTCAG AAACGTCTCC GCGCATCCGG CGTTCGCGGT CGCGCTGACC GACACGTTCG GGCTTCCCGC GCGGATCGTC TCGGTCCGTG CGACCCAGGG CAGCTGCCTC CCACGGGCGC CGCTGACGTG TGCGCTCGGC ACGATCGCGG CGGGCAGATC GGTCATGGTC ACCGTCGTCG CGTACCCGCG CGCCACGGGC AGACTGCGCA ACGCCGCGAG CGCGACGTCC CGCGCGCAGG ACCCGACGCC GCGCAACAAC GTCGCGGGCG TCTCGCGCAG CGTCGGCAGA CCGCGGCTGC GGATCGCCAA GACGGCGGAC GTCCGCGTCG TGCGGGCCGG TGACACCGTC GAGTACGCGA TACGCGTCAG CAACCCGTCC GCGGTCACGC TCCGCAACGT GCGCGTCTGC GACACGCTCC CGCCCGGGCT CGTGCGCGAG GACGCGACGC CGGGCGCCAC GCTGCGGAGA GGCGCCTACT GCTGGAGCGT GAGATCGCTG CCAGCCGGCG AGTCGCGGAC GTTCTCGATG AGGGCTGGAG CGATCCGCGG CGCGCGCGGC AGCAAGGTCA ACACCGCGAC GGCGACCGCG CCCGGCGCGC GTGGCGACCG CGCGCAGCGC ACCGTCCGCG TCGTCGCGGG CGCGGTCGCG CCCGCGACGG GAGGAGGCGT GACCGGCTGA
|
Protein sequence | MSLPRSAARR AARPLATARR ALLAAAALTA TMTILALLAP AAQAVPFDCT GATIYSAMRG ATNSTASNGT IFALDESTVG GAQVTSTLVT TIPSGGFANG IGITRGGTAL YAVDQAATGS AVIRAYDAIA ETWTSYTGSA GTESFVAGAI NPANGLYYYA AYAAGTATTA GTATVYAFDT TTNTPVTGKI ATINLPIVGA GGPNGDIAFD ALGNLYLLES VGTTVAISRV NAASLPTTGS PTGATVTSTR VAGFTSSGPL YNGISFDNAG NLYVLNAGPN QLTRINPNTG VALAGPTSLD AAAQAFANVD LAACATNPTL SLRKEILGRY APADQFGLSI SGGGLTSGNV ATTSGGARGI QTAVAGPIIA QSGTTYTLTE SAARGARLAN YRTTYDCVDT ANGNAPVSSG SGASFTVPFP APVIGRASSN ILCTFLNSPL APSLTLDKSA DRRRLIVGDT VTYSFLVTNT GDVTLAPVTV TDTSFSGSGT PPAISCPPAA ASLAPSASVT CTATYVVTQA DVDTGVVANT AVATGDSPAG DPIRSPPSST SVPQAPEPAL DLAKSASPAT ISAAGDTVTY RFLVTNVGNV TLAPVTVRET AFSGSGTAPV VRCPAGAASL APGAQVTCTA TYTATQADVN RGQIDNTAVA TGTPPVGPPV DSPPSRATVT ASATPSLTVV KSNDSGTSFV LGQVITYRYV VTNTGNVTLA PVTVRETAFT GSGTPSAISC PPAAASLAPG AQVTCSSTYT VTQTDVNRGQ IDNTAVATGT PPTGPPVDSP PSRSTSPSTP APALTIAKSA SPATFRAAGD TITYRFVVTN TGNVTLAPVT VRETAFTGTG TAPVVRCPAG AASLDPGGQV TCTATYTVTQ ADADRGEIDN TAVATGTPPT GPPVDSPPSR ATVNGPASPS LTVVKSVSPP SLSGAGQELT YSFVVTNTGN VTLAPVTVRE TAFTGSGPSP TISCPPGAAS LAPGAQVICT ARYTVTQDDF DRDSLENTAV ATGVPPRGPP VDSPPSDASV PFTPEPRLDI VKTANPTAVS AAGDLVSYSF LVTNTGNVTV GSVAVTDTFT APSTGTLGPI SCPQTRLIPG QSTTCTAPAY AATQADIDNG IIRNSAFATG EDPGGDPVVS GRSPATVEVL AQPGITIVKS ANVRSFARPG TLVTFSFEVR NTGNVTLDPV VVSDPLPGLS PISCPQTRLA PAASQTCTAT YTTTGADVNA GEIDNTGTVT GQPPTPFGGT PPPPVTDRSS TTVPADQAPA LSIVKTATPT SVTAAGDAIA YRFLVTNTGN VTLTGVAVRD TFTPPATGPG GPITCLVTTL DPGDSTTCTA PPYLASQADA DNGRIDNTAI ATGTPPRGQL VDSPPSAAVV TIAPDPGISL VKSASVTEYK VAGTVVTYSY AVRNTGNVTL DPVVVTDPMP GLSALSCPQT RLAPGASEVC TARYTTTAAD VLAGALDNTG TATGSPPSTQ SNPNPPPVRA TSSVSVPARP EADLSIVKTA SPGVATPGRS LTYTLTVRND GPSDAIAVVV SDPLPAGLTF VSASAGCSAA GQDVTCTRAS LAAGETATFT VTANVAGDVA HAIDNTATVR SDTPDPDPTD NRSRVEVPVR GETDLSIVKT PSTTTPGPSG QVIYTLVVRN AGPSAATGVK VSDPMPAGLT VQSATPSQGS CSIAGRTVSC DLGGIAAGGG VQVLVAANVA AGASGAIVNT ATVTGDQDDP RPGDNRSSTT VTPGQTPAPA ADLVVTKTTS AREVVVGRRL TYEIVVRNVS AHPAFAVALT DTFGLPARIV SVRATQGSCL PRAPLTCALG TIAAGRSVMV TVVAYPRATG RLRNAASATS RAQDPTPRNN VAGVSRSVGR PRLRIAKTAD VRVVRAGDTV EYAIRVSNPS AVTLRNVRVC DTLPPGLVRE DATPGATLRR GAYCWSVRSL PAGESRTFSM RAGAIRGARG SKVNTATATA PGARGDRAQR TVRVVAGAVA PATGGGVTG
|
| |