Gene Information |
Locus tag | Cwoe_2988 |
Symbol | |
ID | 8733433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 3195290 |
End bp | 3198274 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646503602 |
Product | Htaa domain protein |
Protein accession | YP_003394782 |
Protein GI | 284044442 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
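The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes a single stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3195290
END_BP = 3198274
GENE_LENGTH_BP = 2985
PROTEIN_LENGTH_AA = 994


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for the values listed in this record.
print(computed_length == GENE_LENGTH_BP)
print(computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the 2985 bp span gives 995 codons, one of which is the stop codon, which matches the listed protein length of 994 aa.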
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.360944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.268871 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACATCC CCGTTCCGCC GGGAGGCCGG TGTGCGCCCG TACGCGCACG CGCCGGCCTC GCCGCGTCGC TCGCGCTCGC GTTCGCGCTC GCTGCGCTCG TCCTGCTCTT CCCCGCCGTC GCCGCGCGCG CGGCCGATCC GCCCGTCCCG ATCAGTGCGG GCTACGCCGA CTGGGGCCTC AAGCAGTCGT TCCGCACCTA CGTCGGCGCG TCCGGGATCA CGCTCGCCGA TGGCGCGACC AGAAACGCCG ACGGCAGCTT CCGCTTCCCG GTGACGAGCG GCAGCTACGA CGCCGCGACG AAGACGACGG TCGTCGCGTT CGCCGGCTCG GTCCACTTCA GCGCCCATGC CGGCGAGCTG GACCTCACCT TCGCCAGACC GCGCGTCGAG ATCCGGCCGA CCGGCGCGAA GCTGTTCGTC GCGATGAGAA GCCGGCCGCT CGGCGGCAGC ACGCCGACCG ACTACGGCGA CGTCCACCTC GCAACGGTCG ACGTCAGCGG CAGAGATCCC GCGATCACGC CGCCGACGAC GACCTGGTCG GCGCTCCCGA CGGCGCTGAC GGCCGCCGGC GTCCAGCCGT TCGCGAGCTT CTACGAGGCG GGCCAGGCAC TCGACCCGTT CACCTTCACC TACGACGGCC CGGGCGGCAA GCCCGACGTC GCCCCCGGCG AGACGTGGAC GCCTGCCGGC ACCGTGCGGT GGCAGCCGAC CGGCGCCGCC GCGGGTGTGC CGAGCAGCGT CGCCGACAGC ACCGGCGACC CGATGGGCTA CCACTCCTAC CCGTCCTTCG CCCGCAACGC GACCGCGCGG CAGCTGCTGG CGCTCGACGG TCGCGGCGGC AGGCTGACCG TCGTCGACGA GGCGACGCTG ACGCGCGTCG GAACGCCGAT CGACGGGCTC GACGCCGCCG TCGGCGGACT CGCGGTCGAC GAGCAGCGCA ACGTCGCGTA CACGGTCGCG GCCGGCTGGG GCAACGGCCC GCTCGTGCTC AACCGCGTCG ACGTCGCGGC CCGCACCGTG ACGAGGTTCA CGATCCCGGG CGGCGAACGC GGCATGTCGC TGCCGGAGCT CGCGGTCGAC CAGAGAAGCG GCCGCGTGCT CGTCTCCGAG CGGCTCGGCG GCCACCTCTT CGTGATCGAC GGCCGCGCGG CGACGATGGC GCTGCTGCAG ACGTTCTCGT TCGACGCCTA CGGCGGCAGC GGCGTCGACG TCGACGAGAC GACCGGCGAC GTCTACCTCG CGACGGGCTC GCAGAGCACC GTCCGCCGGC TCACTCCGAA CAGCGGCGCG GGCGACCCCT ACACGCTCGA CCCGACGCCG GTCGCGACGT TCGACGGCGC GACGATCCGG CTCCAGGTCT CCGACGACGG CAGACGCATC TGGGTCGGCA GAATCGTCGG CTCGTCGATC AATCCGACGC TGCTGACGCG CACCACCACC GGCTGGGAGA CCGGTGCCAC GATCGCGTTC GGTCCCGCGA TGCGCTGGTT CGGGACCGAC GGCGCGAGCG GAGACCTGAT CGGCGCAATC GCGCCGCGCA GCATCTCCGT CGTCGCAGCC GACACCCAGC ACGTCAACCC GCTCACGCTC GACCTGCCCG CGAGCGCACG CATCGAGGCG GCGATCCGCA CCGCGGACGG CGACGCGCTC TGGGCACTGG ACGGCGGCGA CAGCGTCGTG CGGCGGATCT CGCGGCACGT CTCGCCGACG GTCAGCGCGC AGCCGGACGA CGCGACCGTC ACCGTCTCCC CCGCCCAGCC GGCGCCGCAG GCGACGTTCA CCACGACCGC GAGCGGCACG 
CCCGCGCCGA CCGTCAGATG GCAGGCGAAG GCGCCAGGCG GCGCCTGGGA GGACGTCCCC GGCGCCGGCG CGACCGGCAC GACGCTCACC GTCGACGGTG CGACCGGCAC CAGCAGAACC GCCTACCGCG CGATCTTCAC CAACGCGGTT GGCTCGCACG CGAGCGAGCC GGCGACGCTG ACGGTCGACT ACGCGACCGG AGGGCTCGAC CGACGCGTCT GGATCACCGG CGGCGCGCTC GACTGGGGCG TCAAGGCGTC GTTCCGCAGA TATGTCGGCG GACCGATCGC GCACGGCGCC ATCACGACCG GCGGCGGCGC CACCGCCAAC GGCGACGGCA CGTTCCGCTT CGCCGCCGAC GGCGGGACGT ACGACCCGGC GAGCGGGCGC GCGACGCTGC GCTTCGGCGG CAGCGTCCGC TTCAGCGGCC ACGCGGGACA GCTCGACCTG ACGATCGCGC GGCCGCGCCT GGAGATCGAC GGCGGCGTCG CGACGCTGCA CGCCGACGTC TCCAGCAAGT CGCTCAGCGG CGGCGTCGTC GAGCAGTTCA CCGGCGTCGA CCTCGCGCGG GTCGACCTCG GCACGCCCGC CGCCGGTCCG GCGGTCGCCG GCGGCCGGCT CGGCTGGCGC GAGCTGCCGG CGGCACTGAC CGCGAACGGG GCGCCGGCGT TCGCCGCGTT CTACCCGGCC GGGACGGCGC TCGATCCGCT CTCGATCGAC GTCGCGTACT CGACCGAGGA GCCGCGAACG TCGGTGCCGC CGCCGGTCGG ACGGCCGGCG CCGCCGGCGC AGCCGGCGCC CACGCCCACA CCGGTGCGGA AGCCGGCCCG GAAGCCGGCC GCCGCCGCGA TCGCCGCGGT CAGAGGAGCG CAGGCGGTCG GCCGCAACCG GATCGCGCGC GTCGCGACGG TCGCCTGCGG CAGCGGCGCC GGGGCGTGCA CGGTGAAGGT ACCCGCGCGG GTGCGCGTGC GGATCGGCGG CCGTCTCTAC ACGGCGCAGG TGCTCGCGCC CAAGCGGTTG AAGGCGGGCG CGCGCGGGAA CGTGCGCGTG CGGCTGAGCA AGCGCGCCGC GGCGCGACTG GCCCGTCGCA GCACGCGCGT GACGCTGCGG GTCGTCACGA CGGCCGGGCC TCGCACGACC GTGAAGGCCG TCAGCGTCAG GCTGATCGGA CCCAAGCGCG GCTGA
|
Protein sequence | MHIPVPPGGR CAPVRARAGL AASLALAFAL AALVLLFPAV AARAADPPVP ISAGYADWGL KQSFRTYVGA SGITLADGAT RNADGSFRFP VTSGSYDAAT KTTVVAFAGS VHFSAHAGEL DLTFARPRVE IRPTGAKLFV AMRSRPLGGS TPTDYGDVHL ATVDVSGRDP AITPPTTTWS ALPTALTAAG VQPFASFYEA GQALDPFTFT YDGPGGKPDV APGETWTPAG TVRWQPTGAA AGVPSSVADS TGDPMGYHSY PSFARNATAR QLLALDGRGG RLTVVDEATL TRVGTPIDGL DAAVGGLAVD EQRNVAYTVA AGWGNGPLVL NRVDVAARTV TRFTIPGGER GMSLPELAVD QRSGRVLVSE RLGGHLFVID GRAATMALLQ TFSFDAYGGS GVDVDETTGD VYLATGSQST VRRLTPNSGA GDPYTLDPTP VATFDGATIR LQVSDDGRRI WVGRIVGSSI NPTLLTRTTT GWETGATIAF GPAMRWFGTD GASGDLIGAI APRSISVVAA DTQHVNPLTL DLPASARIEA AIRTADGDAL WALDGGDSVV RRISRHVSPT VSAQPDDATV TVSPAQPAPQ ATFTTTASGT PAPTVRWQAK APGGAWEDVP GAGATGTTLT VDGATGTSRT AYRAIFTNAV GSHASEPATL TVDYATGGLD RRVWITGGAL DWGVKASFRR YVGGPIAHGA ITTGGGATAN GDGTFRFAAD GGTYDPASGR ATLRFGGSVR FSGHAGQLDL TIARPRLEID GGVATLHADV SSKSLSGGVV EQFTGVDLAR VDLGTPAAGP AVAGGRLGWR ELPAALTANG APAFAAFYPA GTALDPLSID VAYSTEEPRT SVPPPVGRPA PPAQPAPTPT PVRKPARKPA AAAIAAVRGA QAVGRNRIAR VATVACGSGA GACTVKVPAR VRVRIGGRLY TAQVLAPKRL KAGARGNVRV RLSKRAAARL ARRSTRVTLR VVTTAGPRTT VKAVSVRLIG PKRG
|
| |
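The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any further processing. The following is a minimal sketch of that cleanup plus a basic sanity check on a coding sequence. The helper names are hypothetical, and the start codon set lists only the most common bacterial initiation codons rather than every start codon permitted by translation table 11.

```python
# Common bacterial initiation codons (a subset, chosen for illustration).
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def check_cds(cds: str) -> tuple[bool, bool, bool]:
    """Return (starts with a start codon, ends with a stop codon, length is in frame)."""
    return (
        cds[:3] in START_CODONS,
        cds[-3:] in STOP_CODONS,
        len(cds) % 3 == 0,
    )


# Illustration only: the first three and last two blocks of the gene
# sequence above, spliced together. This is not the full CDS.
head = "GTGCACATCC CCGTTCCGCC GGGAGGCCGG"
tail = "CCCAAGCGCG GCTGA"
fragment = join_blocks(head + " " + tail)

print(fragment[:3], fragment[-3:], check_cds(fragment))
```

The gene sequence begins with GTG, which is read as methionine when it initiates translation in bacteria; this is consistent with the protein sequence starting with M.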