Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5170 |
Symbol | |
ID | 8735636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 5535945 |
End bp | 5538983 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646505795 |
Product | hypothetical protein |
Protein accession | YP_003396954 |
Protein GI | 284046614 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCAG GTCCGTTGCT CCTCCCGACG CTGGTCGCGT TGGTCACCTC GTCGTCGCTC GGAGCAGCGG CCGCCCCGGC GGTCGCGGCC GAGGGACCGG CCCCCGCTGC CGCAGCGGCG AGAGCCCGGC GCGCGGCCGC TCCGGTGGTC GCACCGCGGC CGGGCGCCGT CGTGCGGGCC CATCGCGTCC GCATCCGCGT GCGCTCGGGC GACGCTGCGG GCGCGTTGCG GGTCCGCCTC AACGGCGTCC CCGTCGGCGC CGACTTCGGC GCAAGCCGGG GCGGTGTGCG CACGCTCGTC ACGTCGGTCA GCCATGGGCT GCGCCGTGGC CGCAACGTGC TGCGCGTGAC CGTGCTGCAC TCCGGCCGGC CGCCGCGCAG CACGACGGTC CGCTTCACGG TGCGCACGAG ACGAGCGCTG ACCGGCGCCG GACGTGACCG CCGTGTCGCG ATCGGGGCGA CGGCGCCGAT GACCGGCAGC GCGCACGGGC TGCCGCGCGC GAGCCGCCCC TCCGCCCGCT GGCGCGTCGT CGCCGGCCCT GGCGTGCGGC GTACGCCCGC CGGCAGATCC CGCACGAGAG CGCGCAGCGC GCAGCGGTCC CTCGCGAACG GGCTGCCGCT CGCGGCGCTG ACGCCGCCGC TGCTGGCCTC GCCTGCCGGC CTCGCGGCGA GCTTCCGCCC CGTCGTACCG GGCAGATACA CGCTGCGGCT GACGAGCGGG AGCGGGGCCG CGAGATCGTC GGACGAGGTG ACGCTGACCG CCGTCCCGCG CACGCCGCTG GTGCCGATCG ACACGATGGC GGCCTTGACG GGCGACGGGC GGGGCATCCG CGTCGGGGAG ATGACCTACC GGCTGCGTGA CGCGCAGGGC GCGCCGCAGA GCACCGCCGT TCCGGCGCTG CAGGTGGTCG TGCTCGACCG CGACACGCTG AGCCCGATCT CGAACAAGGC CTACGGCAGC AGCCAGCGGG CGATCCCCGA GCTGAAGGCC CTTCACGACG ATGAGCAGCT CGTGATCGTC TCGCTGCAGG CGGGCGACGC GTCGACGAGC GGCGACTTCG AGACCATCGA GCAGGGGCTC GCGGGACTCG GCTTCCCGGC GGGCAGGCTG CCGATCCGAC CCGGGTCGTT CTCGGGCGTC GGGGTGCCGG GGATGGAGCC CGGCGAGGCG GACGTCAGCG TCGTGCCGTT CTCGACGGAC ACCGGCCGGA TCGCGACGTA CGGGCGGATG CAGGGCTATC TCAGCCCCGA CCAGCACCTC AACTACGGCT TCGTCCCGAG CGAGCGCGTC CCCTTCAGGT ACGTCGCTCC GGAGTCGCCC TGCGGCCCTG GCGCCTCCTG CGCCGGGTCG GTCGGCTTCC GCGTGCTCGT ACAGGACTGG AAGACCCTCA AGCCCGCGCG CGGCGGCGAC GGGATCGTCT ACCAGACCGG CCGGCCCGGC CTGAGCGCAG CGCAGCAGAG CGCCGAGGCC CAGAGCATGG TGAGAAACCT CGCGGCGGTC GAGGACGGCA ACCTCGTGAT CGTCCAGGCC GTCAGCACCC GCAGCGGCGC CGGCTACCTG CCGCCGATCG GGGAGATCGA CAGAGCGACG ATGACGGCGC TCGCCAGAAC GATCGCCGGC TTCGGCGGCA CGCGCAACGC CTTCAACCGG ATCGCCCGCA CGACCGGCCC GGCCGCCGGC GGCGGCTCGA CCTACACGCT CGTCGGCTGG AAGGACGCCG GCGAGGGCGA GGCCGCCGAG GTCGCGGCGA ACGTCGACGG CGCCGGCCCG GCGCCCGCGC TCAGCGGCGT CCTGCGGCCC GACCGCGAGT CGCGCTTCCG GCCGGCCGAG GTCTCCGCGA CGAGCAGACC CTCCGCCGAC CCGACCGCGC TCGCCTCACT CGTGATGCAG GAGCCGACGG GGGAGTGGCC GCTGTCGGAC CCCAGACATG CCGCCGCGAT CGCCTACATC GGCGGCACGC TGCCGCAGCT CGGCTCCGAC CCGCGCACCG CCTACTGGAC GCAGAGCTTC ACCCAGTCCG ACACCAACGC GCTGATCCGC GACGTCACGA ACACCGCGTA CAGAGCGAAC GACCGCTTCA GCGCGGGAGA GTTCGTCGAG GCGCGCGACC AGCTCGTGTT GGAGCTCGGT CTGGTCGGCA AGGTCCGCTC CTACCTCTCG AAGCTGCAGC AGCCGTTCTC CGAAGGCCAG CTGAGCCAGT GGGTGACGGC GCAGACGGTC GCCGACCGCG TGCTCGAGGA CGCCGAGAAC CCCGACGACG AGGTGACGCT GTCGTGGCTC GAGCTGACCG CGCAGGTGCT CGAGCTGCTC GGACCGGCGA CCGAGGAGGT CACCGGCGTG CTCGGCGGCC TGCTCGACAT CGGCATGTGG GGGTACGGCG CGACGAAGAG CGGCGGCCCC AGCGAGGGCG AGATGCGCGT GCACGCCGAC GCGTTCGGCA AAGCGCTCGT CGACAGAGCC GAGCAGGCGA AGGTGACGAT CGACCGGATG GGCGACGTGA TCGTCGGCGA CTACGCGAAG CTCGCGGTCG TCGGCAGATA CGGGAGCTGC GTCGACTGCC CGGCGAGATA CGCCTACCTG GCGCTCGACG GGAGAGACAT GTCGCGCAAC AGAGCGCAGA TCGACCGCGG CGTCCAGCGC CTCGCCTACC AGAAGCTGCT GCCGCTCGGC TTCCCCGTGA TGGCGCTGAC GCGCGTCGGC AGACACAACG ACTGGCCCCG CAGCACGGCG CCGGACGTCC GGGACTACGT CTGCAACGGG TACCAGCCGT GGAAGGACTA CGCGGGGAAC GCTTCCACCT CCCTGCTGCA GGAGCTCGAT CCGGCAGGCA GAGTGAACGC GTACGACACG TTCGTGATGT CACGTCCGCC CGGCATCGCG ACCGGGCACG GCACGCCGCC GTCGAACGAG CTGCTCGAGC TGATCTTCGG CGCCGAGAGA CTCGGCATGG ATCCGGGCGC CTTCATGGCG GCGGCGAGAC ACCGCACCTG GTTCTCCAGC GAGGACAGAG AGCGGACCGC CTGCTTCTGG CTGAGATAG
|
Protein sequence | MRAGPLLLPT LVALVTSSSL GAAAAPAVAA EGPAPAAAAA RARRAAAPVV APRPGAVVRA HRVRIRVRSG DAAGALRVRL NGVPVGADFG ASRGGVRTLV TSVSHGLRRG RNVLRVTVLH SGRPPRSTTV RFTVRTRRAL TGAGRDRRVA IGATAPMTGS AHGLPRASRP SARWRVVAGP GVRRTPAGRS RTRARSAQRS LANGLPLAAL TPPLLASPAG LAASFRPVVP GRYTLRLTSG SGAARSSDEV TLTAVPRTPL VPIDTMAALT GDGRGIRVGE MTYRLRDAQG APQSTAVPAL QVVVLDRDTL SPISNKAYGS SQRAIPELKA LHDDEQLVIV SLQAGDASTS GDFETIEQGL AGLGFPAGRL PIRPGSFSGV GVPGMEPGEA DVSVVPFSTD TGRIATYGRM QGYLSPDQHL NYGFVPSERV PFRYVAPESP CGPGASCAGS VGFRVLVQDW KTLKPARGGD GIVYQTGRPG LSAAQQSAEA QSMVRNLAAV EDGNLVIVQA VSTRSGAGYL PPIGEIDRAT MTALARTIAG FGGTRNAFNR IARTTGPAAG GGSTYTLVGW KDAGEGEAAE VAANVDGAGP APALSGVLRP DRESRFRPAE VSATSRPSAD PTALASLVMQ EPTGEWPLSD PRHAAAIAYI GGTLPQLGSD PRTAYWTQSF TQSDTNALIR DVTNTAYRAN DRFSAGEFVE ARDQLVLELG LVGKVRSYLS KLQQPFSEGQ LSQWVTAQTV ADRVLEDAEN PDDEVTLSWL ELTAQVLELL GPATEEVTGV LGGLLDIGMW GYGATKSGGP SEGEMRVHAD AFGKALVDRA EQAKVTIDRM GDVIVGDYAK LAVVGRYGSC VDCPARYAYL ALDGRDMSRN RAQIDRGVQR LAYQKLLPLG FPVMALTRVG RHNDWPRSTA PDVRDYVCNG YQPWKDYAGN ASTSLLQELD PAGRVNAYDT FVMSRPPGIA TGHGTPPSNE LLELIFGAER LGMDPGAFMA AARHRTWFSS EDRERTACFW LR
|
| |