Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5098 |
Symbol | |
ID | 8735564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5453179 |
End bp | 5455545 |
Gene Length | 2367 bp |
Protein Length | 788 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646505723 |
Product | hypothetical protein |
Protein accession | YP_003396882 |
Protein GI | 284046542 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.381891 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCT TGAATTCATG GCGGCGGAGC GTGCTGCTCG CCGCGATCGC TGCGACCTGC GCGGCGAGCG GCGAGACCGC GTACGCCGCC GGCGACGCGC CGGCCGGCGA CGGCCATGGC CCGCACCAGC GCAGCACCTG GAACGTGGCG CCGGACGGGC GCGGGCAGTC GTGCATGCGT TGGCGTCCGT GCCGGCTGGA GACGGCCCGC GAGCAGGTGC GCGCCGCCGC CCCGGCGATG ACGCGCGACC TGCGCGTCGA GCTGGCCGGC GGACGCTACG AGCTGTCGAG CACGTTCGTC CTTGGCGCGG CCGATTCGGG TCGCAACGGC CATGACGTCG TCTACGCGGC GGCGCCGGGC GCCGAGCCGG TCCTGACGGG TGGCCGCGAG GTGCGCGGCT GGAGACTGGT CGACCCGGCG AGGCGGATCT GGCGAGCGGC GGTCGGCCGG CTCGACACGC GTCAGCTCTA CGCCGACGAC GTCCGCCTGC GGCGCGCGGT CCACGTCCCG GGGCTGCCCG GCGACGTGAG ACGCACCGCG ACCGGGCTGC ACACGACGAG CGCCGAGCCG CGCACGTGGG CTGCCCCGAC CGACGTCGAG ATGGTCTGGC GGGCGAGAGG CGCGGGCGCC TTCGAGTGGG TCGAGGCGCG CTGCCGCGTG ACCGGCATGA GCGCCGACGC GGCCGGCACG GGGACCGACG TCAGCATCGC GCAGCCGTGC CTCGACAACG CGGTCGGGAT CTACGCCTCC AACGTCGCCG GCGGCGGCGA GGTGCTCGGC GCCCCGAGCT GGAGCGAGAA CAGCTCCACG TTCCTGGCCG ACGACACGCG GCCTGCCCGC TGGGCGATCG AACGCACGGC AGCGGGCAAC GCGATGCACT ATCGCGCGCG GCCGGGCGAG GATCCGCGTC GTCGCCGCTT CGTCGCGGCG ACCGGGGAGA CGCTCGTGCG GATCGGCGGA ACGGTCGCCG AGCCGGCGCA CGACATCGTC GTGCGCGGGC TGCGCTTCTC GGAGTCGACC TGGCTCGGAC CGGACACCGG CATGGGCTTC CCGAACCTGT TCGCGAACGC CTACGCGATC AGACCCGACG CGCTCAGCCC CGAGACCTAC CGCGGCGAGG CATTCCCGCC GGGCGCGCTG GAGCTGGACA AGACGCACGA CGTCCGCGTC GAGGGCAACC GCTTCGAGCG GCTCGGCGCG ATCGGCGTGC GGATCCAGAG CACGAGTCGC GCGACGGTCG ACGGCAACGT GCTGCGCGAC CTGTCCGCCG GCGGCGTCCT GATCGGCCGG CTCGGGCCCG ACCTCGAGGG CGAGGTCGAG GACAACGTGA TCTCGAACAA TGCGATCGAC GGCGTCGGCG CCGAGTACGG CGCCTCGCCT GGCGTCTTCC TCAACGTCCC GCGCCGCACG ACGCTGCGCA ACAACTTCAT CTCGAACACC GGCTACAGCG GCGTGACGAT GCGCGGCTCG TGGAACGGCT CGGGCACGAC CGAAGGCAAC AGATACGTCG ACAACTACGT CGCCAACGTG CTGCGGCGCG CCAGCGACGG CGGCGGCATC TACACCGTCT ACCCGCAGGG CACGTCGTGG GACAACGGGC TGCTGGTGGC CGGCAACGTC GTGCGCGACA TGCGCCCCGG GGCGCTCGGC TTCGGGCTCT ACAACGACAT CGGCAGCGAC TACTCGACCA GCCGCGACAA CGTCTCCTAC AACTTCCTGA TAGCCGGCGG CGGCTGCGCG CAGCCGTACC TGAACGAGAT CAGAAGCACC GGCAACTGGC TGCGCGGGAG ACTTCCCGGC GTGCCGGACG TGTGGTGGGT CTGCGACGGG CCGGTCACGA ACGTCGAGGT CGACAACCAC AGCCTCGACG CGGCCGATCC GGCGGCGGAC TGTGCGGCGA GACCGGCGTG CGCGGCGATC GTCGCGAACG CGGGGCTCCA GGCCGGCTAC GAGGACCGCC TGACCGGGAC CTACTCGTTC GACGACGTGC CGACGTTCGA GGACGTCAAC GCCGACTCGT TCGACGTCGA CAGCGTCCAC GCCGGGCTCG ACTTCGGCTG GGGGCAGTGG ATCGCCGCCT ACGGCACGCC GGGCTATGCG ACGACGTCGC ACGCCCGCTT CGCGACCGCG GACGCCGGGC CGCGCACGAT CGCGCCGGTG AGAGGCGTCG TGCTCTCGTC GCTGGTGCTC GAAGGCTCCG GCACCTACAC GATCAGCGAC GGCACGAGCA CCCGCAGCGG CACGCTGCGC GGGCCGGGAC GTCCGGTGCG CGTCCAGACC GGCTTCAGAA CGACCGGGGC GAAGGTCTCG CTCACGTTCA GCGATGGCCA GCAGGTCGTC GTCGACGACC TGCGCGTCTC GCGCTGA
|
Protein sequence | MSALNSWRRS VLLAAIAATC AASGETAYAA GDAPAGDGHG PHQRSTWNVA PDGRGQSCMR WRPCRLETAR EQVRAAAPAM TRDLRVELAG GRYELSSTFV LGAADSGRNG HDVVYAAAPG AEPVLTGGRE VRGWRLVDPA RRIWRAAVGR LDTRQLYADD VRLRRAVHVP GLPGDVRRTA TGLHTTSAEP RTWAAPTDVE MVWRARGAGA FEWVEARCRV TGMSADAAGT GTDVSIAQPC LDNAVGIYAS NVAGGGEVLG APSWSENSST FLADDTRPAR WAIERTAAGN AMHYRARPGE DPRRRRFVAA TGETLVRIGG TVAEPAHDIV VRGLRFSEST WLGPDTGMGF PNLFANAYAI RPDALSPETY RGEAFPPGAL ELDKTHDVRV EGNRFERLGA IGVRIQSTSR ATVDGNVLRD LSAGGVLIGR LGPDLEGEVE DNVISNNAID GVGAEYGASP GVFLNVPRRT TLRNNFISNT GYSGVTMRGS WNGSGTTEGN RYVDNYVANV LRRASDGGGI YTVYPQGTSW DNGLLVAGNV VRDMRPGALG FGLYNDIGSD YSTSRDNVSY NFLIAGGGCA QPYLNEIRST GNWLRGRLPG VPDVWWVCDG PVTNVEVDNH SLDAADPAAD CAARPACAAI VANAGLQAGY EDRLTGTYSF DDVPTFEDVN ADSFDVDSVH AGLDFGWGQW IAAYGTPGYA TTSHARFATA DAGPRTIAPV RGVVLSSLVL EGSGTYTISD GTSTRSGTLR GPGRPVRVQT GFRTTGAKVS LTFSDGQQVV VDDLRVSR
|
| |