Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_0603 |
Symbol | |
ID | 8731031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 636466 |
End bp | 639738 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646501216 |
Product | hypothetical protein |
Protein accession | YP_003392413 |
Protein GI | 284042073 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTG CACTCGCGGT GGTCGTCGTC ATCCTCGTCG TGGCGAGCCA GGCGCAGGCG GCCGCGGCCG CGCCCGCTCC CCGGCTCATC GTCGCTCCGG GCGCGGACGC GCTCGTCAAG CACGGCGCGG TTCGGGTCGT CGTCGCCCGC GACGCCGGCG CGAAGACACG CGTGCGGCTC AACGACCGCG ACGTGACCGC GCGCCTGCGT GTCGCGGGCG GGCGGCTCGT CGGCGAGCTC CGGCGCAGCG ACGGCCTGCG TCCCGGGCGC AACCACCTCG TCGCCGTCTC CAGAGCGGGC GGCAGGTCCG AGCGCCGCAG CGTCCGGTCG TTCTTCATGG TGCGCCGGAG CGCGAGGTTC GCCCGCGTGC GCCTCACCGG CCGCAACCCG GCGACGCTGA AAGTCGATGT CGTCTCCGGC GCGCCCCGCG CGGAGCTCGC GCGGCGCCAG CGGATCCTGC GGGTCCGGCT CAACGGGCGC TCCATCACCA AGGCGATGGT CGCTCGGACG GGCACGAGCT GGACGGCCTC GCTGTCGGGG ACCCACGGGC TGCGCCACGG CGTCAACCGG CTGAGCGTGC TCGTCGCCGA GCCGCGCGAC GGTCGCTACA CGTCCGTGCG CCGCCGCTTC ACGGTGCACC GCGACCGGCC GCTGGCGGCG GCGGGCGCCG ACCGCACGAC GCGGCCGAGC ATGCGCGTCC GCATCGGGGG TGCCCATCGT GCCGCGCGCG GCGGCCGGCT GACGTACCGC TGGACCCTCG TCGGCAAGCC GGCGAGGTCG CGGGCGACGA TCGGCAGATC CACGAGCGCG CGCCCCTCGC TGGTGCCCGA TCGTCCCGGT CGCTACGTCG CGCGGGTGCG CGTCACCGAG CGCCCGCGCG GCCGCGCGTC GGCCGCGCAG ACGACCCTGG GGACGACCGT CGACGACGCC GTGATGCAGG CGATCCCGAA GTCGACGCTG CTCGAGGTGG CGGCCAACGC GTTCCCCGCG GATCCGCGGG GGATCCAGCT CGGCAAGCCC GGCAACGGCG GCACGTTCTA CCCGCACACC GGTCCCGCGG CGACGATCCA ATGGCTGGAG CTCGATCGCC GGACGCTCCA GCCGACGGTC GCCGGCAACA CGTGGTGCTG CGACGACGCC GACCACTCGC TCGACAGCCT GACGGCGAAG CTCCAACAGT CGAGCCTGAC CACCCTCGTG ATGCTCGCGC TCCCGCCCCA GCGGAGCACC CTGCCGCCGA GCCAGTACAA GAGATTCAAC GACGCGCTGG CCGCGATCGG CGTGAACCCG CTCGACGACG TCGACGACCT CGACAAGCCG GGGCAGCAGA TCGTCGCCGT CGGCTTCCCG ACCGCCGGCC TGGGCAGCGG CTGGCTGATG CGCAGCCACA AGGGCAACCC GAGACTCGCC AAGCAGGGCT GGCTGATGCC GGACGCCGAC CTGAGCGCCG GCTACCGCTT CCAGCCGCTG CAGGTGCCGT TCAACACGAG CGCGGCGTCG ACGCCCACCT CGAACACGAT GACCTTCGGC GGCGAGTCGG TGACGAGCCC GCCGCTCGGG ACGGCGACCG GCTTCCACTT CGTCGAGGTC GATCCGGCGG ACCTCAGTGT CGTGAGAAAC CTCACGTTCG AGAACGACGC CAACGGCCGC GCGCAGCTCG CCAACGCGAT CAGCGCGGCG AGCGAGCTCG ACAACGTCGG CAACATGAAC GGGCGCGGCA ACTACGTCGC GCTGCAGAGC ATCGGCGGCT TCGCGCCCGC CAACCCCGGC TGGGACGACA AGGTCAGCCT CAGCCTCACG GCGATCGGCG CCAACCCGCA CTACTTCAAC TACCGCTCGC CGAGCTATGC GTTCTTCGGC GGGGCGCACC TCGGGTCCAA CGGGGCCGCG CAGTCCAACG CCGGCCTCGT GCTCGACGCG ACCACGGGTG CGAAGCAGCG CGGCACGCTG AGCGGCGAGG CGCGCATGGG CCCCGACGGG TTCTTCCTGC CGCCCACCGG GTCGGCGACG GACGGACCTG TCGACTCGCT GTACGACGTC ATCTTCAACG CCGAGCCCGC GCCGTGGCCG TACACGTCAG GGCCGGACGC GGGCGCCTAC CAGAAGGCGC TGGCGTACAT CTCGGGGGAG CTCACCGATC CGGGCGTGAA TCCCGCGCAC GCGAAAGATT TCACCAAGGT CAGCGACACG TTCAGGACCG ACATCCGGGG CGCGTACCTC GGCCTGCCCG ACTACGCGCA CTGGGACTCC GCCTCGAGTG TCCTGTCCGG CTCGGTGCAC TATCCCGGCG CGCAGGCGGG CGCGTGCAGA GGACCGCCGA GCGGAAGCGA CGAGCCCGGC TTCACGCTCA CCCAGTTCTG CAACCTCAAG GCGGCGCTCC TGAGCGAGTT CGCCGACCTC GACGCCACGT TGGACTACAA GGACAAGCTC GTGCAGGCGT TGCAGGTCGC CTCCAGCGGC GACCAGGCCC GGCTGATCAC CACCTTCACC ACGATCAAGG AGAAGGTCGA TCCCCCGGAG AGCGAGGTCG CCTCGCCGCT CCTGGGTCTG ATGGAGACGC TCACGGAGTT CGCCGACCTG GCCGAGGTGG CGTTCGCTCC GGAGGCGGCT GCCGCCGTGA GTCTCGCCGA GAACGTCGTG GAGCTGACGA GCGCGGTGTC GAACTCGGCG GACAAGCCGC TCGGCACGGC GCTCGACGAC ACCGTGCAGC AGCTCGGCGG GGACCTGAAC GGAGCGGTCT ACAGCGGCGT CGGCGGCCTG CTCGCCGTCC GCGCCGCCGC GATGTCGGAC CGGGGACGGC TGGAGGCATT GGCCGACGCC GCTCGCGCCG CGGGCGCGAT CTCGTCAGGC GAGATCACCG ACCAGCTCGT GAACGCGAGC GGGCGCTACT ACACCTCGGC GCTGATGGAG TCGCTCAAGT CGGGCAAGTA CCACGGGTTC CGGATCTTCA CCGACGGCAG CGGCCGTGAC GGCGGCCCCA AGCCCGACGA CTGCCGCTAC CGGTTCCGCG GCGCGCCCGA CGGCGCCTGG GTGCCGATCC TCTACCAGCT GGACCTCTGG TCGTCGCTGG TCTTCGGCTG GGACGGGGGC ACCTCGACGG ACTATCCGCC CGACGAGATC CTCAGACAGA TGTTCGAGTC CCCCAACCTC GGCGCCCCGA TCCCCGCGCT CGGCCGCGGC TACGGCATCG ACAAGTCGAC GTGGTTCTGG CAGCAGGCCG ACAACCTCAC GACGAGATTC CGCCAGTACG ACAAGTGCAG ATTCGGGTCG TGA
|
Protein sequence | MKRALAVVVV ILVVASQAQA AAAAPAPRLI VAPGADALVK HGAVRVVVAR DAGAKTRVRL NDRDVTARLR VAGGRLVGEL RRSDGLRPGR NHLVAVSRAG GRSERRSVRS FFMVRRSARF ARVRLTGRNP ATLKVDVVSG APRAELARRQ RILRVRLNGR SITKAMVART GTSWTASLSG THGLRHGVNR LSVLVAEPRD GRYTSVRRRF TVHRDRPLAA AGADRTTRPS MRVRIGGAHR AARGGRLTYR WTLVGKPARS RATIGRSTSA RPSLVPDRPG RYVARVRVTE RPRGRASAAQ TTLGTTVDDA VMQAIPKSTL LEVAANAFPA DPRGIQLGKP GNGGTFYPHT GPAATIQWLE LDRRTLQPTV AGNTWCCDDA DHSLDSLTAK LQQSSLTTLV MLALPPQRST LPPSQYKRFN DALAAIGVNP LDDVDDLDKP GQQIVAVGFP TAGLGSGWLM RSHKGNPRLA KQGWLMPDAD LSAGYRFQPL QVPFNTSAAS TPTSNTMTFG GESVTSPPLG TATGFHFVEV DPADLSVVRN LTFENDANGR AQLANAISAA SELDNVGNMN GRGNYVALQS IGGFAPANPG WDDKVSLSLT AIGANPHYFN YRSPSYAFFG GAHLGSNGAA QSNAGLVLDA TTGAKQRGTL SGEARMGPDG FFLPPTGSAT DGPVDSLYDV IFNAEPAPWP YTSGPDAGAY QKALAYISGE LTDPGVNPAH AKDFTKVSDT FRTDIRGAYL GLPDYAHWDS ASSVLSGSVH YPGAQAGACR GPPSGSDEPG FTLTQFCNLK AALLSEFADL DATLDYKDKL VQALQVASSG DQARLITTFT TIKEKVDPPE SEVASPLLGL METLTEFADL AEVAFAPEAA AAVSLAENVV ELTSAVSNSA DKPLGTALDD TVQQLGGDLN GAVYSGVGGL LAVRAAAMSD RGRLEALADA ARAAGAISSG EITDQLVNAS GRYYTSALME SLKSGKYHGF RIFTDGSGRD GGPKPDDCRY RFRGAPDGAW VPILYQLDLW SSLVFGWDGG TSTDYPPDEI LRQMFESPNL GAPIPALGRG YGIDKSTWFW QQADNLTTRF RQYDKCRFGS
|
| |