Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_4500 |
Symbol | |
ID | 8734964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 4797905 |
End bp | 4799962 |
Gene Length | 2058 bp |
Protein Length | 685 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646505127 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003396288 |
Protein GI | 284045948 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.491219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCCG TCCCAAGGCT GCTCGCGCTG CCGCTTGCCC TGATCGGGCT CGCCGCGGTC GCCCCCTCCG CTTCGGCGGC TGTCACCTTC GGTCCCATCC AGAGCCAGTC GGTCGGAGGG AACGACGTCT ACTCGTTCGC GGTCGACGAC TTCAACGGCG ACGGTCGTCC CGACGCGGCC CTGTCGCGCC GTGACTTCGC CAGCAACACC GACGCATATC AGGTGATCCG CTCGCGCCCC GGGGGAGCGT TCCACGCCCC GATCGGCCTG ACGCCCATCT CGCGCGCCGA CTACACCACG ACCGGCGACG TCAACGACGA CGGCCGGCCC GACATCCTCT CGGCCGACGC GTTCAGCGAC GAGATCGTCG CGCAGCTCAA CCGCGGCGGC ACGTCGTTCA GCGCTCCGGT GACGACCAAC AACGGCATCG GCGCCGCGAC GGGCATCGTG TCCGGCGACG TCGACGGCGA CGGCTTCGAC GACGTGGTCG TGGCGGCCAG CAGCGGCGAG ATCATCGTGA TGATCAGCAA CGGCGACGGG AGGTTCACCA GCACGCTCGC CGCGACGATC CCGGACGTGT ACCTGATGGA CCTCGCCGGC GGCGACTTCG ACGCCGACGG CGACCTCGAC CTGGCGGTGA CCGACTACGA CGCCGGGCTC GTCGTGCCGA TCGCGGGCGA CGGCGACGGC GGCTTCACTC CGCTCACCGG CGTCCCGTTG AGCACGTGCG CGTGCAACAA GGGATGGCCG GTCACGTTCT CCGACGTCGA CGGCGACGGT GACGAGGACA TCGTCGCGTC CTCCTACGGC TATCCCGAGG AGGAGAACCC AATGCTGACG CTGCGCTCCA ACGGCGACGG CACGTTCGCG CCGGTCCGCG GAACGACCCT GGCCGTGACC CAGGACGTCG CCACCGGCGA CCTCAACGGC GACGGGAACG CCGATGCGGT GGTGCTCGAT TTCCAATCGA CCGGCGTCGC GGTCGTCAAG CTCGGCAACG GCGACGGGAC GTTCGGCGCG GACACCAGCT TCACCGTCGG CAGCTTCCCC AACGACGTGG AGCTGCTGGA CTGGGATCTC GACGGGGACA CCGACATCGT GGTCGCCGAC GGCGACGGGA TCCTCCAGGT CCTGCCGAAC ACGAGCGTCC CGGCGATCTC CTCGAGCGGC GACGTCGCCT TCGGCGACCA GCCGATCCGG ACGATCAGCG AGCCCGAGGT CGTCACGATC ACGAACTCCG GCGACGCCGT GCTCCCCATC ACGAGCGTGA ACGCCGGCGG GACCGACGCC CGCGACGTGC TCGTCAGCGC CGAGGACTGC ACCGCCGCTC CGGTCCCCGC CGGGGACAGC TGCGAGATCG TCGTGCGGGT GATCCCTGGC GCGACCGGCG GGCGGACCGC GACACTCCTC GTCGCCAGCT CCGTCCTGCC GACCGCGACC GTCGCCGTCA CCGCGACCGG CACCGCGCTG CCCGCGGGGC CGCAGGGTGA AGACGGGCCG CAAGGCGAAC CAGGTCCCCA GGGTCCGGCC GGCCCCGGCG GCGCTACCGG CCCGACCGGC GCCACCGGAC CCGCCGGCGC CACCGGCCCG ACCGGCGCTA CCGGCGCTAC CGGCGCGACC GGCGCGACCG GCGCCACCGG CACGACCGGC ACGACCGGCC CCCACGGAGC GACCGGCCCC GCCGGCGCGA CCGGCCCCCG CGGCGCTATC GGCCCCGGCG GCGCGACCGG CGCCGCCGGC CCGACCGGCC CGGCGGGTGC TCGCGGCACC ACCACGGTCC TCGCCACCGT CCTCGCCGAG TCGCGCTTCA GCGTCCGCGC GAACAAGCGC AAGGTCGTCA GGTTCGGTGT CACGACCGCG AGCCGGGCGG TGGTGACCGT CACCAAGACG AAGGCGAAGA AGGCCGCTGC GACCATCGGC ACGACGCTGA GAAAGGCCGC CGCCAGCAGC GTCACCGTGC CGAAGCTCCC GCGCGGCGCC TACACGCTCA GGCTCATTGT CACCGCCCAC GACGGCACCA CCGCCACGGC CACCGCGCGG TACGTGGTCA CTCGCTAG
|
Protein sequence | MPSVPRLLAL PLALIGLAAV APSASAAVTF GPIQSQSVGG NDVYSFAVDD FNGDGRPDAA LSRRDFASNT DAYQVIRSRP GGAFHAPIGL TPISRADYTT TGDVNDDGRP DILSADAFSD EIVAQLNRGG TSFSAPVTTN NGIGAATGIV SGDVDGDGFD DVVVAASSGE IIVMISNGDG RFTSTLAATI PDVYLMDLAG GDFDADGDLD LAVTDYDAGL VVPIAGDGDG GFTPLTGVPL STCACNKGWP VTFSDVDGDG DEDIVASSYG YPEEENPMLT LRSNGDGTFA PVRGTTLAVT QDVATGDLNG DGNADAVVLD FQSTGVAVVK LGNGDGTFGA DTSFTVGSFP NDVELLDWDL DGDTDIVVAD GDGILQVLPN TSVPAISSSG DVAFGDQPIR TISEPEVVTI TNSGDAVLPI TSVNAGGTDA RDVLVSAEDC TAAPVPAGDS CEIVVRVIPG ATGGRTATLL VASSVLPTAT VAVTATGTAL PAGPQGEDGP QGEPGPQGPA GPGGATGPTG ATGPAGATGP TGATGATGAT GATGATGTTG TTGPHGATGP AGATGPRGAI GPGGATGAAG PTGPAGARGT TTVLATVLAE SRFSVRANKR KVVRFGVTTA SRAVVTVTKT KAKKAAATIG TTLRKAAASS VTVPKLPRGA YTLRLIVTAH DGTTATATAR YVVTR
|
| |