Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_3449 |
Symbol | |
ID | 8733898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 3675502 |
End bp | 3677493 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 646504066 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003395242 |
Protein GI | 284044902 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.344079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0780597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTGGC GCATGCGCAC GATCAGCACC CTTCTCGCCG CCGCAGCCGC GCTGCTGGCG CTCACCGCTC CGGCGACCGC AGCGACGTTC AGCCCCGTCG CCGGCTCGCC GTTCGCGACG AGATCTACCG ACACGATCGA CCTCGACCTC GCCGACTTCG ACCTCGACGG CCGCCTCGAC GCGGTCGTCG CGGATCGCGG CGCCAGCGAG CTCGCGGTCC TGCGCGGCGC CGCGGGCGGC GGATTCGGCG CCCCGGAGGT CACGCCGATC GCGGGCCCCG GGCCGGTGCG GGTACAGGCC GCTGACCTCG ACGACGACGG CTTCCCGGAC GCGATCGTGC AGCGGCAGAA CCAGTCGGAC GTCACGGTGC TGCGCGGGGA CGGCACGGGC GCGCTGACGC CGGTCGCCGG CTCGCCGTTC CCCGCCGCCG CGGAGATCTG GAGCATGGAC GTCGCAGACG TGAACGGCGA CGGCCGCCCT GACGTCGTCG CGGGGCTGCT CGACGGCCGC GTGCAGCCGC TGCTCGGCAC CGGCGGCGGC AGACTGAGCG CGGGAGCCGT CGTCGACGTC GGCGGGGGCA TGCTGCCGCT CGTCACGGCG GGGCGCTTCA ACGCCGGCAG ACGGGTCGAC CTCGTGCTGG CGAGCCGCGT CGCGGCGACC GTCACCGTCC TGCGGGGCAA CGGCGACGGC ACGTTCGCGC CGGTGCCGGG CGGCTCGCTC GGCGTCGGCG CCGACCCCAG CGCGCTGGTC GCCGAGGACT TCGACGGAGA CGGCGACCTC GACGTGGCGG CGAGCAGCCA GACGGACGGC ACCGTGACGG TCGCGCTCGG CGACGGCACG GGCAGACTGG TCGTCGGCTC GACGGTCGCG GTGACGCAGA CGCCGACGGA CCTCGCCGCC GGCGACGCCG ACGGAGACGG CGACCCTGAC CTCGCCGTGG CACGGTACGA CGCGAGCCGG GTCCAGTTGC TCGTCAATGC CGGCGACGGC ACGTTCTCGA CCGACCTCGA CCCGCAGACG CCGGTCGTCG CCGATCCGAT CGCCCTGACG GCCGGCGACC TCGACGGCGA CGGGCTCGCC GACCTGCAGG TGCTCGGGCG CGGCCTCGTC GGCTCGCTGC GCAACGAGAG CGTCGCCGCC GTCACCGTCG ACCAGCCCGC GCTCGGCTTC GCCGACCAGG CGGTCGGGAC GCTCGGCGGC GGGCAGACCG TGAGAGTGAC GAGCGCGGGC GCCCTGCGCC TGGACGTCGA GCAGCTGCGG CTCGACGGCG CGGCCGACGA CTTCCTTCTG CTGCAGGACA CGTGCAGCGG GCGCTCCTTC GCACGCGGCG CGTCCTGCTC GGTGCGCGTC CGGTTCGCGC CGACCGCCGC CGGCCCGCGC GCGGCGACGC TCGTCGTCGA GAGCGACGCC GCCGGCTCCG CGCCGGCCGT CGCGCTGTCG GGGACCGGGA TCCTGATGCC GCCTCCGGTC GGCGGGAGAG ATGGCGCCGA CGGGAGAGAC GGCGCGGCCG GCAGAGACGG TGCGGCCGGC AGAGACGGTG CGGCCGGCAG AGACGGTGCG GCCGGGCCCA GCGGCGCGGC GGGCAGAGAC GGCGCGATCG GACCCAGCGG CCGGCGTGGT CGAGACGGTG CGGCCGGCCC CAGCGGCCCT GCCGGGCCTC GCGGCCCCGC GGGGCCGGAA GGCCCCCGGG GACGGCCTGG AGCACCCGCA GGAACGGCGC CGGCGGCCGC TTGCACGCGA GCAGGCACCC GCGCCGTGCG CTGCAGCATC ACGCTCCGCG GCGCCCTGGC GGGCCGGCGC GGCGCGGTGG CGGTGCGACT GCTGCGCGGC CGCGCCCAGG CCGCGGCGAC GCGGGGACGC GCCCGTGGCG GCCGCGTGTC CGTGACGCTG CGCTCACGCC GCGCGCTCCG CCGCGGGCGC TACCAGCTCG TCGTCGCGGC GAGCGCCCGC AAGGGCGCAC CTGTGAGCCG GGCGTGGGTG ACGATCCGCT GA
|
Protein sequence | MIWRMRTIST LLAAAAALLA LTAPATAATF SPVAGSPFAT RSTDTIDLDL ADFDLDGRLD AVVADRGASE LAVLRGAAGG GFGAPEVTPI AGPGPVRVQA ADLDDDGFPD AIVQRQNQSD VTVLRGDGTG ALTPVAGSPF PAAAEIWSMD VADVNGDGRP DVVAGLLDGR VQPLLGTGGG RLSAGAVVDV GGGMLPLVTA GRFNAGRRVD LVLASRVAAT VTVLRGNGDG TFAPVPGGSL GVGADPSALV AEDFDGDGDL DVAASSQTDG TVTVALGDGT GRLVVGSTVA VTQTPTDLAA GDADGDGDPD LAVARYDASR VQLLVNAGDG TFSTDLDPQT PVVADPIALT AGDLDGDGLA DLQVLGRGLV GSLRNESVAA VTVDQPALGF ADQAVGTLGG GQTVRVTSAG ALRLDVEQLR LDGAADDFLL LQDTCSGRSF ARGASCSVRV RFAPTAAGPR AATLVVESDA AGSAPAVALS GTGILMPPPV GGRDGADGRD GAAGRDGAAG RDGAAGRDGA AGPSGAAGRD GAIGPSGRRG RDGAAGPSGP AGPRGPAGPE GPRGRPGAPA GTAPAAACTR AGTRAVRCSI TLRGALAGRR GAVAVRLLRG RAQAAATRGR ARGGRVSVTL RSRRALRRGR YQLVVAASAR KGAPVSRAWV TIR
|
| |