Gene Information |
Locus tag | Cwoe_0893 |
Symbol | |
ID | 8731327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 935461 |
End bp | 937395 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646501510 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003392701 |
Protein GI | 284042361 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
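The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the record does not state either convention, but both agree with the values shown. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Cwoe_0893.
length_bp = gene_length(935461, 937395)
print(length_bp)                  # 1935
print(protein_length(length_bp))  # 644
```

Both results match the Gene Length and Protein Length rows, so the record appears to count the stop codon in the gene length but not in the protein length.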
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.216124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.507094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTGCC TTTCGCTCAG CAGCCTCGTC GCGCTGAGCT TCCCCGCCGG CGCGGGCGCG GGCGTAGCCT TCGCGCCGAG CGTCAGCCTT CCGACGGGCA CCGCGCCGAG TGCGGTCGCC AGCGCTGACG TCAACAGAGA CTGGCGTCCG GATCTCATTA CGGCCAACGA CGTCGTCGGG GACGACGTGT CGGTGCTGGT GAGCGCGGGC GCGGGCGTGT TCGCACCCGC CACCAACTTC TCGTTCGGGG CGTCTCCCGT GGCGATCGCG GCCGGCGACT TCGACTGGGA CAACCGGGTG GACCTCGTGA CCGTCGACCG CGGGTCCGAC ACGGTGTCGG CGCGCAGAGG GATGGTCGGC GGAACGTTCG GCGCGCCGAC CGTCATCCCG GTCGGCGCCG CCCCGGCTGG GCTCGTCCTC GGCGATGTCA CAGGCAACGG CGCTGTGGAC ATCGTGACCG CCGACGCCGG TGCGAACACG GTGACGGTGG TGCGCGAGGT CTCAGGGAGC GGGTGGGACT CCCCGGTGGG CTTCCAGGTC GGTGCCACGC CCAGGTCGGT CGCCGCGGGC GACCTCGACG ACGGCTACTA CCGCGACCTC GTGACAGCCA ACGCGGGTGG GAACTCCGTG TCTGTGCTGA TGAACACGAT GCCGTCCATG GCCGCGCCCA TCACGTTCGC GCCAGCCGTC GACTATCCGG CGGGCAGCGC CCCAAGCGCG GTCGCCCTCG GGGACTTCGA CCGCGATGGG CAGGTCGACA TCGCGACGGC CAATCCGTCG ACGAGCAGCG TCTCCGTGCT GCTGAACACC GGCAGCGGGT TCTCGGCGCC GACCAGCTTT TCCGTCGACG GCACGCCGCA GTCGATCGCG GTGGGGGACT TCAACAGCGA CATGACGCAG GACATCGCGA CCGGCAGCAC CGCCGAGAGC ACGGTGTCGG TGCTCGTGAG CGACCGAGCC GGCGGCTTCG CGGCCGCGCG CGTCTTCGGT GTCGGCGCCG CTCCGAAGGG GATCGCGGTC GGCGATTTCA ACGCGGACGC CGCGACCGAC ATCGCGGTCG CCTCATCTGG CGCGAACGCC GTGTCGGTCC TGATGAGCAA GCCGGCCGCC GACCTGAGCA GTGCGTCGCT GACGTTCGGC ACGGCCGGCG CGCCGGTCCC CCAGGGTGCG CTCAGCGGCA CGCAGGGCGT GACGATCACC AACGACGGCG GGGTGTCGAT GGCGATCGCG GGCTTCATGT TCGGCGGGGC GAACGCTGAG GACTTCCTCG TCTCCGCGGA CACCTGTCGC GAGGCGCTCG CGCCGGACCG CAGCTGCACC GTGTGGGTGC GTTTCGTGCC GCAGGCCCAG GGCACCCGTG CGGCGACGCT GAACATCGTC TCCAACGATC CGGCGAACCC GTCGGTCCAG CTGCAGGGCT TCGCCGGTCC GCCGCAGGGC GGCGCGACCG GGCTGCAGGG GCCGAAGGGC GACGACGGCG ACACCGGCCC GAAGGGCGAC ACCGGCGCGA AGGGCGATGC CGGCACGCCG GGCTCCAAGG GCGACGCCGG CGCGAACGGT GCGCGGGGCG ACGCGGGCGC GTCGGGCGCC AAGGGAGACA GAGGCACCAA GGGCGACGCC GGCCCGCAGG GTCCGGCGGG CAAGGCCGGT CAGATCCGGC TCGTGACGTG CACGACCCGG ACGGTCAGAG GCAGGAGAGT CCGCAGGTGC GAGACGAAGA CGATCGAGGG AAGCACCAGC TTCACGACCG CCGCGACCGC TCGCGCGTCG TTGACGCGCG ACGGCCGCGT GTACGCGACC 
GGCACCGCCA GCCGCTCCAC GGGCCTGCGT CTGCGCACGC GCGGCGCGGT GCCTGCCGGC CGCTACACGC TCACGCTCCG CTACCGACAG GCGCGGCAGC AGATCACGGT GACCGCGAAG GTCACGCTCC GCTGA
|
Protein sequence | MTCLSLSSLV ALSFPAGAGA GVAFAPSVSL PTGTAPSAVA SADVNRDWRP DLITANDVVG DDVSVLVSAG AGVFAPATNF SFGASPVAIA AGDFDWDNRV DLVTVDRGSD TVSARRGMVG GTFGAPTVIP VGAAPAGLVL GDVTGNGAVD IVTADAGANT VTVVREVSGS GWDSPVGFQV GATPRSVAAG DLDDGYYRDL VTANAGGNSV SVLMNTMPSM AAPITFAPAV DYPAGSAPSA VALGDFDRDG QVDIATANPS TSSVSVLLNT GSGFSAPTSF SVDGTPQSIA VGDFNSDMTQ DIATGSTAES TVSVLVSDRA GGFAAARVFG VGAAPKGIAV GDFNADAATD IAVASSGANA VSVLMSKPAA DLSSASLTFG TAGAPVPQGA LSGTQGVTIT NDGGVSMAIA GFMFGGANAE DFLVSADTCR EALAPDRSCT VWVRFVPQAQ GTRAATLNIV SNDPANPSVQ LQGFAGPPQG GATGLQGPKG DDGDTGPKGD TGAKGDAGTP GSKGDAGANG ARGDAGASGA KGDRGTKGDA GPQGPAGKAG QIRLVTCTTR TVRGRRVRRC ETKTIEGSTS FTTAATARAS LTRDGRVYAT GTASRSTGLR LRTRGAVPAG RYTLTLRYRQ ARQQITVTAK VTLR
|
| |
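The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted initiation codon and is read as methionine when it starts a CDS. The following is a minimal sketch of how the sequence fields could be sanity-checked once pasted into a script: it removes the 10-base block spacing, computes the GC fraction, and tests for a plausible start and stop codon. The helper names and the short list of start codons are assumptions made for illustration; table 11 permits further, rarer start codons that are not listed here.

```python
COMMON_STARTS = {"ATG", "GTG", "TTG"}   # common bacterial starts, not the full table 11 set
STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the first and last codons are plausible."""
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    triplets = codons(seq)
    return triplets[0] in COMMON_STARTS and triplets[-1] in STOPS


# Toy input only: the first codon and the last two codons of the record, joined for illustration.
toy = clean_sequence("GTG CGC TGA")
print(codons(toy))
print(looks_like_cds(toy))
```

To check the full record, pass the complete Gene sequence field through `clean_sequence` and compare `len(...)` and `gc_fraction(...)` against the Gene Length and GC content rows.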