Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2459 |
Symbol | |
ID | 8732902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2613855 |
End bp | 2614883 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646503075 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003394257 |
Protein GI | 284043917 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00356575 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCTATG TCCTGAGCCT TGCTCGCCGC CATGCGGTGG CGTTCCTGGC CGTCTTCCTC TTGCTCGGCG GCACCGCCTA CGCGGTCGCC GATCGCGTCG TGATGTCGAG AGCAACCCCC AAGATCTACG CGTGCGTCAC GGAGACGCAC GGCACGCTGA ACCTGTCGAG CGCGAAGGCG AGATGCGAGC CCGGCCAGCG CAAGATCTCG TGGAACGCAG AGGGTCAGCG CGGCCTGCCC GGCACTGACG GAGCGCGCGG CGCGAGAGGC GCGGCGGGCG CCGCCGGTCC CGCCGGTCCG GCGGGACCTG CCGGTGCCGC CGGTGCGAAG GGCGCGCCCG GTGAGAAGGG CGCGAAGGGC GACCGCGGCG AGACCGGCCC GGCTGGTGCG GGCGAGCGCG GCCCCGCCGG TACGCAGGGC GAGAGAGGCG CCGCCGGCGC CGTCGGTCCG ATCGGCCCGA TCGGCCCGCA GGGTCCGACT GGCCTGCAGG GGCCGGCCGG CACCAACGGC ACCGACGGCG CCAGCGCGAT GCTGACGGCG AGCGGACCGA CGAGCCCGAC GACGATCGCC GGCGGGGTGT CGGGCGACGT CGGCCTGCTG CCGCTGTCGG GTCAGCTGAC CGCTTCCAAC ACCAGCATCC TCATGGGCGG CACGCTGGAC GGGGACACGC CGGAGGTCCG GGCCGCGGCG CAGGTCTTCC CGCGCGATGG GACGCTGACC GCCATCGGCG GGACGTTCGT CGTGACGCAG GCGATGAGCC TCATCAGCAC GACGATCACC GTCGAGGCCC AGTTGTTCAC GGGCTCGGGC ACCACGCTGA CGCCCGTCCC GGGCGCGCAG TGCATCGCCG TGCCGGCGCT CACGGGCATC GTCTCGATCG GCGTTCTGTC GAGATGCATC ACGACCGGCC TCTCGATCCC GGTCACGGCG GGCACGCGTG GCGTGGTCGT CGTCACGATC ACCGCCGCAG GCATCCAGCT GACTCAGTCG GCCCAGCTGC AGAGCGCCGT CTCGCTCGCG GTGGCCTGA
|
Protein sequence | MSYVLSLARR HAVAFLAVFL LLGGTAYAVA DRVVMSRATP KIYACVTETH GTLNLSSAKA RCEPGQRKIS WNAEGQRGLP GTDGARGARG AAGAAGPAGP AGPAGAAGAK GAPGEKGAKG DRGETGPAGA GERGPAGTQG ERGAAGAVGP IGPIGPQGPT GLQGPAGTNG TDGASAMLTA SGPTSPTTIA GGVSGDVGLL PLSGQLTASN TSILMGGTLD GDTPEVRAAA QVFPRDGTLT AIGGTFVVTQ AMSLISTTIT VEAQLFTGSG TTLTPVPGAQ CIAVPALTGI VSIGVLSRCI TTGLSIPVTA GTRGVVVVTI TAAGIQLTQS AQLQSAVSLA VA
|
| |