Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1720 |
Symbol | |
ID | 8732160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1813134 |
End bp | 1814810 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 646502337 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003393522 |
Protein GI | 284043182 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.493637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0315117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCACCCA TCTCCGGCCT GCGCCGGTTC CTCCCAGGCG TCGTCGCCGC GTTCGCGGTC GTCCCCGCGA CCGCGACGGC CGCGGACGGC CCGGCCGTGA CGGTCCGCGT CGAAGGCGTC CAGAGTACGC TCGCCTCGCC GTTCGCCTCC GCCCAGGCGC CGCTCAGACG GATCGTGCTC GGCGCTGCGT CCGACGCCGC GGTCGACGCC CCGGCCAGCG TGTACTGCAA CGGCCTGCCG AGCGGGAAGG TCCCGGCCGA CTCGGTCGGC GCCGCGCTCG CGAGGGTCGA CGCGAGCTGG CAGTTCGATC CGTTCGGCTT CCCGTCGTCG ATGACGCTGC TGAGCGAGAG ACACGGTCTC GACGGCGTCA CGTTCGACGG CTCCGCCGGC GTCTGGAGCG TCTGGATCGG CCGCGACTAC CACAACCTCT CGCTCGAGTC GGGTGCTCCG TGCCAGCCGC TGGCCGACGG CGCGACGCTG CTGTTCCAGG CGTCCGAGCA GCGCAAGGCG ACGCCGGAGG AGATGTTCGC GACGCCGACG ACCCCGCTCG TCGAGATCGA CGGCGTGCCG CAGACGGTCC TGCGTGGGAC GTCGGTCCAG GTGACCGTCT CGACCTACGC GCCGAGCACG TGGGGCGGCG CCGCGATGCC GGGCGTCCGC GCTCCCGGCG CCGGCTTCCA CGTCTACCCC GACGGCGCGC CGGCGGACTT GATCGCCGAC GCCGCCGGCA ACGCGACGGT GACGCTCGAC GGGAGCGGGA ACGTCGCGAT CTCGGCGTAC CACGCCGACG CGTGGGGCGG AGAGGCCGGC ACGTTCCCGA CGCCGACGGG CAACTCCGGC CGCGCGCTGC CGCGGGTCGT CTGCGTCTTC GACCCCGACG CGATCGCCTC GCCGTGCGTC GGCAACCTCG TCGGCCCCGC GACCGCGCCG GACTTCGGCA CGCAGGCGCG CGAGACGCTC GGCGCCCCGC GCACGATCGC GCTCGGCTCG CAGCTCGGCA GAGTCGGCGT GACCGGCGTG AAGGTGGTCG GCAGCGGAGC CGACCAGGAC GGCGCGGACG ACTTCCTGCT CAGCTACGAC GGCTGCAGCG GGCGCACCGT CGACAGGGCG GCGCCGACCT GCTCGGTGCG TGTCCGTTTC GCGCCCTCCG TCGTCGGCCC GCGCAGCGCG ACGCTGCGCG TCGAGAACAG CGGCGGAGGC GGCACGCTCG ACGTGCCGCT GACGGGCACC GGCGGCGGCG CCGCGCCGGG CGCGCCCGGC GCGGACGGCG CCGACGGCGC GAAGGGCGAC AAGGGCGACG CGGGCGCGAA GGGCGACGCC GGTCCCGCCG GTGCGCCGGG CGCGACCGGC CCCAGCGGGC CGCAGGGTCC CGCCGGTGGC GTCGGGCCCA CCGGTGCCGC CGGGTCCACC GGCCCCGCCG GCCCGCAGGG CAAGCCGGGC AGAAACGGCA AGGACGCGAC GTGCACCGTG AAGCGCGGCA AGGGCGCGCC GAAGATCGTC TGCAGGCTCG TCAACGGCGC TGGCACGGCG CGCGCGTCGA TGACCCGCGG CGGCACGACC TACGCCCGCG GCACGGTCAG CTCGCTGCGC GCGACGCGTG CGGTGCGCGC CGGCAGCTAT CTCCTGCGCT TCCACGTCAA GGGCAAGCGC GTGACGCAGC CGGTGCGCGT CCGCTGA
|
Protein sequence | MSPISGLRRF LPGVVAAFAV VPATATAADG PAVTVRVEGV QSTLASPFAS AQAPLRRIVL GAASDAAVDA PASVYCNGLP SGKVPADSVG AALARVDASW QFDPFGFPSS MTLLSERHGL DGVTFDGSAG VWSVWIGRDY HNLSLESGAP CQPLADGATL LFQASEQRKA TPEEMFATPT TPLVEIDGVP QTVLRGTSVQ VTVSTYAPST WGGAAMPGVR APGAGFHVYP DGAPADLIAD AAGNATVTLD GSGNVAISAY HADAWGGEAG TFPTPTGNSG RALPRVVCVF DPDAIASPCV GNLVGPATAP DFGTQARETL GAPRTIALGS QLGRVGVTGV KVVGSGADQD GADDFLLSYD GCSGRTVDRA APTCSVRVRF APSVVGPRSA TLRVENSGGG GTLDVPLTGT GGGAAPGAPG ADGADGAKGD KGDAGAKGDA GPAGAPGATG PSGPQGPAGG VGPTGAAGST GPAGPQGKPG RNGKDATCTV KRGKGAPKIV CRLVNGAGTA RASMTRGGTT YARGTVSSLR ATRAVRAGSY LLRFHVKGKR VTQPVRVR
|
| |