Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_4499 |
Symbol | |
ID | 8734963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4796824 |
End bp | 4797804 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646505126 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003396287 |
Protein GI | 284045947 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0524945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCAC GCTCGGTCTT CCCCCGCTCC GTCGCCGCCG TCTGCGTCGC TGCACTGCTG TTCGGTGCGT CGAGCACTGT GTCGCCTGCG ACCGCCGCCA CCGACTCGTC ACGCGTCTAC GCGTGCGTCG CCGGCAACTC GAAGACGTTG CAGCTGACGA CGAAGAACGC GCACTGCCCG CGCGGCCAGA CGAAGATCGT GTGGAACGTC ACCGGCGTCG AGGGGCCGCG CGGACCGAAG GGCGGCACGG GCTCCACCGG CGCGAGAGGC AACGCCGGCG CCAAGGGCGA CGTGGGCGCG AGAGGCGACG CGGGCGCCAG GGGTGACGCC GGCCCGGCCG GCGCGGCCGG CCCGACCGGT TCCGCGGGCG CGAACGGTGC CCCGGGCGAC GCCGGCCCGA CGGGCGACAC CGGTTCCGTC GGTCCCGCGG GGCCGACGGG CGGCCCCGGC CTGACCGGTC CCCCCGGCCC CACCGGTCCC GCCGGCCCGA CCGGTCCCGC CATGCTCATG AGCGGCGGCC CGGTGACAGT CAGCAGCGCG CTCGGCGGCC TGCCGTTGGC CACCACGCTG CTCCCGGTCT CCGGCCTGCT CGGCTCCAGC AGCTCCACCA ACACCACGTC CTACCCGCCG TCCAGCCTCG ATCCCGCGAT CATGGCGGCG GCGCAGATCG TCCCGGCGAA CGTCACGATC ACCGGCTTCC GCTTCGCGTA CACGAACGCG GTGGCGGCGT TCCCGTCAAG CACCCCCGCC TTCGAGGTCT CGCTGTACAT GGGCCCGCCC GGGGGGCCGT ATTCCCAGAC GCCCGTCAGC TGCAGCCTCT TCGGCACGAG TCCGATGCAG CTCGGGTACA CGGCTTCGTG CAGCGGCACC GCGTCGCTCG AAGTCTCGGC AGGCTCGATG CTCTACATCG GCGTGGTCTC GACGACCGAC GAGTCGACGA CGTTCACGGG CCAGGTCACG GCAGGCCTCA CCACGAACTG A
|
Protein sequence | MSARSVFPRS VAAVCVAALL FGASSTVSPA TAATDSSRVY ACVAGNSKTL QLTTKNAHCP RGQTKIVWNV TGVEGPRGPK GGTGSTGARG NAGAKGDVGA RGDAGARGDA GPAGAAGPTG SAGANGAPGD AGPTGDTGSV GPAGPTGGPG LTGPPGPTGP AGPTGPAMLM SGGPVTVSSA LGGLPLATTL LPVSGLLGSS SSTNTTSYPP SSLDPAIMAA AQIVPANVTI TGFRFAYTNA VAAFPSSTPA FEVSLYMGPP GGPYSQTPVS CSLFGTSPMQ LGYTASCSGT ASLEVSAGSM LYIGVVSTTD ESTTFTGQVT AGLTTN
|
| |