Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_0713 |
Symbol | |
ID | 8752370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 756286 |
End bp | 757254 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003407856 |
Protein GI | 284989302 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAGT CCAACAGCAC CAACAGCACC ACACGCAGCA AGACCGTTGC GGTTCCCGTC GACGTCACCG TCGAGGAGGG TGACCGCGGT GTGAAGGGCC CGACCGGACC GGTCGGCCCG GCCGGCGAGC CCGGCGAGAA GGGTGCCAAG GGCCCGGTCG GCCCCGCCGG CGAGCAGGGC GAGAAGGGTG CCAAGGGCCC GGTCGGCCCC GCCGGCGAGC GCGGCGACAA GGGCGCCAAG GGCCCGATCG GCCCGGTCGG TCCCGTCGGT GAGCGCGGCG AGAAGGGCCC GATTGGCCTG CGTGGCGAGC AGGGTCCGCA GGGTGAGACC GGCCCGCAGG GCCAGCAGGG ACCCGAGGGG GAGACCGGCC CGCAGGGCGA GCAGGGGCTG CCCGGCTCCA CCGGTCGCCC CGGTGAGCAG GGGGAGCAGG GCCCGCAGGG CCTGCCCGGA CTGCGCGGCG AGACCGGCCC GCGAGGCGTG GTCGGTGCCC CGGGTGAGCA GGGACCGCAG GGCCTGCGCG GCGCCCAGGG CCCGCGCGGT GTGATCGGCG AGACCGGCGA GCAGGGCGAG CAGGGCCTGC CCGGCGTGCA GGGCCTGCCC GGTGTGCAGG GCGAGCAGGG TGCTGCTGGT GTCCGCGGCC TGCAGGGCCC GCAGGGCGAG CAGGGCCCGG CCGGTCCGGC CGGCCCGGTG GGCCTGCCCG GCGTGCAGGG TCCGCAGGGC GCCTCTGGTC CGGCCTGGCT CTACCAGCCC AGCCCTGCCG AGCTGGCGTC CAAGGTCAAG CAGGCGGCGC TGGATGTCTC GCCGCGCACC GTGTTCGACC CGCGGGTGGG CGCCCGTTAC CTCGGGACGC TGGCCTCGCG GCTGATTAAG CTGCAGGGTG GGCTGGTCCT GCGCGCCATC GTCGACACCG ACGCCGCGGC CAACAACCAG GCCGGCGCCG ACCTGTCCAA CGTGCGCTCC ATCGCCTGA
|
Protein sequence | MAQSNSTNST TRSKTVAVPV DVTVEEGDRG VKGPTGPVGP AGEPGEKGAK GPVGPAGEQG EKGAKGPVGP AGERGDKGAK GPIGPVGPVG ERGEKGPIGL RGEQGPQGET GPQGQQGPEG ETGPQGEQGL PGSTGRPGEQ GEQGPQGLPG LRGETGPRGV VGAPGEQGPQ GLRGAQGPRG VIGETGEQGE QGLPGVQGLP GVQGEQGAAG VRGLQGPQGE QGPAGPAGPV GLPGVQGPQG ASGPAWLYQP SPAELASKVK QAALDVSPRT VFDPRVGARY LGTLASRLIK LQGGLVLRAI VDTDAAANNQ AGADLSNVRS IA
|
| |