Gene Information |
Locus tag | Gobs_1304 |
Symbol | |
ID | 8752968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 1372586 |
End bp | 1373545 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003408412 |
Protein GI | 284989858 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
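The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the 960 bp gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 1372586
END_BP = 1373545
GENE_LENGTH_BP = 960
PROTEIN_LENGTH_AA = 319


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 960
print(expected_protein_length(GENE_LENGTH_BP))  # 319
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows of the record.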
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.174683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAGT CCAAGAGCAC GACACGCAAC GAGACCGTTG CGGTCCCCGT CGACGTCACC GTCGAGGAGG GCGACCGCGG TGTGAAGGGC CCGACCGGTC CGGTAGGCCC CGCCGGCGAG CCCGGCGATA AGGGTCCCAA GGGCCCGGCC GGCCCTGTCG GCGAGCGGGG TGACAAGGGC GAGAACGGCC CGGTCGGCCC CGCTGGCGAG CCCGGTGACA AGGGCGAGAA GGGCCCGAGC GGTCCGATCG GCCCGGTCGG GGAGCGCGGC GAGAAGGGAC CGATCGGACC GCGCGGCGAG CAGGGCCCTC AGGGCGAGCC CGGCCCGCAG GGCGAGCAGG GGCCCGAGGG GGAGACCGGT CCGCAGGGCG AGCAGGGTCT GCCCGGCTCC ACCGGTCGCC CCGGCGAGCA GGGTGAGCAG GGCCCGCAGG GCCTGCCCGG GCTGCGCGGC GAGACCGGCC CCCAGGGCGT GGCCGGTGAG CAGGGGCCGC AAGGTGAGCA GGGCCTCCGC GGCGCCCAGG GCCCGCGCGG TGTGATCGGC GAGACCGGCG AGCAGGGTGA GAAGGGCCTG CCCGGTGTGC AGGGCCTGCC CGGTCTGCAG GGCGAGCGGG GTGCTGCTGG TGTCCGCGGC CTGCAGGGCC CGCAGGGCGT GCAGGGTCCG GCCGGTCCGG CCGGTCCGGT TGGTCTGCCC GGCGTGCAGG GTCCGCAGGG CCCGGCCGGA CCGGCCTGGC TCTACCAGCC CAGCCCCGCC GAGCTGGCGA ACAAGGTCAA GCAGGCGGCG CTGGACGTCA CGCCGCGCAC CGTCTACGAC CCGCGGGTGG GCGTCCGCTA CCTGGCGACG CTGACCTCGA CGCTGGTCAA GCTGCAGGGT GGGCTGTTCC TGCGTGCCAT CGTCGACACC GATGCCGCGG CGAACAAGCA GTCGGGCGCC GACCTGTCCA ACGTGCGCTC CATCGCCTGA
|
Protein sequence | MAQSKSTTRN ETVAVPVDVT VEEGDRGVKG PTGPVGPAGE PGDKGPKGPA GPVGERGDKG ENGPVGPAGE PGDKGEKGPS GPIGPVGERG EKGPIGPRGE QGPQGEPGPQ GEQGPEGETG PQGEQGLPGS TGRPGEQGEQ GPQGLPGLRG ETGPQGVAGE QGPQGEQGLR GAQGPRGVIG ETGEQGEKGL PGVQGLPGLQ GERGAAGVRG LQGPQGVQGP AGPAGPVGLP GVQGPQGPAG PAWLYQPSPA ELANKVKQAA LDVTPRTVYD PRVGVRYLAT LTSTLVKLQG GLFLRAIVDT DAAANKQSGA DLSNVRSIA
|
| |
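The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. The helper names below (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard table; table 11 differs in which codons may act as start codons. This sketch does not special-case alternative start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acid assignments for translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGGCGCAGT CCAAGAGCAC GACACGCAAC"
print(translate(prefix))    # MAQSKSTTRN
print(gc_fraction(prefix))  # 0.6
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would give the complete translation and the overall GC fraction for comparison with the record.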