Gene Information |
Locus tag | Gobs_3591 |
Symbol | |
ID | 8755276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 3774656 |
End bp | 3775906 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003410548 |
Protein GI | 284991994 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACGGC TGTTGCGCAT CGCACTGCTG TCCTACCGCA GCAAGCCGCA CAGCGGCGGC CAGGGCGTCT ACGTCCGGGC CCTCTCCCGC GAACTCACCG CCCTCGGTCA CGGGGTCACC GTGTTCAGCG GCCAGCCCTA CCCCGAGCTG GACGACGGCG TGCCGCTCAC CCGCGTGCCG AGCCTGGACC TGTACCGGGA GCCCGACCCG TTCCGCACCC CGCGGCCATC GGAGTTCCGC GACCGGATCG ACGTCCTCGA GTACGCCACG ATGTGCACGG CCGGCTTCCC CGAACCCCTG ACGTTCAGCC TGCGGGCGGC CCGGCTGCTG CTGCCCCGCG CTGCCGAGTT CGACCTCGTG CACGACAACC AGAGCCTGGG CACGGGCCTG CTGCAGCTCA CCCGCGCCGG CGTGCCGACC GTGGCCACCG TGCACCACCC GGTCGCCATC GACCGCGACC TGGAGCTCGC CGCCGCACCC TCGCTGCGCC GCCGGCTGAC GCTGCGCCGC TGGTACGGCT TCACCCGCAT GCAGGCCCGC GTGGCCCCGC AGCTGGACGG CGTCACCACC GTCTCGGAGA ACTCCCGCCG CGACATCGAG ACCCACCTCG GCGTCCCGGC GGACGCCATC CGGATCGTCC CGGTGGGCAT CGACCCCGAC GTCTTCACCC CGCCGCCGGC CGACCGCTCC CGCGACCCGG ACTCCATCGT GGTCACCACC AGCGCCGACG TCCCGCTTAA GGGGCTGGTG CACCTGCTCG AGGCGGTCGC GAAGCTGCGC ACCGAGCGGC CGGTGCGGCT GACCGTCGTC GGCACCGCGC GCCCGGGCGG CCCGGCGGAG GCCGCCCTCG ACCGGCTGGC GCTGCGCGAC GCCGTCCGCT TCACCGGCCC GCTGCCCGAG GCCGACCTCG TGCGGCTACT GCAGGGTGCC GCCGTCGTCG CCATCCCCTC GCTCTACGAG GGCTTCTCGT TGCCGGCGAT CGAGGCGATG GCCTGCGGCA CCGCGCTGGT CACGACCGAC GCCGGGGCGC TGCCGGAGGT CGTGGGCAGC AAGGCCGGCC TGCGGGTGCG CGCCGGGGAC GTCGGCGAGC TCACCGCGGC GCTGCAGCTG GTCCTGGACT CGCCGTCCTT CGCCGACCAG CTGGGCCGGG CCGGCCGGCG GCGGGTGCTG GCCTCCTACA CCTGGCGGTC GGCCGCCGAG CGGACGGCGG AGTGGTACCG CGAGGTGCTG GAACGGAAGG CGCGCCCGTG A
|
Protein sequence | MGRLLRIALL SYRSKPHSGG QGVYVRALSR ELTALGHGVT VFSGQPYPEL DDGVPLTRVP SLDLYREPDP FRTPRPSEFR DRIDVLEYAT MCTAGFPEPL TFSLRAARLL LPRAAEFDLV HDNQSLGTGL LQLTRAGVPT VATVHHPVAI DRDLELAAAP SLRRRLTLRR WYGFTRMQAR VAPQLDGVTT VSENSRRDIE THLGVPADAI RIVPVGIDPD VFTPPPADRS RDPDSIVVTT SADVPLKGLV HLLEAVAKLR TERPVRLTVV GTARPGGPAE AALDRLALRD AVRFTGPLPE ADLVRLLQGA AVVAIPSLYE GFSLPAIEAM ACGTALVTTD AGALPEVVGS KAGLRVRAGD VGELTAALQL VLDSPSFADQ LGRAGRRRVL ASYTWRSAAE RTAEWYREVL ERKARP
|
| |
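The length fields in this record agree with each other under two common conventions, both assumed here rather than stated by the source: coordinates are 1-based and inclusive, and the final codon of the CDS is a stop codon that is not counted in the protein length. The sketch below checks that arithmetic and shows one way a GC percentage might be computed from a sequence printed in blocks of 10. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percent G+C, ignoring the whitespace that groups bases in blocks of 10."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Start bp and End bp as listed in the record above.
    length = gene_length(3774656, 3775906)
    print(length)                           # 1251
    print(expected_protein_length(length))  # 416
    # Only the first two 10-base blocks of the gene sequence, for illustration.
    print(gc_percent("ATGGGACGGC TGTTGCGCAT"))
```

Passing the full gene sequence string to `gc_percent` would give a value to compare against the 75% listed in the record. The record may round that figure, so an exact match is not guaranteed.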