Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_4746 |
Symbol | |
ID | 8756447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 4957393 |
End bp | 4959435 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glycosyl transferase family 51 |
Protein accession | YP_003411657 |
Protein GI | 284993102 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.842414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCAT CGCACAGGTC AGGCCACACC GACCCCCGGC TGCGCGCGGC CGGCAAGCTC CTGCTGGCCC TCCCCCTGAC CGGGGCGTTG CTCGCCGCGC TGCTGGCGCC CTGGGTCGTC GGGCCGGGCC TGGTCGCCCG CACCTCGGCC ACGCTGCTCG CCCCGCTGCC GGTCGAGCTG ACCGACGCCA CGCCCCCCGG CAACACCGTC GTCCTGGCGT CGGACGGATC GGTCATCACG TACTACTACC GGAACAACCG GACGCCGGTG GGTGCGGACC GGATCGCCGA CGTCATGGAA CAGGCGCTGG TCGCCATCGA GGACGAGCGC TTCTACGAGC ACAACGGCCT CGACGTGCAG GGCACCCTGC GGGCCCTGGC GCGCAACGTG AGCTCCGGCG GCGTCGAGGA GGGCGGGTCG ACGATCACCC AGCAGCTGGT CAAGCAGACG CTGCTGCAGA CCGCCGTCAC CCCCGAGGAC CGGCAGGCGG CCACGGAGCA GACGGTGGGC CGCAAGCTGC GCGAGGCGCG GCTGGCCCTG GCGCTGGAGG AGACCCACCC CAAGGAGGAG ATCCTCACCC GGTACCTGAA CACGGTGTAC TTCGGCGAGG GCGCCTACGG CGTGCAGGCC GCCGCGCAGG CCTACTTCGG CGTCGACGCC GCCGACCTCA CCCTGCCGCA GGCGGCGATG CTGGCCGGCC TGGTGCAGAC CCCGACGGCG GACAGCCCGC TGGCCGACCC CGAGCGCGCC CGGGAGCGGC GCGACGTGGT GCTCGAGCGG ATGCAGGCAC TGGGGTTCAT CAGCGAGGAG GAACGCACCG AGGCCGCCGC CGGCCCGGTG GAGACGAACC CCGCCCCGGC CCCGCCCAAC GGCTGCGCCG GCGGCGGCGT CCTCGGCGGC TTCTTCTGCG ACTTCCTGCA GCAGCACCTC ACCGGCACCC TCGGGATGAC CCAGGAACAG CTCGAGGACG GCGGGCTGAC GATCCGCACC ACGCTGCGCG CCGACCTGCA GCGCTCGGCC GACGCCGCCG TCCTGGCCAC CCAGCCGATG GGCGACCCGG TGGCGGCCAT GTTCACGGCG GTCGAGCCCG GCACCGGGAA CGTACTGGCG ATGAGCGCCA ACCGCGTCTT CGGCTTTGAG GCCGCCGACC CGGCGCAGTC CTCGGTCAAC CTCAACGTGG TCGCCAGCAA GGGCTCGGGC TCCACGTACA AGGTCTTCGT CGCCACGGCC GCGCTCGAGG CGGGATTCCC GCCCTCGCAC ACGATCACCA CGAGCGACCC CTACACCTCG CGGGTGTACC GCAACGGCCT CGAGCCCTAC ACCGTCAGCA ACGTCGGGGA CGGCGGGTTC CCGCCGCGGC TGAGCATGTA CGAGGCGCTG GTCCGGTCCT CCAACACCTA CTTCGTGGCC CTGGAGGACG ACCTGGGCAG CGTCGAGGGG CCGGTGCGGG TGGCGCAGCG GCTGGGCCTG ACCTCCCTCG ACCCGGTGGC CGACACCGTC ATCGCACAGA ACCGCGGCTC GTTCACCCTG GGCCCGGAGG CGACCAGCCC GCTCGCACTG GCCAGCGCGT ACTCCACGCT CGCCGCCAAC GGCACCCGGT GCGCCCCCGT CCCGGTCGTG GAGGTGCTCG ACCGGGAGGG GCAGCCGCTG ACGCGCGAGG ACGGCAGCCC CCTGGGCACC GGCGCCTCCT GCACCCCCGA GGTGGTGTCA CCGCAGATCG CCACGACCGT GAACCAGATG ATGGTCGGCG GCTCCTCGCT GCCGTACGGC ACCGGCCGGC GCGCCGCGAT CCCCGGCCAC CAGATCGCCA CCAAGACCGG CACCGCCCAG GACCGCGCGT CGGTGTCCTT CGTCGGCTCC ACGCCGCGGT ACACGGCCAG CGTCATGCTG TTCAACCCGA CGGCGAACGT GGACGTGGGC GGGTTCGGCG GCAGCCGCGG CGGTCAGATC TGGCACGACG CCTTCCTGCC GGTGCTGTCG GCCGAGGACC CGGTGTTCTT CCCGCCGCCG GGCATCCCGC TCCCGCCGCT GCCAGGAGGA TGA
|
Protein sequence | MAPSHRSGHT DPRLRAAGKL LLALPLTGAL LAALLAPWVV GPGLVARTSA TLLAPLPVEL TDATPPGNTV VLASDGSVIT YYYRNNRTPV GADRIADVME QALVAIEDER FYEHNGLDVQ GTLRALARNV SSGGVEEGGS TITQQLVKQT LLQTAVTPED RQAATEQTVG RKLREARLAL ALEETHPKEE ILTRYLNTVY FGEGAYGVQA AAQAYFGVDA ADLTLPQAAM LAGLVQTPTA DSPLADPERA RERRDVVLER MQALGFISEE ERTEAAAGPV ETNPAPAPPN GCAGGGVLGG FFCDFLQQHL TGTLGMTQEQ LEDGGLTIRT TLRADLQRSA DAAVLATQPM GDPVAAMFTA VEPGTGNVLA MSANRVFGFE AADPAQSSVN LNVVASKGSG STYKVFVATA ALEAGFPPSH TITTSDPYTS RVYRNGLEPY TVSNVGDGGF PPRLSMYEAL VRSSNTYFVA LEDDLGSVEG PVRVAQRLGL TSLDPVADTV IAQNRGSFTL GPEATSPLAL ASAYSTLAAN GTRCAPVPVV EVLDREGQPL TREDGSPLGT GASCTPEVVS PQIATTVNQM MVGGSSLPYG TGRRAAIPGH QIATKTGTAQ DRASVSFVGS TPRYTASVML FNPTANVDVG GFGGSRGGQI WHDAFLPVLS AEDPVFFPPP GIPLPPLPGG
|
| |