Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_0089 |
Symbol | |
ID | 8751735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 98053 |
End bp | 100386 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycoside hydrolase family 65 central catalytic |
Protein accession | YP_003407270 |
Protein GI | 284988716 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.472584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGCCC CGATCCTGGA CGTCGACCCG TGGCTGGTGC GCGAGCCGCG CGTGGACCCG GGGACGCTCC CCGTCGCCGA GTCGCTCTTC AGCCTCTCCA ACGGCTACCT CGGCGTCCGG GGGACGCTCG ACGAGGTCGA GCCCTTCGGC ATGCGCGGCA CCTACCTCTC CGGGGTCTAC GAGACGCACC CGCTGTCCTA TCCCGAGGGC GGCTACGGGC AGCCCGAGGA GGGGCAGGCG ATGCTCACCG TCGCCGACGG CACGCCGCTG CGCCTGCTGG TCGACGGGGT CCCGCTCGAC CTGCGCGAGT TCGAGCTCGA CGTCCACGAG CGCACGCTGG ACCTGCGCGC CGGCACGCTC GACCGCCGCG TGCGCTGGAC GACGCCGTCG GGCACGTCGG TCGAGGTGTG CTCGCGGCGG CTGGTCTCCC TGGTGGAGCG GTCGGTCTGC GCGGTGCGGT ACGAGGTCCG CGTCCTGAAC CGGCCCGCGC ACGTCGTCGT CCGGTCCGAT CTGGCCGCCG GCGAGGTCAC GCCCGCGGGC GTGGACAACG ACGACCCGCG CGTCGCGGAG TGCCTGGACG AGGCCTTCGA GCCGCGCGCC CAGATCGGCA ACGCGAACGG TGGCGCCCTC GTGGAGCGGA CGCGCCGGTC GGGGATCACG GTCGCGACCG CGGTCGGTCA CGAGGTCGAC GGCGGGCGGG TGAGCACGCA CGTCGACGAG CAGCACGTCG TCACCACGGT CGCCGCCGAC CTGCGCCCCG GCGAGGGCCT CACGGTCGTC AAGGTCGTCG GGTACAGCTG GTCGCACGAC GCCGGGTCCG ACTCCGTCCT CGCCGAGGCG TCGGCGGCCG TGAGCTCCGC CTTCGACCTC GGGTGGGAGG GTCTGCTCGC CGGCCAGCGG GCCGCGCTCG ACGAGCTCTG GGCGACGGCG GACGTCGAGG TCGACGGCGA CCCCGAGCTG CAGCAGGCGC TCCGCTACAC CGTCTTCCAG CTGATCTCCT CCGCCTCCTG CATCTCGGGT GCCCCGGTCG GCGCCAAGGG ACTGACCGGG ATCGGGTACA GCGGTCACAC CTTCTGGGAC GTCGAGGGGT TCGTCGTGCC CGCCCTCACG CTGCTGCGCC CGGACGCGGC GGCACGGCTG CTGCGGTGGC GCGCCGCCAC TCTCGACCTC GCGCGGGAAC GGGCCGGGAC GCTCGGGCTG GCGGGCGCGT GCTTCGCCTG GCGCACCATC AGCGGGCGCG AGGTCTCCGC GTACTGGCCG GCGAGCACGG CGGCGATGCA CGTCAACGCC GACATCGCGC GCGCCTTCTG GCTGTACCAG AACGTCACCG ACAGGGACCT CGACTCCCTC GGGGGCCTCG CGGTGCTCGT CGAGACGGCG CGGTCGTGGC TCTCGGCGGG CCACGAGGAC GCGGCCGGCG CCTGGCACCT GTACGGCATG ACCGGGCCGG ACGAGTACAC CGGGGTCGTC GACGACAACG TCTTCACCAA CCTCATGGCC CGGCGCAACC TCCGGTGGGC CGCCGACGCG TGCGAGAGGC TGCCGGAGCG CGCGTCGGAG CTGGGCGTGG ACTCCGCCGA GGCCTCCGCC TGGCGGTCCG CGGCGGACGC CGTGCACGTC CCGTGGGACG AGCGCCTGCG CGTTCACCCG ATGAACGACA ACTTCACGAC CTACCGGGAG TGGCCCTTCG AGGACGAGCG CGACCACTAC CCGGTGCAGG AGCACCACCA CTACGCCGAC TTCTACCGGC GGCAGGTGCT CAAGCAGGCC GACCTCGTGC AGGCGCTCTG GTGGTGCCGC GACGAGTTCA CGGCCGAGGA GGTGGCGCGC GACCTCGACT ACTACGAGGC GCGCACCGTC CGGGACTCCT CGCTCTCGGC CGCCGTCCAG GCCGTCGTGT GCGCCCAGGC TCAGCACCCC GACCTCGCCC TGCGCTACCT GCGGGAGGCT GCCCTCGTCG ACCTGCGCGA CGTCCGGGGC GACACCCCGA ACGGCCTGCA CCTGGCCGCC GTGGGCGGGA CGTGGCTGGC GTTCGTCGCC GGCCTCGGTG GCCTGCGGGA GGACCACGAG GACCTCGAGC TCGCACCGCT GCTGCCGTCC TCGCTCTCGC GCACCGCCTA CTCCGTCACC TGGCGCGGCA GCCTGCTCCG CGTCGAGACG ACGCGGGAGG GCACCACGGT CACGCTGGTG CGCGGCGAGG AGCCGGTGAC CGTCGTCGTG GACGGCGCCC CGCTGTCGGT CACGCCCGCC GCCCCTGTGC ACGCGCCGTT GCGGGACCCG ACCCCGCTGC TCGACGAGCC GAGGCAGCCG GTCGGTCGCG AACCCCGCAC GTGA
|
Protein sequence | MTAPILDVDP WLVREPRVDP GTLPVAESLF SLSNGYLGVR GTLDEVEPFG MRGTYLSGVY ETHPLSYPEG GYGQPEEGQA MLTVADGTPL RLLVDGVPLD LREFELDVHE RTLDLRAGTL DRRVRWTTPS GTSVEVCSRR LVSLVERSVC AVRYEVRVLN RPAHVVVRSD LAAGEVTPAG VDNDDPRVAE CLDEAFEPRA QIGNANGGAL VERTRRSGIT VATAVGHEVD GGRVSTHVDE QHVVTTVAAD LRPGEGLTVV KVVGYSWSHD AGSDSVLAEA SAAVSSAFDL GWEGLLAGQR AALDELWATA DVEVDGDPEL QQALRYTVFQ LISSASCISG APVGAKGLTG IGYSGHTFWD VEGFVVPALT LLRPDAAARL LRWRAATLDL ARERAGTLGL AGACFAWRTI SGREVSAYWP ASTAAMHVNA DIARAFWLYQ NVTDRDLDSL GGLAVLVETA RSWLSAGHED AAGAWHLYGM TGPDEYTGVV DDNVFTNLMA RRNLRWAADA CERLPERASE LGVDSAEASA WRSAADAVHV PWDERLRVHP MNDNFTTYRE WPFEDERDHY PVQEHHHYAD FYRRQVLKQA DLVQALWWCR DEFTAEEVAR DLDYYEARTV RDSSLSAAVQ AVVCAQAQHP DLALRYLREA ALVDLRDVRG DTPNGLHLAA VGGTWLAFVA GLGGLREDHE DLELAPLLPS SLSRTAYSVT WRGSLLRVET TREGTTVTLV RGEEPVTVVV DGAPLSVTPA APVHAPLRDP TPLLDEPRQP VGREPRT
|
| |