Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_0309 |
Symbol | |
ID | 8751958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 329653 |
End bp | 331467 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | glycoside hydrolase family 2 sugar binding protein |
Protein accession | YP_003407481 |
Protein GI | 284988927 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.83969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGCACA CGGACCCGAC CGAACACCCC CGGCCGCTGC TGCGCCGTCC CTGGACCTCG CTCGACGGCG AGTGGGAGTT CGCCGCCGAC CCCGACCTCG TCGGCACCGT CGGCGGCATC GCGTTCGACC GCACGATCCG GGTGCCCTAC GCCCCGGAGA CACCGGCGAG CGGCGTGACC TGGCCCGGCC GGCTCAACCG CGCGTGGTAC CGCCGCCGGC TGCCCGGCCG CGCCGGCGAC CGGCGCACCG TCCTCCACTT CGGTGCCGTC GACCGCATCT GCGACGTGTG GGTCGGTGGC GCGCACGTCA CCTCCCACGA GGGCGGCTAC ACACCGTTCA GCGTCGACGT CACCGACTTC CTCGGGGACG ACGGCGCAGA CCTCGTGGTC CGGGCCGACG ACGACCCCCT GGACCTCGAG GCGCCGCGCG GCAAGCAGGA CTGGCGGGAC GAGCCGCACG AGATCTGGTA CCCCCGCACC ACCGGCATCT GGCGCACGGT CTGGCTGGAG CAGGTGGGGT CGCGGCGCAT CGCCGACGTC CAGTGGCGCG CCCACCCGCG GACGATGCAG CTCGACGTCC GCGTGGAGCT CGCCGCCCCC CTGGCCGGTG CCCGGCTGCA CCTGCGGCTG CGCGCCGGCG ACCGGCTGCT GGTCGACGAC TCGGTGCGCG TGGACGGCCG GGTCGTCGAG CGCACCGTGC AGGTGGGGGA CGGCGGCATC GACGACCGCA GCCGGCTGCT GTGGCGACCC GGGCCGGACC CGGTGCTGCT CGACGCCGAG CTGACGCTGG TCGCCGACGA CGGCGAGCTG CTCGACGAGG TCGAGAGCTA CACCGCGCTG CGCTCGGTCG AGGTCGGCGA CGGGCGGCTG CTGGTCAACG GCCGTCCGGC GCCGCTGCGG CTGGTGCTCG ACCAGGGCTA CTGGCCCGAC ACCGGTGCCA CCCCGCCCGA CGTCGCGGCG CTGCGGCGGG ACCTGGAGCT CACCCGGGCG CTGGGTTTCA CCGGGGCCCG CAAGCACCAG AAGACCGAGG ACCCGCGCTA CCTCGCCCTC GCCGACCGGA TGGGCCTGGT GGTGTGGGCG GAGATGCCGT CGGCGTACCG GCCCGGCCCG ACGGCGAGCG CCCGGCTGCT GCGCGAGTGG GCCGACGTGG TGGTCGCCCA CCGCGGGTTC CCGAGCGTGG TGGCGTGGGT GCCGCTCAAC GAGTCGTGGG GTGTGCAGGA AGCCGAGGTC GACGAGCGGC AGCGCGGGCT GATCCGGGCG ATGGCGGCGA CGGCGGACGC GCTCGACGGC ACCCGCCCGG TCTCGGCCAA CGACGGCTGG GAGACCCTCG GCGGCGACAT CCTGGGCGTC CACGACTACG AGCAGGACCC CGCGGTGCTC GGCGAGCGCT ACGCCACGGC CGAGGACCTC GAGCGGCTGG CCACCGGACG CCGTCCCGAC GGGTACCTGG CCGACCTGGA GCGGGCCGGC GTCGCGGGCC GGGCCGTCGT CCTCTCGGAG TTCGGCGGCG TGGCGCTGCG CTCCCCGGAG GACGAGGGCT GGGGCTACGC CGACGCCACC TCGCCGGAGG ACCTGCTGGC CCGCTACCGC GCCCAGTGGG CCGCGGTGCA CGGCAGCACC GCGCTCGCCG GCGCCTGCTG GACGCAGCTG ACCGACACGT ACCAGGAGGT CAACGGGCTG CTGGGCATGG ACCGGGTGCC CAAGGTCGAC ATCGAGGCGC TGCGCCGGGC CACCCTCGGT GAGCCGGAGG CACCGCCCGC GCCGCCCGCG CCGCCCGCCC CGCCCATCCA GCCGACCCGC CTCTCTCCGA GCTGA
|
Protein sequence | MEHTDPTEHP RPLLRRPWTS LDGEWEFAAD PDLVGTVGGI AFDRTIRVPY APETPASGVT WPGRLNRAWY RRRLPGRAGD RRTVLHFGAV DRICDVWVGG AHVTSHEGGY TPFSVDVTDF LGDDGADLVV RADDDPLDLE APRGKQDWRD EPHEIWYPRT TGIWRTVWLE QVGSRRIADV QWRAHPRTMQ LDVRVELAAP LAGARLHLRL RAGDRLLVDD SVRVDGRVVE RTVQVGDGGI DDRSRLLWRP GPDPVLLDAE LTLVADDGEL LDEVESYTAL RSVEVGDGRL LVNGRPAPLR LVLDQGYWPD TGATPPDVAA LRRDLELTRA LGFTGARKHQ KTEDPRYLAL ADRMGLVVWA EMPSAYRPGP TASARLLREW ADVVVAHRGF PSVVAWVPLN ESWGVQEAEV DERQRGLIRA MAATADALDG TRPVSANDGW ETLGGDILGV HDYEQDPAVL GERYATAEDL ERLATGRRPD GYLADLERAG VAGRAVVLSE FGGVALRSPE DEGWGYADAT SPEDLLARYR AQWAAVHGST ALAGACWTQL TDTYQEVNGL LGMDRVPKVD IEALRRATLG EPEAPPAPPA PPAPPIQPTR LSPS
|
| |