Gene Gobs_0309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_0309 
Symbol 
ID8751958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp329653 
End bp331467 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content76% 
IMG OID 
Productglycoside hydrolase family 2 sugar binding protein 
Protein accessionYP_003407481 
Protein GI284988927 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.83969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGCACA CGGACCCGAC CGAACACCCC CGGCCGCTGC TGCGCCGTCC CTGGACCTCG 
CTCGACGGCG AGTGGGAGTT CGCCGCCGAC CCCGACCTCG TCGGCACCGT CGGCGGCATC
GCGTTCGACC GCACGATCCG GGTGCCCTAC GCCCCGGAGA CACCGGCGAG CGGCGTGACC
TGGCCCGGCC GGCTCAACCG CGCGTGGTAC CGCCGCCGGC TGCCCGGCCG CGCCGGCGAC
CGGCGCACCG TCCTCCACTT CGGTGCCGTC GACCGCATCT GCGACGTGTG GGTCGGTGGC
GCGCACGTCA CCTCCCACGA GGGCGGCTAC ACACCGTTCA GCGTCGACGT CACCGACTTC
CTCGGGGACG ACGGCGCAGA CCTCGTGGTC CGGGCCGACG ACGACCCCCT GGACCTCGAG
GCGCCGCGCG GCAAGCAGGA CTGGCGGGAC GAGCCGCACG AGATCTGGTA CCCCCGCACC
ACCGGCATCT GGCGCACGGT CTGGCTGGAG CAGGTGGGGT CGCGGCGCAT CGCCGACGTC
CAGTGGCGCG CCCACCCGCG GACGATGCAG CTCGACGTCC GCGTGGAGCT CGCCGCCCCC
CTGGCCGGTG CCCGGCTGCA CCTGCGGCTG CGCGCCGGCG ACCGGCTGCT GGTCGACGAC
TCGGTGCGCG TGGACGGCCG GGTCGTCGAG CGCACCGTGC AGGTGGGGGA CGGCGGCATC
GACGACCGCA GCCGGCTGCT GTGGCGACCC GGGCCGGACC CGGTGCTGCT CGACGCCGAG
CTGACGCTGG TCGCCGACGA CGGCGAGCTG CTCGACGAGG TCGAGAGCTA CACCGCGCTG
CGCTCGGTCG AGGTCGGCGA CGGGCGGCTG CTGGTCAACG GCCGTCCGGC GCCGCTGCGG
CTGGTGCTCG ACCAGGGCTA CTGGCCCGAC ACCGGTGCCA CCCCGCCCGA CGTCGCGGCG
CTGCGGCGGG ACCTGGAGCT CACCCGGGCG CTGGGTTTCA CCGGGGCCCG CAAGCACCAG
AAGACCGAGG ACCCGCGCTA CCTCGCCCTC GCCGACCGGA TGGGCCTGGT GGTGTGGGCG
GAGATGCCGT CGGCGTACCG GCCCGGCCCG ACGGCGAGCG CCCGGCTGCT GCGCGAGTGG
GCCGACGTGG TGGTCGCCCA CCGCGGGTTC CCGAGCGTGG TGGCGTGGGT GCCGCTCAAC
GAGTCGTGGG GTGTGCAGGA AGCCGAGGTC GACGAGCGGC AGCGCGGGCT GATCCGGGCG
ATGGCGGCGA CGGCGGACGC GCTCGACGGC ACCCGCCCGG TCTCGGCCAA CGACGGCTGG
GAGACCCTCG GCGGCGACAT CCTGGGCGTC CACGACTACG AGCAGGACCC CGCGGTGCTC
GGCGAGCGCT ACGCCACGGC CGAGGACCTC GAGCGGCTGG CCACCGGACG CCGTCCCGAC
GGGTACCTGG CCGACCTGGA GCGGGCCGGC GTCGCGGGCC GGGCCGTCGT CCTCTCGGAG
TTCGGCGGCG TGGCGCTGCG CTCCCCGGAG GACGAGGGCT GGGGCTACGC CGACGCCACC
TCGCCGGAGG ACCTGCTGGC CCGCTACCGC GCCCAGTGGG CCGCGGTGCA CGGCAGCACC
GCGCTCGCCG GCGCCTGCTG GACGCAGCTG ACCGACACGT ACCAGGAGGT CAACGGGCTG
CTGGGCATGG ACCGGGTGCC CAAGGTCGAC ATCGAGGCGC TGCGCCGGGC CACCCTCGGT
GAGCCGGAGG CACCGCCCGC GCCGCCCGCG CCGCCCGCCC CGCCCATCCA GCCGACCCGC
CTCTCTCCGA GCTGA
 
Protein sequence
MEHTDPTEHP RPLLRRPWTS LDGEWEFAAD PDLVGTVGGI AFDRTIRVPY APETPASGVT 
WPGRLNRAWY RRRLPGRAGD RRTVLHFGAV DRICDVWVGG AHVTSHEGGY TPFSVDVTDF
LGDDGADLVV RADDDPLDLE APRGKQDWRD EPHEIWYPRT TGIWRTVWLE QVGSRRIADV
QWRAHPRTMQ LDVRVELAAP LAGARLHLRL RAGDRLLVDD SVRVDGRVVE RTVQVGDGGI
DDRSRLLWRP GPDPVLLDAE LTLVADDGEL LDEVESYTAL RSVEVGDGRL LVNGRPAPLR
LVLDQGYWPD TGATPPDVAA LRRDLELTRA LGFTGARKHQ KTEDPRYLAL ADRMGLVVWA
EMPSAYRPGP TASARLLREW ADVVVAHRGF PSVVAWVPLN ESWGVQEAEV DERQRGLIRA
MAATADALDG TRPVSANDGW ETLGGDILGV HDYEQDPAVL GERYATAEDL ERLATGRRPD
GYLADLERAG VAGRAVVLSE FGGVALRSPE DEGWGYADAT SPEDLLARYR AQWAAVHGST
ALAGACWTQL TDTYQEVNGL LGMDRVPKVD IEALRRATLG EPEAPPAPPA PPAPPIQPTR
LSPS