Gene Gobs_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_0234 
Symbol 
ID8751883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp244887 
End bp246155 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content76% 
IMG OID 
Product1, 4-beta cellobiohydrolase 
Protein accessionYP_003407410 
Protein GI284988856 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGGAAT CGGTCCACTA CTTTACGTGC ATGCCCGTCG TCACCACCGC GCCGCCCAGT 
ACCTCCGGCC GGCACCGGCC AGGTCGCGGC AGGCGACTGC TGTGGACGTC CGGGCTCGCC
GCGGCACTGG TCGGCACCGG CCTGGTCACG CCGCTGCTCA CCGGTCCCGA GGCGACGCCG
GCCGCGCAGC CGGCCATCGA GCAGGTCCGC AAGGTCCGAC CGGCGCCGAC GACGACGGCT
CCGGCTCCGG CTCCTGCCAT CGCCAGCCCC ACGACGAGCG CTCCCGCGCC GAGCAGCACG
ACGACCGCCG CGCCCTCGAC GCCCGCGAGC GCCGCTCCGA CGAGCAGCTC CGCCGCCGGT
GCGCCGAGCA CCACGACGCC CGCCGCACCG ACCTCGACCG CCGCCCCGGC CCCCTCGACG
GCCAACCCGC TCGCCGGGAT GACCTTCCAC GGCCCCAACA CCGGTGCGGC CCTGGCCGCG
GCGCAGCCGG GCCGCAGCCC CGAGGACGCC GCGGCGCTCG CCCAGCTGGC GGGCGTGCCC
ACGGCGACCT GGCTGGGGGC GTGGAGCGGA GACGTCACGG CGGCGGTCCG CCAGGAGGTC
ACCGCCGCCC GCGCGGCCGG GGCCGTGCCG GTCCTCGTCA CGTACAACGT CCCGGGCCGG
GACTGCGGCG GCTACTCAGC CGGGGGCGTG GACTCGTCGG CCGAGTACCT CCGCTGGGTG
CAGGCGGTCG CGGCCGGCAT CGGGACCGCG CAGGCGGTGG TGGTCGTCGA GCCCGACGCG
CTCGCGCTGC TGTGCGGCGA CCCGGCGCAG CGCCTGTCGC TGCTGCGGTC GGCAGTCGAG
GTGCTCGAGG CCAACGCCGG CACCCACACC TACCTCGACG CCGGGCACTC GACCTGGATC
GACGCCGCGA CGATGGCCGA GCGGCTTCGC GCCGCCGGGG TGACCGCCGC GGACGGCTTC
GCGCTGAACG TCTCCAACTT CCAGACGACC GCGAGCAACG TGGCCTACGG CCATCAGGTG
TCGTCGCTGC TGGGCGGCGC CCACTTCGTC GTGGACACCA GCCGCAACGG CAACGGCCCC
GGCAGCGACT GGTGCAACCC CCCGGGCCGC GCCCTCGGCG AGCGCCCGAC GGCGCAGACC
GGGCAGCCCC GGGTCGACGC GTTCCTGTGG GTCAAGCGAC CCGGCGAGTC CGACGGCACG
TGCAACGGCG GCCCGGCCCC CGGGACCTTC TGGGACGCCT ATGCCATCGG GCTGGTCCGG
GGCTACTGA
 
Protein sequence
MKESVHYFTC MPVVTTAPPS TSGRHRPGRG RRLLWTSGLA AALVGTGLVT PLLTGPEATP 
AAQPAIEQVR KVRPAPTTTA PAPAPAIASP TTSAPAPSST TTAAPSTPAS AAPTSSSAAG
APSTTTPAAP TSTAAPAPST ANPLAGMTFH GPNTGAALAA AQPGRSPEDA AALAQLAGVP
TATWLGAWSG DVTAAVRQEV TAARAAGAVP VLVTYNVPGR DCGGYSAGGV DSSAEYLRWV
QAVAAGIGTA QAVVVVEPDA LALLCGDPAQ RLSLLRSAVE VLEANAGTHT YLDAGHSTWI
DAATMAERLR AAGVTAADGF ALNVSNFQTT ASNVAYGHQV SSLLGGAHFV VDTSRNGNGP
GSDWCNPPGR ALGERPTAQT GQPRVDAFLW VKRPGESDGT CNGGPAPGTF WDAYAIGLVR
GY