Gene Gobs_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_1952 
Symbol 
ID8753623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp2023589 
End bp2025343 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content73% 
IMG OID 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_003409019 
Protein GI284990465 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCTG CCGTCGAGTT CGCCGTCTGG GCGCCGGTCC CGGAACGGGT CCGGGTGCAG 
GTCGACGGGT CGGTGCACGA GATGCGCCGG GAGGAGGGCG GCTGGTGGCG CGCCGAGGTC
GAGGCCGGTC CCGAGGCCGA CTACGGCTTC CTGCTCGGCG ACGACGACAC GCCCCGTCCC
GACCCGCGCT CGCGCCGCCA GCCCGAGGGC GTGCACGGGC TGTCCCGCCG CCACGATCCC
GCCTCGTACC AGTGGGGCGA CCGGGCCTGG ACCGGCCGGC CGCTGGCCGG GGGCGTGGTC
TACGAGATGC ACGTCGGCAC CTTCACGCCC GAGGGCACGC TGGACGCCGC GGTCGCCCGG
CTGGACCACC TCGTCGACCT CGGCGTGGAC TTCGTCGAGC TGCTGCCGGT CAACGGCTTC
GACGGCACGC ACAACTGGGG GTACGACGGC GTCCTCTGGT ACACGGTGCA GGAGAGCTAC
GGCGGTCCCG CGGCCTACCA GCGCTTCGTC GACGCCTGCC ACCAGCGCGG CCTGGGCGTT
CTGCAGGACG TCGTCTACAA CCACCTCGGT CCGTCCGGGA ACTATCTGCC GCTGTTCATG
CCGATCTTCA GCGAGGGCGG CGCCAACACC TGGGGCAGCT CGGTGAACCT CTCCGGGCCG
GACTCCGACG AGGTGCGCCG CTACATCATC GACAACGCGC TGATGTGGCT GCACGACATG
CACGTCGACG GGCTGCGGCT GGACGCCGTC CACGCGCTGG TCGACGAGCG GGCCATCCAC
GTGCTCGAGG AGATGGCCCA GGAGGTCGAC CGGCTGTCGG TCGCCGAGGG CCGGCCGCTG
ACCCTCATCG CCGAGAGCGA CCTCAACGAC CCGCGCATGG TCACCCCCCG GGTGGCAGGC
GGCCTGGGCA TCTCCGCCCA GTGGAGCGAC GACTTCCACC ACGCGCTGCA CTCGGTGCTC
ACCGGTGAGG GCCAGGGCTA CTACGGCGAC TTCGCCGCGG CGGGCCTCGC CGGGCTGGCC
AGGACGCTGA CCGGCGCGTA CTTCCACGAC GGCGCCTGGT CGAGCTTCCG CCGCCGGCAC
CACGGCCGGC CGGTCGACAC CGCGCTGCTG CCGGGCTGGA AGTTCCTCGG CTACCTGCAG
GACCACGACC AGATCGGCAA CCGCGCGGTC GGCGACCGCA TCTCGGCGTC CCTCTCCCCC
GGCCTGCTCG CCGTCGGCGC GACCCTGGTG CTCACCAGCC CGTTCACCCC GATGCTGTTC
ATGGGCGAGG AGTGGGGTGC CGCGACGCCG TGGCAGTTCT TCACCAGCCA CACCGACCCG
GAGATCGGCC GGGCGACGGC GGAGGGGCGC AAGGGCGAGT TCGCCGAGCA CGGGTGGGAC
GCCGACGAGG TGCCCGACCC GCAGGACCCG GAGACCTTCA CCCGCTCGAA GCTGGACTGG
AGCGAGCCCG AGCGCGAGCC GCACCGGACG CTGCTCGCCG CCACCCGGGC GCTGCTGCGA
CTCCGGCACC AGCACCCCGA CCTGGTCGAC CCGGACCTGT CGCGGGTGCA CGTCTCCTGG
GACGACGCCG ACCGCTGGCT GGTCGTGCAC CGCGGCTCCC TGCGGGTGGT GGCGAACCTC
GCCGACGTCC CCCGCGAGAT CGATCTGGAC GGCCCGGTGG CGGAGGTGCT GTTCGCCACC
GGAGAGCTGC CGCACCTCGA CGGGCGGACG GTGACCGTGC CGGCCGAGAG CGCGGCCGTG
CTGACCACCG CCTGA
 
Protein sequence
MSAAVEFAVW APVPERVRVQ VDGSVHEMRR EEGGWWRAEV EAGPEADYGF LLGDDDTPRP 
DPRSRRQPEG VHGLSRRHDP ASYQWGDRAW TGRPLAGGVV YEMHVGTFTP EGTLDAAVAR
LDHLVDLGVD FVELLPVNGF DGTHNWGYDG VLWYTVQESY GGPAAYQRFV DACHQRGLGV
LQDVVYNHLG PSGNYLPLFM PIFSEGGANT WGSSVNLSGP DSDEVRRYII DNALMWLHDM
HVDGLRLDAV HALVDERAIH VLEEMAQEVD RLSVAEGRPL TLIAESDLND PRMVTPRVAG
GLGISAQWSD DFHHALHSVL TGEGQGYYGD FAAAGLAGLA RTLTGAYFHD GAWSSFRRRH
HGRPVDTALL PGWKFLGYLQ DHDQIGNRAV GDRISASLSP GLLAVGATLV LTSPFTPMLF
MGEEWGAATP WQFFTSHTDP EIGRATAEGR KGEFAEHGWD ADEVPDPQDP ETFTRSKLDW
SEPEREPHRT LLAATRALLR LRHQHPDLVD PDLSRVHVSW DDADRWLVVH RGSLRVVANL
ADVPREIDLD GPVAEVLFAT GELPHLDGRT VTVPAESAAV LTTA