Gene Hoch_4788 details


Gene Information

Locus tag: Hoch_4788
Symbol:
ID: 8547195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6537067
End bp: 6541761
Gene Length: 4695 bp
Protein Length: 1564 aa
Translation table: 11
GC content: 70%
IMG OID: 646389462
Product: coagulation factor 5/8 type domain protein
Protein accession: YP_003269171
Protein GI: 262197962
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
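
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(6537067, 6541761)
print(length)                     # 4695
print(protein_length_aa(length))  # 1564
```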


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.35795
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTGGGCG CGAACGCGTG TCTACCCGGC CAAGACGACG CCGAGTCCAC CGGCACGAAC 
GCGCGGGCCG CGTTCACCAC CGCGGCGGCG ACGCTCAGCA TCGCGGGCGC GGCCGCGTCC
TCGGGCGCAG CCGCGTTCGC GGTCGACGGC AACCTCGGCA CCCGCTGGGA GAGCTCGTTC
GCCGACCCGC AGTGGATCCG TTTCGACCTC GGCAGCGCGC AGGAGCTCGG CAGCATCGAG
TTGGTCTGGG AGACTGCCAA CGCGAGTAAC TACACGGTCG AAGGCTCCAA CGACGACACC
AACTGGACCA CGCTGGCGAC CCAGACCGGC ATGGCCGCGG GCGAGCGCAC CGACACGGTC
GCGCTCGCGG GCAATTTCCG CTACGTGCGC GTGCACGGCA CCGCGCGTAC CACCGAATAC
GGCTACTCCC TGTGGGAGGC CACGCTCTAC AGCCCCGACG ACGGCCCGCC CTCGGCCGGG
GACACGCGCT GGGATGTGGT CGCCGCGACC GCGTCGAGCC TGGAGTTTGG CACCGCGCTT
GAGGCCGCCG ATGGCAACAT GGGCACCCGC TGGGCCAGCG AGGCCGCGGA TCCGCAGTGG
GTGCGCTTCG ACCTCGGCCA GGCGCGCGAT ATCGGACGCG TGGTCATTCA CTGGGAGACC
GCGAGCGCCA GCGCCTACAC CATCGAGGGC TCGAACGACG ACGCGAGCTG GACCGTACTG
GCGAGCAAGA GCAACATGGC CGTAGGCGCC CGCTCCGACG ATCTGAGCGG GCTCGCGGGC
AGCTACCGCT ACCTGCGCAT CCACGGCACC GCGCGCACCA CCGGCTACGG CTACTCCATC
TGGGAGACCG AGATCTACGC CGGCGGCGGC GGCCAGCCTC CGGGCGGCGG CGATCCCTGG
AGCGTGTCCG GGGCCACCGC CTCGAGCACC GAGTACGGGA CCCCGGCCGA GGCCGCGGAC
GGCAACATGG GCACCCGCTG GGCCAGCGAG GCCGCGGATC CGCAGTGGAT CCGCTTCGAC
CTCGGCCAGG CCCGGGAAAT CGGACGCGTG GTCATTCACT GGGAGACCGC GAGCGCCAGC
GCCTACGCCA TCGAGGGCTC GAACGACGAC GCCAACTGGA CCACGCTGGC GACCAAGACC
GGCATGGCCG CGGGTGCGCG CAGCGACGAC GTCGGCGGCC TCGCGGGCAG CTATCGTTAC
CTGCGCATCC ACGGCACCGC CCGCACCACG GGCTACGGCT ACTCCATCTT CGAGACCGAG
ATCTATGCCA CCGGCGGTGG CAGCACCGAG CCGCCGCCGA CGGGCCCCTT CACCATGTCG
CTCTCGGTTC CCTACGTCGA GTATCTGCAG ATCGAGCTGG TGCCGCCCTC GCTCGAGGGC
ACCGATATCC TGGTCATCCA GAACAACACC CTGGACACCC AGGTGAGCTA CGCGGGCGGC
ACCCAGGTGA CCATCCACGA GCGCCAGAGC TCGATTTTCG GACGCAACGT CTTTTTCAAT
GAGAGCGGCA CCAGCGCCAA CCCGCTGAGC GTCAACATGA ACCAGAACCG CAACGTCGCC
GTGCAGCTCG TGCCCATCGA CGACGGCGGC GAGAACCCGG TCTACGACGA TCCCATCGAG
TATCCCGAAA CCGCGCCGCG TCCGGGCGCC TTCGCGCTCA CCGCGCCGGG GCATCAGGAA
GTCTATCACC AGGACCGCAC GCCGGTGCTG AGCTGGGCGC CGGTAGCCGG TGCCTCGAAC
TACCGCGTGT ACCTCAACAT CACCCGCGAC GACTACGACT TCAGCCAGCC GGGCTCGCTG
CTCGAGCGCT ACACGCTCAT CGGCGAGACC AACCAGACCT CGCTGCAGGC GCCGAGCCTG
CCCGATCGCT GGACCTACCG CTGGTACGTC GAGGCGGTGA CCGGCGGCGG CATCGTCACC
AGCGAGAACC GCGCGTTCAG CGTCTACCTG CCCGATTTCG AGGATGTCGA CGACGGCGTG
AACATCGTCG CTGGCGCCCG CGACCTCAAC AAGAACGGCA CCATCGAGCC CTACGAGGAC
TGGCGGCTGC CGATCGAGAC CCGCATCGAC GATCTCATCA GCCGCATGAG CCCGATGGAA
AAGGCCATGC AGCTGTTCTT CGTGGTCGAG GAGAACCCGA CCGCGGGCTG GCACTTTGGC
CCGGCGCTGC CGCACGACCT GTTCGCGTAC CAAAAGGCCA CGGCGGCGAC CAATCTCGGT
ATCCCGTTCG TGTCCGCCGG CGACACCACG CACGGCTACG TCACCAGCTT CCCCACCGGC
GTGGGCATGG CGGCGACCCG CGATCCCTCG CTGGCCTACG ACGCCGCCAA CATGCAGCGA
CGCGAACACG TGGTCGCCGG CTACCGCGGC ACCCTCGGCC CCATCGCCGA GGTCGGCACC
AAGGCGATTT ATCCGCGCAT CCAAGAGGGT GGCGGTGAGG ACGCCGATCT CGTGGCCGCG
ATCATGCGCG CCATGATCAC CGGTTACCAG GGCGGCCCCG AGCTCAACCC GAGCTCGCTG
CTGCCGACCA CCAAGCACTG GCCGGGCGAA GGCGCGGGCG GCGAGGCCGG CATCACCTTT
GACGGCGTCA CCATCAAGTA CCACATGAAG CCCTTCCAGG GCGCCATCGA CTCGGGCACC
GGCGCGGTCA TGCCCGGCTA CGCGGGCAGC GACTTCCTCG ATCCCGGCGG CCGCGGCGCC
GGCGACAGCA TCGGCATCCT GGCGTATCTG CGCGAGGTGC TCGACTACGA CGGCCTGATC
ATCACCGACT GGCTGCCCAG CGGCGCCTGG GTGCGCGCGG CCAACGCCGG CAGCGACGTC
ATGGGCGGCG CCGATCCCAG CGCCATCGAC ATGAACACCT TCATCGCCGA GGTCGACAAC
GCCCGCCTGC ACAAGGCGCT GCGGCGCATC TTCCGCGTCA AGTTCGCGCT CGGCATCTTC
GAGAACCCCT ACGGCGACCT CGACGCGGTC ACCGCCGAGC ACCACAGCGA CGCGAACGTG
GCCATCGCCC AGCGCGCGGC CGAGGCCTCG CTCACCCTGC TCAAGAACGA CAGCCTGCTG
CCCTTCCGCA TGCCGGCGGG CTCCAAGCTG CTGGTCACCG GACCGCGCGC CGATGACGGC
CTGTCGCACG CCATCTGGCG CAGCGCCTTC GAGGCCGCCT ACGGCGACCA GACCATCGCC
GCCGCCATCT CCGCGCGCGG CCAGCAAGCC GGCCTCGACG TGGTCGTGGA CACCTCGCTG
GCGCCGACCA ACCAGGGCTA CGCGGGCGCG GTGGTGGTGC TGGGCGAGCG CTCGTACACC
CACGGGACGG CCTGGGATAA GGAAGAGCCG TACATCCCGC AGGAGCAGCG CGACCTTCTC
AGCCACCTGT CCGCAAACAA CATCCCCTTC GTGGTGGTCT ACATCCTGCC GCGGCCCTAC
GTCATCGATT TCGAGGTCTC GCTGGCCAAC GCCATCGTGG CCGCGTATCG CCCCGGCGGC
GCGGGCGGCG GCCCGGCCGT GGCCCGGCTG CTGTTCGGCG ACATCAAGCC GCAGGGCAAG
CTGCCGTTCC AGTTGCCGCG CAACATGACC CAGGTGGGCA ACGACAACCA CCCCGAGATC
GGCGAGGTCT GGGACATCCC CTACGACATG GGCGCGACCG CGGCCGAGCG CCAGCAGATC
CGCGACCTGA TCAACGCCAA CCAGCCGGTG CCGCCGACCT TCGGCGACCC GCTGTTCCAG
TACGGCGCCG GCTACGAGGA TTTCCACCTC TCGGACGGCT CTGCGCCCTC GGCGCCGCAG
ATCACCTCGC CCACGGGTGG CCAGGTTATC GGACAGGCGC CCGTATTCAC CTGGAACCCG
GCCCAGGACG CCGAGACCGG CATCCAGTAC TACGAGATCG TGGTCGATGG CGTGCTGCGC
GACACCACGC TCAGCACCTC GTACAACGCC AAAGGCCTGC ACCTGTCGAG CGGCGGCCAC
AACCTGGTGA TCCGCGCCCG CAACTGGGCC GAGCAGACCA CGGCCAGCGC CACGGTCGGC
TTTCAGTTCA TCGACGCGGT GGCGCCGACG CAGCCCGCGG TGGTGGCCGT GAACGCGCTC
GGCGGCGCTC AGGCGCGCAT CATCTGGCGC AGCGCCGGCG ATAACGAGAG CGGCATCCTC
GAGTACGTGC TGCGCGCCGG CTCCGCGGTG CTGGCCACGG TGCCGGGCAG CGACAAGAAC
GTCGCCTACC GCAACCTGGC GAGCACCAGC ACGGCGACCT CCAGCAGCCG CCAGGACGAT
CTCCGGGCCT CGTACGCGGT GGACGGCGAC CCGGCCACCC GCTGGAGCAG CGCGGCGGCC
GACGACCAGT GGCTCGAACT CGACCTGGGC GGGCTGTTCG CGCTCGACCG CGTCGCTCTG
AGCTGGGAGG CCGCGTACGC CAGCGGCTAC GTCATCGAGA CCTCGCGCGA CCGCCAGTCG
TGGTCCGTGG CGCTCGACGT CACCGGCGGC ACGGGCGGCG AGGAGTCGCG CGCGCTGTCG
GTGGCCGCCG CCCGCTACGT GCGGATGCGG GCCACGGTGC GGGCCACGGC CTACGGCGTG
AGCCTGTGGG AGATGGCGGT TATGGGACGA CCGGTCGAAG AAGCTCAGGT CGGCGTGAAC
GCGCCGGCCA GCATCACGGT CGAAGCCGTG GACCGCCACG GCAACCGCAC CACCTCCGCG
CCGTACGCGT ACTGA
 
Protein sequence
MLGANACLPG QDDAESTGTN ARAAFTTAAA TLSIAGAAAS SGAAAFAVDG NLGTRWESSF 
ADPQWIRFDL GSAQELGSIE LVWETANASN YTVEGSNDDT NWTTLATQTG MAAGERTDTV
ALAGNFRYVR VHGTARTTEY GYSLWEATLY SPDDGPPSAG DTRWDVVAAT ASSLEFGTAL
EAADGNMGTR WASEAADPQW VRFDLGQARD IGRVVIHWET ASASAYTIEG SNDDASWTVL
ASKSNMAVGA RSDDLSGLAG SYRYLRIHGT ARTTGYGYSI WETEIYAGGG GQPPGGGDPW
SVSGATASST EYGTPAEAAD GNMGTRWASE AADPQWIRFD LGQAREIGRV VIHWETASAS
AYAIEGSNDD ANWTTLATKT GMAAGARSDD VGGLAGSYRY LRIHGTARTT GYGYSIFETE
IYATGGGSTE PPPTGPFTMS LSVPYVEYLQ IELVPPSLEG TDILVIQNNT LDTQVSYAGG
TQVTIHERQS SIFGRNVFFN ESGTSANPLS VNMNQNRNVA VQLVPIDDGG ENPVYDDPIE
YPETAPRPGA FALTAPGHQE VYHQDRTPVL SWAPVAGASN YRVYLNITRD DYDFSQPGSL
LERYTLIGET NQTSLQAPSL PDRWTYRWYV EAVTGGGIVT SENRAFSVYL PDFEDVDDGV
NIVAGARDLN KNGTIEPYED WRLPIETRID DLISRMSPME KAMQLFFVVE ENPTAGWHFG
PALPHDLFAY QKATAATNLG IPFVSAGDTT HGYVTSFPTG VGMAATRDPS LAYDAANMQR
REHVVAGYRG TLGPIAEVGT KAIYPRIQEG GGEDADLVAA IMRAMITGYQ GGPELNPSSL
LPTTKHWPGE GAGGEAGITF DGVTIKYHMK PFQGAIDSGT GAVMPGYAGS DFLDPGGRGA
GDSIGILAYL REVLDYDGLI ITDWLPSGAW VRAANAGSDV MGGADPSAID MNTFIAEVDN
ARLHKALRRI FRVKFALGIF ENPYGDLDAV TAEHHSDANV AIAQRAAEAS LTLLKNDSLL
PFRMPAGSKL LVTGPRADDG LSHAIWRSAF EAAYGDQTIA AAISARGQQA GLDVVVDTSL
APTNQGYAGA VVVLGERSYT HGTAWDKEEP YIPQEQRDLL SHLSANNIPF VVVYILPRPY
VIDFEVSLAN AIVAAYRPGG AGGGPAVARL LFGDIKPQGK LPFQLPRNMT QVGNDNHPEI
GEVWDIPYDM GATAAERQQI RDLINANQPV PPTFGDPLFQ YGAGYEDFHL SDGSAPSAPQ
ITSPTGGQVI GQAPVFTWNP AQDAETGIQY YEIVVDGVLR DTTLSTSYNA KGLHLSSGGH
NLVIRARNWA EQTTASATVG FQFIDAVAPT QPAVVAVNAL GGAQARIIWR SAGDNESGIL
EYVLRAGSAV LATVPGSDKN VAYRNLASTS TATSSSRQDD LRASYAVDGD PATRWSSAAA
DDQWLELDLG GLFALDRVAL SWEAAYASGY VIETSRDRQS WSVALDVTGG TGGEESRALS
VAAARYVRMR ATVRATAYGV SLWEMAVMGR PVEEAQVGVN APASITVEAV DRHGNRTTSA
PYAY
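
The gene sequence begins with TTG while the protein begins with M, which is expected under translation table 11, where TTG is one of several permitted initiation codons. The following is a minimal sketch of how the protein sequence can be derived from the gene sequence. It assumes the first codon of the input is the initiation codon and drops a single terminal stop. The helper name `translate_cds` is hypothetical, and only the Python standard library is used.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid line of NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of table 11; at the first position they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS with table 11; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # Assumption: the first codon of the input is the initiation codon.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop one terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCTGGGCG CGAACGCGTG TCTACCCGGC"))  # MLGANACLPG
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("CCGTACGCGT ACTGA"))  # PYAY
```

Both outputs match the first ten and last four residues of the protein sequence listed in this record.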