Gene Hoch_4787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4787 
Symbol 
ID8547194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6533226 
End bp6535685 
Gene Length2460 bp 
Protein Length819 aa 
Translation table11 
GC content66% 
IMG OID646389461 
Productcoagulation factor 5/8 type domain protein 
Protein accessionYP_003269170 
Protein GI262197961 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2273] Beta-glucanase/Beta-glucan synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGA CGAACAGGAA CTTCGAGCTA CGGACACGCG CGGCTCGCTG GCGGCTGGCC 
GGACCTTCGG TTTTATTGCT GGCCGTAACT GGCTGTATGG GCAATCCCTC GCCAGACGAG
GAGTCGCCCA TCGGGGCCGA TGAGCCGGCG ATGATGCCCG ACCAGGGGTC CGACATAGAC
CCCGACATGG GCGCAGGCGA CGAGGTGGCG GCGCCCGAGC TGGGTGAGGA GCCCGAAGCG
CCGGCGCCGA TGGCCGAGCC CGGGCTGCTG TGGGAGGAGA ACTTCGACGG GCCCGAGATC
GACCGCGACA CCTGGACCTA CGATGTCGGC GTGGGCGTGT GGAACTGGGG TTCCAATCAG
GAGCTGCAGT ACTACACGGA TCGGCCGCAG AACTCGTACA TCGAGAACGG CAAATTGGTC
ATCGAGGCCC GGCGCGAGGC CATGGAGGGC TACGAGTTCA CCTCGGCGCG CCTCAAGTCC
ACGGGGCGGG TGTCGTTCAA GCACGGCTGG ATCGAGGCGC GCATCAAGAT GCCCGACCTG
CAGGACGGGC TGTGGCCGGC GTTCTGGCTG CTGGGCAACG AGAACACCTG GCCGGCCTCG
GGCGAGCTCG ACATCGTCGA GATGGGCTAC GCCGACGCCA TCGCCGAGGG CAAGGTCAAC
AACCGCGTCG GCGCCACCGC GCACTGGGAC TACGAGGGCA ACTACGCCGG CTACGGCGAG
ACCTACGACG CGCCCGAGGA TCTGACGCAA GACTACCACG TGTACAAGCT GTACTGGGAC
TCCTCGGTGA TCCGCGGCTA CATCGATGAT ATCCACTACT GGACCCTGGA TATCTCGGGC
GACTCGGCCT CGCTCGAGGA GTTCAAGTCG CACCAGTTCT ACATCATCCT CAACCTGGCG
GTCGGCGGCA TGTTCCCGGG CATCTACGAT CCGGCGCAGA TCACGGCGCC GCTGCCGGCC
AAGATGTACG TGGATTACGT CCGCCTGTAC GGGATTGAAG AGACCGAGCT GTACGTCGGC
GCCGACAACG AGAAGGACGG CGCGTACGGC GTGTACACCG AGAATACGCC GGTCGAGGAC
CAGGCCCAGC TCGGCGTCGA TACCAACCTG TACCTGTGGA ACAACCTGGT CGATGTGGCC
ACGGCCGCGT TCGAGGGCGT CGACGCGCTC GGCCTGCGCG CCGCGGCCGG CGACTGGTTC
GGCATGGGCT TCTCCACATC GATCAAGAAC ATGTCGAACT ACTCCGACGG TCACCTGCGC
TTTCACGCCA AGACCACCAG CACGCATCCC TTCGAGCTCG GCATCAGCAG CGCGGGCGCG
GGCGAGGGCT GGGTGCGCTT CGAGAGCGGC GACGATCCCT ACGGCCTGGT GCGCGACGGC
CAGTGGCACG AGGTCGCGAT CCCGCTCAAT AAGTTTGGCA ACATCGACTT CCACTCGATC
AACCAGCTCT TCATGCTCGT GGGCGACGCG CCCGGCGCGG TCATGGATCT GGCCTTCGAC
AACATCTACT GGACCCCGAG CGTGGCCCGG CCGTCGCCGG CCAATGGCAA CTACGGCGTC
TACACCGAGA CCGCGTCGGT GGTCGATTCC TTCGATCTGC AGAGCGAGGG CGGCCTCTTC
GTCTGGGAGA ACACCTTGCA GGCGGCGCCC GGCGCGCCCT TCGAGGGCGG CGAGTCGCTG
GCGTACCAGT CGACGCCGGG CCTGGCTTGG TTCGGCATGA GTTTTACCCC CGACGTCAAA
CACGACCTGC GCGCGTTCGC GAGCGGCTAT CTGCACTTCG CGCTCAAGAC CACGTCCACC
ACGCGCTTCC AGATCGGTAT GAAGAGCGGC AACGTGGATA ACCTCGGCCA GTCGTGGATC
GCCTTCGAGA ACGGCAACGA CCCCTACGGC TTCGCGCGCG ACGGCCAGTG GCACGAGATC
GAGATCCCGC TGTCCGACTT CGCCAGCGTC GACCTCGGCG AGGTCAGCCA GCTCTTCGAG
CTGCTCGGCA CCGACGGCGC CATCACCAAC ATCGAGATCG ACGACATCTA CTTCGGCGGC
GGTGGCTCGG GACCGACGGA TCCGCCCGAC GAGGACCACA GTGTCAACCG CGCGCTGGGT
CAGCCGACCT TCGCCAGCAG CGTCGAGGGC GGCGTGTTCG TGGCCAGCGG CGCCACCGAC
GGTGACCCGG GCACGCGCTG GTCGAGCGAG TTCGCCGACC CGCAGTGGAT CTACGTGGAT
CTCGGCGAGC GTCGCTCGCT GGCCCGCGTG GTGCTGAGCT GGGAGGCCGC CTACGGCAGC
GCCTATCAGG TCCAGGTGTC CGACGACGCC GCGCAGTGGA CCACGCTGGC CGCGGTGGAC
GGCGGCGACG GCGGCATCGA CGACATCGCC ATCAGTGGCG CCGGGCGCTA TGTACGCGTC
TACGGCACGC AGCGGGCCAC GCCCTACGGC TACTCGCTGT TCGAGTTCGA GGTCTACTGA
 
Protein sequence
MSTTNRNFEL RTRAARWRLA GPSVLLLAVT GCMGNPSPDE ESPIGADEPA MMPDQGSDID 
PDMGAGDEVA APELGEEPEA PAPMAEPGLL WEENFDGPEI DRDTWTYDVG VGVWNWGSNQ
ELQYYTDRPQ NSYIENGKLV IEARREAMEG YEFTSARLKS TGRVSFKHGW IEARIKMPDL
QDGLWPAFWL LGNENTWPAS GELDIVEMGY ADAIAEGKVN NRVGATAHWD YEGNYAGYGE
TYDAPEDLTQ DYHVYKLYWD SSVIRGYIDD IHYWTLDISG DSASLEEFKS HQFYIILNLA
VGGMFPGIYD PAQITAPLPA KMYVDYVRLY GIEETELYVG ADNEKDGAYG VYTENTPVED
QAQLGVDTNL YLWNNLVDVA TAAFEGVDAL GLRAAAGDWF GMGFSTSIKN MSNYSDGHLR
FHAKTTSTHP FELGISSAGA GEGWVRFESG DDPYGLVRDG QWHEVAIPLN KFGNIDFHSI
NQLFMLVGDA PGAVMDLAFD NIYWTPSVAR PSPANGNYGV YTETASVVDS FDLQSEGGLF
VWENTLQAAP GAPFEGGESL AYQSTPGLAW FGMSFTPDVK HDLRAFASGY LHFALKTTST
TRFQIGMKSG NVDNLGQSWI AFENGNDPYG FARDGQWHEI EIPLSDFASV DLGEVSQLFE
LLGTDGAITN IEIDDIYFGG GGSGPTDPPD EDHSVNRALG QPTFASSVEG GVFVASGATD
GDPGTRWSSE FADPQWIYVD LGERRSLARV VLSWEAAYGS AYQVQVSDDA AQWTTLAAVD
GGDGGIDDIA ISGAGRYVRV YGTQRATPYG YSLFEFEVY