Gene Hoch_4545 details


Gene Information

Locus tag: Hoch_4545
Symbol:
ID: 8546950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6204124
End bp: 6205275
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 64%
IMG OID: 646389219
Product: glycoside hydrolase family 16
Protein accession: YP_003268930
Protein GI: 262197721
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2273] Beta-glucanase/Beta-glucan synthetase
TIGRFAM ID:
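
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the final codon of the CDS is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6204124
end_bp = 6205275
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last codon is assumed to be
# the stop codon, which adds no amino acid to the protein.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span == gene_length_bp, remainder == 0, residues == protein_length_aa)
```

Under these assumptions the three fields agree with one another: the span matches the gene length, the length is a whole number of codons, and the codon count minus the stop matches the protein length.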


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGC TCTATATCGC ACTCGCAGCG GCTCTGTCGC TTCTCTATGT CAACGAGGCC 
AACGCCCAGA GCTGGCAACT GGTCTGGGCC GATGAATTCA ACGGCAGCAT CAGCTCCGAC
TGGGTATTCG AAACTGGCAC GGGTTCCAGT GGTTGGGGCA ATAACGAATT GCAGTACTAC
CGCCGTGAAA ACGCCACCGT GGAGAACGGC AACCTGGTGA TCACGGCGCG GCGCGAGAAC
TTCGGCGGCC GCAATTACAC CTCGGCACGT ATGAAAACCC AGGGTCGCAA GACCTTCCGC
TACGGCCGCA TCGAGGCGCG CATCGCGCTG CCCACGGGCT CGGGTCTGTG GCCGGCGTTC
TGGATGCTCG GCAGCAACAT CAGCTCAGTG GGCTGGCCGG CCTGCGGCGA GATCGACATC
ATGGAGCACG TCAACAGCAA CAACGTCGCC CACGGCACCA TCCACTGGCA GGATCACAAC
GGCAACTACG CCAACTACGG CGGTCACACC TCGACCAACG TGAACAACTA TCACGTCTAC
GCCATCGAGT GGGACGACCG CGGCATCCGC TGGTTCCTCG ACGGCCAGCA GTACCACGAG
GTGAACACCT CGGGCGGTGT CAACGGCACC CACGAGTTCC ACAACGACTA CTTCCTGCTG
CTGAACATGG CCGTCGGCGG TAACTGGCCC GGCTTCACGG TCGACGAGGG CCGCCTGCCC
GCGCGCATGC TGGTCGACTA CGTGCGCGTG TACCAGGGCG GCGGTGGCGG CGGCGGCTTC
TCGCTGCACC GCGAGGCCGA GACCTACTCG TCGATGAACG GCGTGGACCT CGAGGGCTGC
TCGGAGGGTG GCCAGAACGT CGGCTGGATC GATCAGGGCG ACTGGATGGC CTACGGCGGT
ATCAACATCC CCAGCGCGGG TACCTACGTC ATCCGCTACC GCGTCGCCAG CCCCGGCGGC
AGCGTGCTGT CCTCGGATCT CAACGCCGGC TCGATCCCGC TCGGCAACGT CAACATCCCG
GCCACCGGCG GCTGGCAGAA CTGGACCACG GTGTCCCAGA CCGTGTCTCT CAACGCCGGC
ACCTACGACT TCGGCATCTT CGCCCAGCAG GGCGGTTGGA ACCTCAACTG GTGGAGCATC
GAGCGCCAGT GA
 
Protein sequence
MKTLYIALAA ALSLLYVNEA NAQSWQLVWA DEFNGSISSD WVFETGTGSS GWGNNELQYY 
RRENATVENG NLVITARREN FGGRNYTSAR MKTQGRKTFR YGRIEARIAL PTGSGLWPAF
WMLGSNISSV GWPACGEIDI MEHVNSNNVA HGTIHWQDHN GNYANYGGHT STNVNNYHVY
AIEWDDRGIR WFLDGQQYHE VNTSGGVNGT HEFHNDYFLL LNMAVGGNWP GFTVDEGRLP
ARMLVDYVRV YQGGGGGGGF SLHREAETYS SMNGVDLEGC SEGGQNVGWI DQGDWMAYGG
INIPSAGTYV IRYRVASPGG SVLSSDLNAG SIPLGNVNIP ATGGWQNWTT VSQTVSLNAG
TYDFGIFAQQ GGWNLNWWSI ERQ
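
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction. It assumes GC content means the share of G and C bases in the sequence, which this record does not state; the helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustrative input: only the first two blocks and the last two blocks of
# the gene sequence, not the full 1152 bp record.
example = clean_sequence("ATGAAAACGC TCTATATCGC\nGAGCGCCAGT GA")
print(len(example), example[:3], example[-3:])
print(round(gc_fraction(example), 2))
```

Pasting the full gene sequence into `clean_sequence` in place of the excerpt would allow the result to be compared with the Gene Length and GC content fields.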