Gene Hoch_6638 details

Gene Information

Locus tag: Hoch_6638
Symbol:
ID: 8549055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 9098416
End bp: 9100350
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 75%
IMG OID: 646391298
Product: glycoside hydrolase family 35
Protein accession: YP_003270997
Protein GI: 262199788
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
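
The coordinate and length fields above are consistent with one another, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 9098416
END_BP = 9100350


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                   # 1935
print(protein_length(gene_length(START_BP, END_BP)))   # 644
```

Both results match the Gene Length (1935 bp) and Protein Length (644 aa) fields.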


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.119053
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000205681
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGAAGCAC ACCGCCGCAG CACCCTCGGC GCAGCCGGAA TAATCCTCGA TGACGCAGAC 
GGCGGCGCCG GCGCGCGCGA GTTGCCGCTG TTCTCGGGCG TGCTGCACTA CTGGCGCGTC
GACCGCGCCG ACTGGCGCGC GTGTCTCGCC GCCATGCGCG AGATCGGCTT CGCGATGGTC
CACGTGCCGG TGCCGTGGAG CGTGCACGAG CGCGGCCCGG GCAGCTACGT TTGGCGCCAG
GAGCGCGACC TCGGTGGGTT TCTCGACCTG CTGGCCGAAA TGGGCATGCA CGCGGTGCTC
GAACCCGGGC CCGTGATCGG CGCCGAGCTG CCGGGTGGCG GCGTGCCCGC GCGCATCCTC
GACGACGCCG CGCTGCTGGC GCGGACCGCG CGCGGTGCGC CGGCGTGGGT GACGGCGGTG
CCGCAGATGA TGACCGTGCC CTGCCTGGCG GCCTCGGCGC TGCGCGAGCA GGTCCTGGCC
TGGTTCGCTG CGGTAGCCGA AATCGCGGCG CCGCGCCTGG TGCCGAACGG GCCGGTCGCG
GCCGTGCACC TGGGTCGTCA CGCCGATCTC GTCGCCCAGC TCGGCCCCTT CGAGTACGGC
TATCAACCCG ATGCGCTGCG CCTGTGGCAG GAGCTCGAGG GCGGCGAGCC GCCGCGGCAG
TGGTCGGCCG AAGACGGCGC GCGCATGGCG GCGTGGATGC AATGCCGCGA GGTGGGCATG
CGCCGCGATC TGGTCTGGCT GTCGGGCGCG CTCGACCAGG TCGGACTGCG CGAGGTGGTG
CGCCTCGTCG ATCTACCGTG GTCGCTGCCC GGCGCCTTCG ACATCGCCGC GCTCGAACGC
GCGCTGGGCG GCGAGGTCGC TGTCGGCATG CGCGTCGGCC ACCGTCCAGC CGTACCCGCG
GCCCTGCACC GGCGCGCGCT ATACCTCACC GGCTCGGCGC GCCTGCCTGT ATTCGGACAG
GTGTCCGTCG GTGGCCCGAC GCTGGGGGTG CCGGTCCCTG TGGACGCCGC TCAGCGCGCC
GTGCTCGGCG TCTTGGCTGC GGGCGCGCGC GGCGTCGGTC TGCATCTGCT GGCCGAGTGC
AGCGGCTGGT ACGGGGCGCC CGTGAGCGAA CTCGGCGAGG CCCAGCAGGC CGCCGAGTGG
CTCACGCGGG TGCTCGCGGC GCTGCGTGAA GTATCGTGGA CCACGCTGCG CAGGCACACG
CCGGTGGCCG TGATGGTCGG CCGCGCGGAG CAGCGCTTTG CCGCCGCGTC CTCGGCTGCG
GGCGCGCTGG GGTCGCTGCT CGAGGGCTTG TTGCCGCCGG GGACCAACGA CCGGGCCAGC
CTGGCGCGCG ACCTCGACGC CGCTGCCAGT CGCCGCTGGA CCGAGGCCGC CATCGATGCA
CTCGAACTGG CACAGATTCC CTACCGCGTG GTCGACGAAG GCTGCGCGCC CGAGGCCTTT
GCCGGGGTCC GCGCCGTGAT CGCGCCGACC CTGCGCCGTG TCGATCGCGG CGCCTGGCAG
CGGCTGCACG AGCTCGCCCG CGGCGGCGCG GTGGTGATCG CCGGTCCCGA GCGTCCGCGC
TGCGATGAGC GCGGCCGGGA GCTGGGCGAC GACGCCGCGC TGCCGGCGCG CGCGGGGCTC
ATGCGGGCGG CCTCGCTCGA GGACCCCGAG GGCCTGGCCG ATGACCTCGC CGAGGTCGCC
GGGGAGCTGT CCGAGCTGTG GCTCACGGCC GAGCAGGGCG AGGTGGACTG CTCGCTGTTC
AGCGACCCGA GCGGCGCGCC GCGGGTGCTG TTCGTGAGCA ACCGGCGCGC CGCGGCGGTG
GTCGCCGATG TCCTCGTGCC CGCCGGGGTC GCGCTCGAGG ACGCGATCAC GGGTGAGACC
CTGCGGCCCG GGCGCGACGG CGTCGTCGAT GTGCGCCTCG AGCCGCTGCA GATCGCCATG
CTGCTGGTGC GCTGA
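
The GC content field can be recomputed from a sequence block laid out like the one above. This is a minimal sketch, assuming the block uses whitespace only as a visual separator between groups of ten bases; `gc_content` is an illustrative helper, not the method the database uses. Only the first row of the gene sequence is shown here; passing the full block gives the value for the whole gene.

```python
def gc_content(seq_block: str) -> float:
    """Fraction of G and C bases in a whitespace-separated sequence block."""
    seq = "".join(seq_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence above.
first_row = "GTGGAAGCAC ACCGCCGCAG CACCCTCGGC GCAGCCGGAA TAATCCTCGA TGACGCAGAC"
print(f"{gc_content(first_row):.1%}")
```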
 
Protein sequence
MEAHRRSTLG AAGIILDDAD GGAGARELPL FSGVLHYWRV DRADWRACLA AMREIGFAMV 
HVPVPWSVHE RGPGSYVWRQ ERDLGGFLDL LAEMGMHAVL EPGPVIGAEL PGGGVPARIL
DDAALLARTA RGAPAWVTAV PQMMTVPCLA ASALREQVLA WFAAVAEIAA PRLVPNGPVA
AVHLGRHADL VAQLGPFEYG YQPDALRLWQ ELEGGEPPRQ WSAEDGARMA AWMQCREVGM
RRDLVWLSGA LDQVGLREVV RLVDLPWSLP GAFDIAALER ALGGEVAVGM RVGHRPAVPA
ALHRRALYLT GSARLPVFGQ VSVGGPTLGV PVPVDAAQRA VLGVLAAGAR GVGLHLLAEC
SGWYGAPVSE LGEAQQAAEW LTRVLAALRE VSWTTLRRHT PVAVMVGRAE QRFAAASSAA
GALGSLLEGL LPPGTNDRAS LARDLDAAAS RRWTEAAIDA LELAQIPYRV VDEGCAPEAF
AGVRAVIAPT LRRVDRGAWQ RLHELARGGA VVIAGPERPR CDERGRELGD DAALPARAGL
MRAASLEDPE GLADDLAEVA GELSELWLTA EQGEVDCSLF SDPSGAPRVL FVSNRRAAAV
VADVLVPAGV ALEDAITGET LRPGRDGVVD VRLEPLQIAM LLVR
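
The protein begins with M even though the gene begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code) GTG is one of several alternative start codons, and a start codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is a simplified translator, not the database's own pipeline: it assumes an unambiguous A/C/G/T sequence in frame 0 and stops at the first stop codon.

```python
# Amino acids in the conventional TCAG codon order; table 11 uses the
# same assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq_block: str) -> str:
    """Translate a whitespace-separated CDS block in frame 0."""
    seq = "".join(seq_block.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any start codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base groups of the gene sequence above.
print(translate("GTGGAAGCAC ACCGCCGCAG CACCCTCGGC"))  # MEAHRRSTLG
```

The output matches the first ten residues of the protein sequence, MEAHRRSTLG.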