Gene Hoch_1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1481 
Symbol 
ID8543863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2010490 
End bp2011893 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content70% 
IMG OID646386192 
Productbeta-galactosidase 
Protein accessionYP_003265927 
Protein GI262194718 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCA CAGCCTCCAC CACCTCCGGC CGGCTCCCGG AGGACATCGA CTTCCCGGCC 
GACTTCACGT GGGGCGTCGC CACCTCGTCC TACCAGATCG AGGGCGCTGC CAACGAAGAC
GGCCGCACCC CGTCCATATG GGACACCTTC TGCCGCGTGC CCGGCGCCGT CCACGAAGCC
CACAACGGCG ACGTCGCCTG CGACCACTAC CACCGCATGC CCGAGGACGT CGCGCTGATC
AAGAGCCTGG GCCTCGACAC CTACCGCTTC TCGGTGGCCT GGCCGCGCGT GCAACCGCGC
GGGCGCGGAC CGGTCAACCC CGCCGGCCTG GCGTTTTATG ACGCCCTGGT CGACGAACTC
CTGGGCGCCG GCATCGCGCC GTGGGTGACC CTGTACCACT GGGATCTGCC GCAGGAGCTC
GAGGACGCCG GCGGCTGGCC CAATCGCGAC ACCGCGTACC GCTTCGCCGA GTACGCGATG
ATGGTCTTCG ACAAACTCGC CGACCGCGTC GACACCTTCA CCACGCTCAA CGAGCCCTGG
TGCTCGGCCT GGCTGGGCTA CAACACCGGC GTCCACGCGC CCGGCCGCCG CGACTTCGAG
GCCTCGATCC ACGCCGTGCA CCACCTGCTG CTCGGCCACG GCCTGGCCAC CCAGCAGATG
CGCGCGGCCG CCACCCGCGA GCACCAGCTC GGCATCACGC TCAACCCGCT CGTGGCCGCG
CCCGCGACCG AGGCGCCCAC CGATCGCGAA GCCGCCCGCC GCGCCGACGG CCTGGGCCTG
CGCATCTACC TCGACCCGCT GTGGCACGGC CGCTACCCGG CCGACATGCT CGCCGAGCTG
GCCGACCGCA ACGTCGGCTT CCCGGTGCAG GAGGGCGACC TGGCCATCAT CTCCGAGCCC
TTCGACGTCC TCGGCATCAA CTTCTATTTC GGACAGGACT TCAGCGGCGT CGATGAAGAC
GGCCGGACGA TCGGCGACGA CGGTCTGCCC GTGGTCCGCG ACGTCCCGCT CAAAGGCCCC
CGCACGGCCA TGGGCTGGCC GATCACGCCC GATCGCTTCA CCCAGCTCCT GGTGCGCCTG
CAGCAGGACT ACCCGGGCGT GCCCATCTTC ATCACCGAGA ACGGCGTCGC CTTCGACGAT
GTCGCCGACG CCGACGGCTT CGTCGAGGAC GATAACCGCA TCCAGTACGT GGCCGATCAC
CTCGCCGCCG TCGCCGAGGC CCGCCGCCAG GGCGCCGACA TCCGCGGCTA CCTGCTGTGG
TCGCTGATGG ATAACTTCGA GTGGGCCGAG GGCTACGCCA AACGCTTCGG CATCGTGCGC
GTGGACTACG AGACCCAGAA GCGCACGCTC AAAAAGAGCG CGCTGTGGTA CCGCGACGCC
GTCGCCAGCT TCCGCGCCCG CTAA
 
Protein sequence
MNRTASTTSG RLPEDIDFPA DFTWGVATSS YQIEGAANED GRTPSIWDTF CRVPGAVHEA 
HNGDVACDHY HRMPEDVALI KSLGLDTYRF SVAWPRVQPR GRGPVNPAGL AFYDALVDEL
LGAGIAPWVT LYHWDLPQEL EDAGGWPNRD TAYRFAEYAM MVFDKLADRV DTFTTLNEPW
CSAWLGYNTG VHAPGRRDFE ASIHAVHHLL LGHGLATQQM RAAATREHQL GITLNPLVAA
PATEAPTDRE AARRADGLGL RIYLDPLWHG RYPADMLAEL ADRNVGFPVQ EGDLAIISEP
FDVLGINFYF GQDFSGVDED GRTIGDDGLP VVRDVPLKGP RTAMGWPITP DRFTQLLVRL
QQDYPGVPIF ITENGVAFDD VADADGFVED DNRIQYVADH LAAVAEARRQ GADIRGYLLW
SLMDNFEWAE GYAKRFGIVR VDYETQKRTL KKSALWYRDA VASFRAR