Gene Hoch_1635 details


Gene Information

Locus tag: Hoch_1635
Symbol:
ID: 8544017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2229492
End bp: 2231129
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 71%
IMG OID: 646386343
Product: SMP-30/Gluconolactonase/LRE domain protein
Protein accession: YP_003266078
Protein GI: 262194869
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3386] Gluconolactonase
TIGRFAM ID:
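
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 2229492
END_BP = 2231129


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumption).
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1638 545
```

Under these assumptions the computed values match the listed Gene Length (1638 bp) and Protein Length (545 aa).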


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.260295
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCGAT TCCTCCTCGT CCCGCTCGCC CTCGCGCTGC TTGTGCCCGG CGGCCACGCG 
CAGGCAGAGC CCGCGTATCG CCAGGAGGTC GTGATCGCCG GCTCGCCCTT CCAGGGCGTG
CATGGCCTGG CCGTCGACGG CGATCGCCTG CTGGCGAGTA ACCTTCTCGG ACAGTCGGTC
CACAGCGTCG ATCTGCGCAC CGGCGCGGTC AGCACCGTGG TGGGGCCGCC GCTCGGCGGC
GCCGACGACG TCGCCGTGGG CGCCGACGGA TCCATCTACT GGACCGGCTA CTTCACCGGC
CGGCTGATGC GCCGCACGCC CGACGGCAAA ACCCGCGTCA TCGCCCGCGA CCTGCCCGGC
CTCAACTCGC TGGCCTTCCG CGCCGACGGC CGCCTCTACG TCACCCAGGT CGGCCGCGGC
GACGCGCTGT GGGAAGTCGA CCCAGCCGGT CAAAAACCGC CGCGCAAGCT GCACGAGGGC
ATCGGTTTCC TCAACGGCTT CGAATTCGGC CCCGACGACC GCATCTACGG TCCGCGCATG
ATGACGCGCG AGATCATCCG CCTGGACGTC GACAGCGGCG CCATCGAGGT CGTCGCCGAC
GGCTTCATCG CGCCCACCGC GGTCAACTTC GACAGCCGCC GCCAGAACCT GTACGTCACC
GACACCTCGA GCGGCGCGCT GGTGCGCGTC GAGGTGGCCA CCGGTGCCAG GGAGGTCGTG
GCCGAGCTGC CCTCGGGCCT CGACAACCTG GCCATCGGTC CCGACGATCG CATCTACATC
TCCAACATGA TCGATAACGA CATCCGCGTA TTCGACCCCG CGGACGGCTC GCTGCGGCAT
CTGGTCGAAT CCCGCCTGAG CGTGCCCGCC GGCCTGTTCA TCGATCCCGA AGATCCCGCC
GAGCGGCTGT ATCTCGCCGA TGTCTTCGCC CTGCGCCGGG TGCGCACCAG CGACGGCCGT
GTCACCAAAA CCGGCCGCGT GCTGTCCACG GCCATGACCT TCCCCATGCA CGTGAGCGCC
ACCGCCGAGC ACCTGGTGCT CAGCAGCGCG TTCATCGGCA ACGTCCAGGT CATGGACCGC
GCGTCCGGAA ACATCCTGCA CACCGTGCCC AACGGCAACG GCGTCCAGGG CGCGATCGAG
CTCGAGGGCG GCGCGCTGCT GATCGCCGAA GCCGGCACCG GCCGCCTGGT GCGCGTGGAG
CTGAGCGGCG GCGACGCGAG CAGCTCCGTG CTGGTCTCCG GCCTGGGCGG ACCGGTGGGC
CTCATCGCCG CGCGCGACGC CGACGAGCCC AGCGTGTACC TCACCGAGGT GAAGGCCGGG
CGGGTCACCG AGGTGCGCCT GAGCGACGGC CGCCGCCGGG TCGTGGCCAA GGGGCTGCGC
GCGCCCGAGG GCATCGCCCA GCACCCCGAC GGCAGCTTGA TCGTCGCCGA GGTCGGCCGC
AAACGGCTGC TGCGCATCGA TCCCGACACC GGCCGCCGCA GCGTGATCGC CAGCGACCTG
CGCATCGGCC TGCCCGAGAA CGACGGCCTG CCGCCGGGCT TCATGCCCAC GGGCGTGGCC
GTCGGCGCCT CCGGGACCAT CTACATGAGC TCCGACCTCG ACAGCGCGCT GCTGCGATTC
GTCCCGATCG CGCCCTGA
 
Protein sequence
MYRFLLVPLA LALLVPGGHA QAEPAYRQEV VIAGSPFQGV HGLAVDGDRL LASNLLGQSV 
HSVDLRTGAV STVVGPPLGG ADDVAVGADG SIYWTGYFTG RLMRRTPDGK TRVIARDLPG
LNSLAFRADG RLYVTQVGRG DALWEVDPAG QKPPRKLHEG IGFLNGFEFG PDDRIYGPRM
MTREIIRLDV DSGAIEVVAD GFIAPTAVNF DSRRQNLYVT DTSSGALVRV EVATGAREVV
AELPSGLDNL AIGPDDRIYI SNMIDNDIRV FDPADGSLRH LVESRLSVPA GLFIDPEDPA
ERLYLADVFA LRRVRTSDGR VTKTGRVLST AMTFPMHVSA TAEHLVLSSA FIGNVQVMDR
ASGNILHTVP NGNGVQGAIE LEGGALLIAE AGTGRLVRVE LSGGDASSSV LVSGLGGPVG
LIAARDADEP SVYLTEVKAG RVTEVRLSDG RRRVVAKGLR APEGIAQHPD GSLIVAEVGR
KRLLRIDPDT GRRSVIASDL RIGLPENDGL PPGFMPTGVA VGASGTIYMS SDLDSALLRF
VPIAP
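
The sequences above are displayed in space-separated blocks of 10 characters, 60 per line. A minimal sketch of how such a block could be joined into one continuous string, and how a GC fraction could be computed from it, is given below. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any tool referenced by this record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two 10-base groups of the gene sequence, as displayed above.
example = clean_sequence("ATGTATCGAT TCCTCCTCGT")
print(example, len(example))
print(round(gc_fraction(example), 2))
```

Applied to the full gene sequence, the same two helpers give a value that can be compared with the GC content listed under Gene Information.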