Gene Hoch_1592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1592 
Symbol 
ID8543974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2176748 
End bp2180191 
Gene Length3444 bp 
Protein Length1147 aa 
Translation table11 
GC content65% 
IMG OID646386300 
ProductCapsule polysaccharide biosynthesis protein 
Protein accessionYP_003266035 
Protein GI262194826 
COG category[R] General function prediction only 
COG ID[COG5610] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.80092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATCC TCTACTACCT TGAGCCGATG ATCGAACTCG GCAACCCCTT GTTCCGCATG 
GGGACGGTTC GAAATCACTT GGACAGAGAG GTCAAGGCGC TCCAAGCTCA CCCCAAGGAG
CGCATCGAGG TACGGGTGCT GTGCTCCGAG GCCGTCGCTC GCGAGTCCAA AGCCAATGGC
TTGCTCAAAG ACGCGACGGT GTCGACGGTC GACGCCGAGC GTCTGCAGCG CGCCTTTCCC
GACTATCAGA GCATCTCGAG CGCCTGGTAC AGGGGCAGCT ATTCGCCAAA CGACAGCATG
TACATGATGG GGCTGGTCGC CGAAGCCCTC GAGGGCTTCG TCCCCGACGT CATCATCTGT
TACGAATCGT CGGCGCCGTT TTTGAGCGAG TTGTTCCCGC GGGCGCTGTT TCTCAACAGC
ATGCTCGGTC TGCTGTCGCG TCCGCCCTAT CCGGAGATGT CGTATCTCGA CCCCGTGGGC
ATCGGGCGCT ATTCGTCGTT GATTCAGTTC CAGGATCAGC TCCGCGCGCA CACCATCGAC
GACGCCGCGC GCGCGCGGCT CGAGCACATG AAAGAGCGCT ATCGCTGGAC CCTGCTCAAG
TACTCGCCGG TGCGCGCGGG CATGATCCGC CAGGGCTTCG ACCGCGTGCT GCTGGTGCCG
CTGCAGGTCT CGAAGTACTT CATGTTCAAC GATAACTGCC GCGACGAGCA GCCCTTTGGC
AACCAGCTCG ATTTCCTGCG CCACGTCGCC GCGCGCGTCG ACCGCAACAT CGGCCTATTC
GTGACCATGC ACGGCGCCGA GACCGCGTAC ATGAGCGACG AGGTCCTGCG CGGCCTGAAA
ACCGAATTTC CCAACATCAT GTTTTCGTCC GGGCTGCAGG AGCTGCGCTG GTCGTCGCAG
CACATTCTGC CGCACGTCGA CGGCGTGGTC ACGGTGTCGA GCAGCGTCGG CATGCAGACC
CTGCTGTGGG ATATTCCGGT GTTCTGCGCC GGTATGTCGC ATGTCAACGC CATCTCGTCG
GGGGCGGTGG AGGACATCGA CGCGGTCCTC GGCGAGGACA GCTTCGCGCC CAAGGACGAG
ATCTTCCACT TCCTGCTCAC GCGCTATTCG CCGCTGCTCG ACCGCTATCA CCACGATCCG
GTGTGGTTTC ACGATTTCCT CGAGCGCGGT ATTTCCATGC GCGAAGTGGC GCCGCTGGGG
CTCGATTTCT TCGCCCCCAT CGACGACGAG GAGCGCTTGT TTCTGGCCCT GGAAGAGGAC
CTGCGGGTCC CCGCCTACCA CGCCGATCTC AAGCGCTCGA AGAAGAGCTT CTCGATCTCG
CGCACGGCCG ACCTGGTGTC GGCGCACGCC AGCGTGGTCT CGGCCGATAT CGTCAGCTTC
GATGTCTTCG ACACCCTGCT CACGCGCGAG CTGGCCGAGC CCGCCACGGT GTGGGACCTG
GTTCGGCTCG AGGCCGAGCG CACCCTGCCG GAGCTGCGCT GGTCGTCAGC GCAGCCCTTT
GCCCAGCTCC GCCGCGACGC CGCGCAGCAA GCGGCCGAAG CGGCCCGCGC CGAGGGCAAG
GAGGAGTACA CGCTGGCCGC GGTCTACGAG TCTATCGGCG AGCAGCTCGG GCTCTCCAAG
GCCGACGCTG ACGCCTTGCT CGCGCTCGAA TTGCAGGTCG AGCGGCGTGT GCATCGCAAG
CCCCGGCGCC GCGGTTGGAT GCTGTATCAA GAGGCCAAGG GCCTGGGCAA GCGCGTCATC
GCGGTCTCGG ATACCTACCT GCCCAAGGAC TTCGTCGCCT CGCTGCTCGC GGATGCTGGC
TACGAGCTCG ACGAGCTGTA CACCTCGAGC GAATACGAGC GGCTGAAGAA GACCGGCGCG
CTGTTCACCA TCGTCCGCAA ACGCGAGGGC CGCAAGAAGC GCATCCTGCA TATCGGCGAC
AATTACCTGT CCGACGTGCA GATGGCCTCG GCCAAGAAGC TGCAAGCCTT CCACCTGCCG
AACCTGAGCG AAATCTATGC GGAATCATCG AAAGGTCCGG GGTGGACGCG CGGCGACCTC
GCCGCGAGCT GCGGCGCCTC GCTGATGCAC GGCGCCATCT CGCGCAAGTT CTTCGACGAG
GAGCCGCCCG AGGATTCGTG GCTGGGCGGC AGCCCCTACC GGCTCGGCTA CGTGGCCGGC
GGGCCGCTGG TGCTCGCGTT CACCGCCTGG CTGGTCGAGC AGCTCGCCGA TGCCGACATT
CGCCGCGCCT ATTTTCTGGC GCGCGACGGC TACCTCATCC GCAAGATCTA CGAGCTGATT
CGCCAGCACA ATCCCGCGCT GCCGGAGGCG CGCTATCTGC TCGCCTCGCG CCGCGCCTAC
AGCTTCCCCT CGGTCAAGGA CGAGCAAGCG CTGCTCAAGA CCCTCGACTG GCGCTTCAGC
GAATGCCCCG TGGGAACGCT GCTGCAATAC CGCTTCGGCA TGGCGCTCTC CGACCTGCCC
GAGCAGGCCT GGGCGCAGGC CGGCTTCAAG GGCCCCGAGC AGCTCGTCGA CAGCGGTCGC
GCTCAGGACA TGCGCAAGCT GCGGCGGCTG CTGAGCCGCA ACGCGACCTG GTTTCTGGGC
AAGGCCGAGG AGGAGCGCGC CGCGCTGCTG GCCTACCTGG GCGCCGAGGG CTTCACCGGC
GGCGCCGACC AGGCGATCGT CGATATTGGC CACAACGCCA CGCTGCAGCG CTGTTTGGGG
CAGCTCTGCG GCAATAACGC GTTTCGCGGC TTCTACTTCT CGACCTTCGA CCGCGCGCGC
GAGGTCGCCC GCGACGGTTA CTCGATCGAC TCGTTTCTGC TGCGCTACGA GCGCAACACC
TCGAGCGAGC ATCCCTACTG CAAGAATATC GGCCTGTTCG AGTTTCTGTT TCTGCCGCCG
CATCCCTCGT TCGAGCGCTT CTCGCTCGAT CGCGACGGCA CCCCGCGGCC GCATCACGTG
GTCGGCAACG AGGAGCGCCG CTGCCAGGTC GTCGAGCAGG TGCACGAGGG CGTGAGCGAC
TTCTGCCGCG ACGTGCTCGA GACCTGTAAC TACGACGTGC GCGCCTTCGA TGTCGCGCCC
AACGATGCGC TGCGCAGCTA TTTTCACTAC GCGGCGCAGC CCGATCCTGC CGACGCGCGC
ATCCTGAGCC GGGTGAGTTT TGTCGACAAG TTCGGCGGCA GCAACGCCCG CTACCTCATC
GAGCCGCTGC CGAACGACCT GCACGAGCGT CGCGTCTACG ACCGCTATCT GCGCAAGAGC
TGGTGGCGCA AAGGCGCCGA GGTGCTCGCC CAGGCGGCCG CGGAGCCGAG CCTCGGGACG
CCGCGCAAAG GCGGTTCGCG GCGCAAACTC CAGGAGTCGC AGAAGAGCGT GTGGGAGCGA
AAGTTGCGCA AGCTGCTCCG CGATCCCCAG TTGTTCTTCG CCGACGCGGT CAACAAGCGC
ACGCGCAAAC GCAAACGCAA ATGA
 
Protein sequence
MRILYYLEPM IELGNPLFRM GTVRNHLDRE VKALQAHPKE RIEVRVLCSE AVARESKANG 
LLKDATVSTV DAERLQRAFP DYQSISSAWY RGSYSPNDSM YMMGLVAEAL EGFVPDVIIC
YESSAPFLSE LFPRALFLNS MLGLLSRPPY PEMSYLDPVG IGRYSSLIQF QDQLRAHTID
DAARARLEHM KERYRWTLLK YSPVRAGMIR QGFDRVLLVP LQVSKYFMFN DNCRDEQPFG
NQLDFLRHVA ARVDRNIGLF VTMHGAETAY MSDEVLRGLK TEFPNIMFSS GLQELRWSSQ
HILPHVDGVV TVSSSVGMQT LLWDIPVFCA GMSHVNAISS GAVEDIDAVL GEDSFAPKDE
IFHFLLTRYS PLLDRYHHDP VWFHDFLERG ISMREVAPLG LDFFAPIDDE ERLFLALEED
LRVPAYHADL KRSKKSFSIS RTADLVSAHA SVVSADIVSF DVFDTLLTRE LAEPATVWDL
VRLEAERTLP ELRWSSAQPF AQLRRDAAQQ AAEAARAEGK EEYTLAAVYE SIGEQLGLSK
ADADALLALE LQVERRVHRK PRRRGWMLYQ EAKGLGKRVI AVSDTYLPKD FVASLLADAG
YELDELYTSS EYERLKKTGA LFTIVRKREG RKKRILHIGD NYLSDVQMAS AKKLQAFHLP
NLSEIYAESS KGPGWTRGDL AASCGASLMH GAISRKFFDE EPPEDSWLGG SPYRLGYVAG
GPLVLAFTAW LVEQLADADI RRAYFLARDG YLIRKIYELI RQHNPALPEA RYLLASRRAY
SFPSVKDEQA LLKTLDWRFS ECPVGTLLQY RFGMALSDLP EQAWAQAGFK GPEQLVDSGR
AQDMRKLRRL LSRNATWFLG KAEEERAALL AYLGAEGFTG GADQAIVDIG HNATLQRCLG
QLCGNNAFRG FYFSTFDRAR EVARDGYSID SFLLRYERNT SSEHPYCKNI GLFEFLFLPP
HPSFERFSLD RDGTPRPHHV VGNEERRCQV VEQVHEGVSD FCRDVLETCN YDVRAFDVAP
NDALRSYFHY AAQPDPADAR ILSRVSFVDK FGGSNARYLI EPLPNDLHER RVYDRYLRKS
WWRKGAEVLA QAAAEPSLGT PRKGGSRRKL QESQKSVWER KLRKLLRDPQ LFFADAVNKR
TRKRKRK