Gene Hoch_1557 details

Gene Information

Locus tag: Hoch_1557
Symbol:
ID: 8543939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2123658
End bp: 2125331
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 65%
IMG OID: 646386266
Product: X-Pro dipeptidyl-peptidase domain protein
Protein accession: YP_003266001
Protein GI: 262194792
COG category: [R] General function prediction only
COG ID: [COG2936] Predicted acyl esterases
TIGRFAM ID: [TIGR00976] putative hydrolase, CocE/NonD family
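
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the terminal stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2123658, 2125331)
    print(length, "bp")                      # Gene Length field
    print(protein_length_aa(length), "aa")   # Protein Length field
```

With the Start bp and End bp values of this record, the two functions give 1674 bp and 557 aa, matching the Gene Length and Protein Length fields.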


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.198824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGGCAG CATCACAACG GATTTTGGGT ATCGCGCTTT GTTTCGCGGC GAGCGCATGC 
ACCGCCGAGT TCACTCCCGA AGCGGACCAT GGCGACCAGG TATTCGCAGA GAGCATCGCA
AAAGGCATAT CAGATACCTC GGCGTCCGCA TTCGCGGGCA CCGAGCAGCT CGATTTCACC
TTTCGAGACG GCATCACCTT GAGTTCGGCG GACGGCACCG CGATCACGGG CAACGTGTTC
GAGCCCACCG AGGGCGCGCC GGACACGCTT CCCGCGGTGG TCTTCGTCAA TAGCTGGGCG
CTCAACGAGT ACGAATACCT GGTGCCGGCG GCCGAGCTAG CATCGCGCGG CTACGTGGTC
ATGAGCTACA ACACCCGCGG TTTTGGTACC TCGGGTGGCC TGATCAACGT CGCTGGCCCG
GGCGACATGG AAGATCTGTC AGCGGTTCTC GATTGGATGG ACGAAAATAC CGACGCCGAT
ATGGACCGGG TCGGCATCGC GGGGATTTCG TACGGCGCCG GCATCTCGCT GCTGGGCCTG
GCCCAGGAGG GCCGCATCCG CACCGCGGTG GCCATGAGCG GCTGGGGCGA TCTCTACGAC
TCGCTCTATA AAGATGATAC GCCGCGCCTG GCCTGGGGCC TGATTCTCAT CGCCTCGGGC
TACTTCACCG GGCGCATGGA CCCGATCATC GCGCAACAGT TCCAGCGCCT GCTCAGCCAC
GAGGACATCG ACGAGGTGCA GAGCTGGGCG GCGGTGCGTT CACCGGCCAG CTACGTCGAC
GCGCTCAACG CCAGCGGCAA GCCGGTGTAC ATCAGCAGCA ACCTCTCGGA CACGCTGTTC
AACCCCAACC AGATGCTCGA CTTCTACGAG CGCCTCACCG GCCCCAAGCG CCTCGATTTC
AACCTCGGCA CCCACGCGAC CGCGGAGGCG CCCGGCCTCT TCGGCCTGTC CAACTACGTG
TGGAACAACG CCTACGACTG GCTCGACTAC TGGCTGCGCG ACATCGATAA CGGCATCACC
GCGCGCCCGC CCGTGACCAT CGAGAAGAAG TACAGCCACG AGCGCGTAGA GCTCGACGAT
TGGCCGGCGC AGGGCATTGC GGCGACGCAG ATGTATCTGA CGCCGCGCCT GCTCTCCGAT
GGCTCGCTGT CGTCCAATCC CAACGGCATG TCGATCAGCA ACCGCATCTG GTCGGGCGTG
GGTACGCTGG CGTCCACGGG CATCCCGCTG CTGTCCGATA TCCTCGATTC GCATCTCGAC
GTGCCCGTGA CCGCGTCGCT GCCGCTGATC GACCGGCTGC GCGGCTTCAC CTTCTGGTCG
GGCTCGTTCT CGGGCGGCCT CGAGATCATC GGCCGGCCGC AGGTCAACCT GCGTCTGGTC
TCGGGCGCGG ATACCGCCCA CGTGGTCGTG TATCTCTACG ACGTCGATGC CTTTGGCACC
GGCACGCTGA TCACCCACGG CACCGCATCG CTGCACGACA TCGCCGCGGG CCAGGTGCAG
ACCCTCGAGG TCGATCTCAA CGCCGTGGCC TACGACCTGC CCCGCTACCA TCGCCTGGGC
ATCGTCATCG ACACCGTGGA CCCGCTGTAC GCCAGCCGCA CGCCGGGCGG CACGGCCACC
GACCTGCCGT TCTCGGTTTC GGGACAGATG GGTCTCGAGC TCCCGGTCCG CTGA
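
The record reports a GC content of 65% for this gene. A figure of this kind could be recomputed from the sequence block above roughly as follows. This is a minimal sketch: the helper names are hypothetical, and it assumes GC content is simply the fraction of G and C bases over the whole gene, which the source does not state explicitly.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # Paste the full gene sequence here; only the first block is shown as a placeholder.
    example = "GTGAAGGCAG"
    print(f"{gc_fraction(example):.0%}")
```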
 
Protein sequence
MKAASQRILG IALCFAASAC TAEFTPEADH GDQVFAESIA KGISDTSASA FAGTEQLDFT 
FRDGITLSSA DGTAITGNVF EPTEGAPDTL PAVVFVNSWA LNEYEYLVPA AELASRGYVV
MSYNTRGFGT SGGLINVAGP GDMEDLSAVL DWMDENTDAD MDRVGIAGIS YGAGISLLGL
AQEGRIRTAV AMSGWGDLYD SLYKDDTPRL AWGLILIASG YFTGRMDPII AQQFQRLLSH
EDIDEVQSWA AVRSPASYVD ALNASGKPVY ISSNLSDTLF NPNQMLDFYE RLTGPKRLDF
NLGTHATAEA PGLFGLSNYV WNNAYDWLDY WLRDIDNGIT ARPPVTIEKK YSHERVELDD
WPAQGIAATQ MYLTPRLLSD GSLSSNPNGM SISNRIWSGV GTLASTGIPL LSDILDSHLD
VPVTASLPLI DRLRGFTFWS GSFSGGLEII GRPQVNLRLV SGADTAHVVV YLYDVDAFGT
GTLITHGTAS LHDIAAGQVQ TLEVDLNAVA YDLPRYHRLG IVIDTVDPLY ASRTPGGTAT
DLPFSVSGQM GLELPVR