Gene Hoch_6480 details

Gene Information

Locus tag: Hoch_6480
Symbol:
ID: 8548897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 8897166
End bp: 8900210
Gene Length: 3045 bp
Protein Length: 1014 aa
Translation table: 11
GC content: 70%
IMG OID: 646391143
Product: peptidase M16 domain protein
Protein accession: YP_003270842
Protein GI: 262199633
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
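
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The helper names gene_length and protein_length are hypothetical.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 8897166
END_BP = 8900210


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3045 1014
```

Under those assumptions the results agree with the Gene Length (3045 bp) and Protein Length (1014 aa) fields.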


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGGGT CCATGTCACC CGAACTGGCT TGCAGCAGCG CTGCCACGAA CGCCCGCCGG 
GGCCGGTCTT TCGCGATTCC GCTCGCGGCC GGTCTGCTCC TCTCGCTCGC CGCTTGCGGC
GAGAAATCAC CGCCGCCCGA GCAGCCCACG CCGCCGCAGC CGAGCGTCGA GGTCGAGACA
GCGGACACGC CGTCCGATAT CCTGGCCAGC TCGGATCTGC CGGAGCCGCT GCCGGCCCCG
CTCGCGGACG ACGCCATGGG CGTCACCGTG CATCGGCTGG CCAACGGCCT CACGGTGTAC
ATCAGCACCG ATCGCCAGAC GCCGCGCTTC ACCTCGTGGA TCGCGGTGCG CGCCGGCAGC
CGCCACGACC CCGCTGATTC GACCGGTCTG GCCCACTACC TCGAGCACAT GCTGTTCAAG
GGCACGGGCG CGCTCGGCAC CATCGACGCC GACGCCGAAG CCGTGCACCT GTTGCGCATC
GCCGAGCTCT ACGACGCCCT GCGCGCCACC GACGACGAGG GCGAGCGCGG CGAGATCCTG
ACCGCCATCG ACGCCGAGAC CCAGAAATCG GCGCGCTTCG CGGTGCCCAA CGAGTTCGAG
CAGACCTACG GCAAGCTCGG CATCAACCGG CTCAACGCCT TCACCTCCTT CGACCAGACC
GTCTACCTGT CCGAAGTCCC CTCGACGCGC CTCGAGGCCT GGGCCCGGGT CGAGGCCGAG
CGCTTCCGCA ACCCGCGCTT CCGCCTGTTC TACCCCGAGC TCGAGGCGGT CTACGAGGAG
AAGAACCGCT CGCTCGACAA CCCCGCCTGG CGCACCTTCG AGTCGATGTT TCAGGCGCTG
TTTCCCGGAC ACCCGTACGG TTCGCAGTCC ACCATCGGCC TGATCGAGCA CCTCAAGGTG
CCGGCCTACG CCGACATGGT CGCGTTCTTT CAGCGCTGGT ACGTGCCCAA CAACATCGCC
ATCGTGCTCG CCGGCGATAT CGACGCCGAG ACCGCGCTGC CGGTCATCGA GAAGTATTTC
AGCGACTGGG CGCCGCGCGC GCTCGAGACC CCGGCCGCGG GTGAGCTGGC GCCGCTGAGC
GAACGCGTGC AGCGCACGGT CAAGGCGCCG GGCGAGGCCG AGGTGCACCT GGCCTGGCAG
CTCGTGCCCG CCAACCACGA GGACGAGCCC GCGCTGTACA TCCTCGACCA GCTCATGGAC
AACGCCACCG CCGGGCTCAT CGAGGTCGAG CTGGTGCTGT CGCAGAAGCT GCCCGACGCC
GGCGCCTACA CCGAGATCAT GCGCGAGGCC GGCGCCTGGA TGATGTACGG CACCGCGCGC
GAGGGGCAGT CGCTCGCGGA GGTCGAGGGC CTGCTGCTGG GCGTGGTCGA GAAGCTCAAG
GCCGGCGACT TCACGCAGGA GCAGCTCGAC GCGGTGAAGC TCAACGCCAC GATCCGCGAG
ATGCGCGAGC TGGAGTCGAA CTGGGCGCGC GTGGCCAAGA TGACCGAGGC GTTCGTCAAC
CACACGCCCT GGTCGCAGGC CGCGGACCGC AGCGAGCGCA TCAAGGCGGT CACCCGCGAG
GACGTGATCG CGGTGGCCAA CACCTATCTC GGCGACGCCT ACGTGGCCGT GTATCGCGAG
AAGGGCGAGT TCACGCCGCC CAAGATCACC AAGCCGCAGA TCACGCCGGT AGCCATCGAT
CCCGAGCGCC AGAGCGCGTT CGCGGCCGAG ATCCTGGCCA TGCCGGCGAG CGAGCTCGAG
CCCGAGTGGC TGGTCGAAGG CGAGCACTAC ACGCGCACCA AGCTGCCCTC GGGCACGCTC
ATCGCGGCGC CCAACCGCGC CAACGAGCTG TTCACGCTCA GCTACGAGTT TGATTTTGGC
TACCAGCAGC GCCCGCTCTT GTGTCTGGCG CTGGAGCTGA TGGAGCAGTC GGGCATCCGG
GGCGAAGGGG CGATGTCGCC TGCCGAGCTC AAGCGCGCGC TGTTCGCCAT GGGCACCACG
GTGAGCGTGC GCTGCGGCGT CGACAGCGCC TCGCTCACGC TCTCGGGCAT CGACGACAAG
CTCGAGGACA GCGTGCGCCT GCTCGACGCC TGGCTGCACC GCCCGGCGCT CACGCAGGAC
ACCCGCGACA AGCTGGTCGC CAACATCCTC AGCCAGCGCA AAGACGAGCT CGAGGATCCG
CGCCAGATCG GCCGCGCGCT GGCCAATTTC GCCCGCTACG GCAAGAACAG CCCGTCGCTC
GTCGAGCCCT CCAATCAGGC GCTGCGCCGG GCCAACCTGC GCGAGCTCGG CCGTCTGCTG
GCGTCGCTGC CCAGCACCCG CCACCGCAGC TCGTATTTCG GACCGCGCGC GGCCGACGCC
GTGGCTGCGC AGGTGACGCT CGGCCGCCGG CATCGACCCG CGCCCAAGGT GCCGGCCGAG
AGCTTCCGCC GGGTGGCCGA CGGCCGCATC TTCTTCCTCG ACCAGAAGCG GGCCCAGGCC
GAGATCTCCA TCACCCTGCC CGAGAAGCCG CTGCCGGCCG AGGAGCGCGC GCTGGCGCGG
CTGTTCTCGG AGTACGTGGG CGGCGGCATG GGCGCCTTGA TCTTCCAGGA GATCCGCGAG
GCCCGCGGGC TCGCGTACTC GGCCTGGGGC TACTACGCGA CCGGGCGCCG GCCGCAGGAC
GCCGCGGCCG TGTTCGCGTC CATCGGCACC CAGGCCGACA AGACCTTCGA GGCGCTCAAG
GCCATGCTGC CGCTGCTGCG CCAGACGCCC CTGCAGCCGG CCCGCTTCGC CAGCGCCAAG
CGCAACCTGC TCGAGGAGTA CCGCACCAAT CGCGTGCTGC CGCGCGCGGT GCCCGACGCG
GTCAAGGGCT GGGACGATCT CGGCGAGGCC AGCGACCCGC GGCCGCGGAG CTGGGAGTTC
GTCCAGACGG CCGAAATCGA CCAGTTGGGC GAGTTCTCGC AGCGGCTGGG CAGCGAGCCG
CTGATCATCT CGATCATGGG CGACGCCGAG CGCATCGACA TGGACGCGCT CGGCAGCATC
GCGCCCATCG AGAAGGTCAC GGTCGAGCAG CTCTTCAGCT ACTGA
 
Protein sequence
MLGSMSPELA CSSAATNARR GRSFAIPLAA GLLLSLAACG EKSPPPEQPT PPQPSVEVET 
ADTPSDILAS SDLPEPLPAP LADDAMGVTV HRLANGLTVY ISTDRQTPRF TSWIAVRAGS
RHDPADSTGL AHYLEHMLFK GTGALGTIDA DAEAVHLLRI AELYDALRAT DDEGERGEIL
TAIDAETQKS ARFAVPNEFE QTYGKLGINR LNAFTSFDQT VYLSEVPSTR LEAWARVEAE
RFRNPRFRLF YPELEAVYEE KNRSLDNPAW RTFESMFQAL FPGHPYGSQS TIGLIEHLKV
PAYADMVAFF QRWYVPNNIA IVLAGDIDAE TALPVIEKYF SDWAPRALET PAAGELAPLS
ERVQRTVKAP GEAEVHLAWQ LVPANHEDEP ALYILDQLMD NATAGLIEVE LVLSQKLPDA
GAYTEIMREA GAWMMYGTAR EGQSLAEVEG LLLGVVEKLK AGDFTQEQLD AVKLNATIRE
MRELESNWAR VAKMTEAFVN HTPWSQAADR SERIKAVTRE DVIAVANTYL GDAYVAVYRE
KGEFTPPKIT KPQITPVAID PERQSAFAAE ILAMPASELE PEWLVEGEHY TRTKLPSGTL
IAAPNRANEL FTLSYEFDFG YQQRPLLCLA LELMEQSGIR GEGAMSPAEL KRALFAMGTT
VSVRCGVDSA SLTLSGIDDK LEDSVRLLDA WLHRPALTQD TRDKLVANIL SQRKDELEDP
RQIGRALANF ARYGKNSPSL VEPSNQALRR ANLRELGRLL ASLPSTRHRS SYFGPRAADA
VAAQVTLGRR HRPAPKVPAE SFRRVADGRI FFLDQKRAQA EISITLPEKP LPAEERALAR
LFSEYVGGGM GALIFQEIRE ARGLAYSAWG YYATGRRPQD AAAVFASIGT QADKTFEALK
AMLPLLRQTP LQPARFASAK RNLLEEYRTN RVLPRAVPDA VKGWDDLGEA SDPRPRSWEF
VQTAEIDQLG EFSQRLGSEP LIISIMGDAE RIDMDALGSI APIEKVTVEQ LFSY