Gene Hoch_5011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5011 
Symbol 
ID8547421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6912055 
End bp6914499 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content71% 
IMG OID646389687 
Producthypothetical protein 
Protein accessionYP_003269393 
Protein GI262198184 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0852589 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA ATTCTCGGTG GTGTTTCCGT CTGGCGCCGG TGGCGCTGCT GACGGTGTGG 
GGCGCGCTCG CGGGCGAGGC TCAGGCACAG GTCCCCAGTC CGGTCCAGGC TCCGGTGGCT
CAGGTTCCGG TGGCCGCGGG GCCGGTGGCG AAGGATGAGG CGCGCATGGT GCGCGGTCTG
CTCAGCGCGC CGTCGCAGGC GGGGCCGGCG GCGGTGGCGC AGGCCTTTGT CGCGTCCCGG
CCCGGGCCGC TGGCCGCGGT CGATCCCAGC ACCCTGGAGG CGCCGGAGCT GCGCGCGCAG
GCCGACGGTC ATCTGGTCCG CTACGGGCAG CGCTACCGCT CGCTGCGCGT GGTGGGCGGC
GACACCCAGG TGCGCGTCGA CGCGCTCGGC CGGGTGCGCT GGCTAAAATC GGACGCTCGT
CCGGCGACTG AGCTGCGCGC ACTCGACGAG CAGATCGCGC GCGGCCCGGC CCTCGATGGC
GCCGCCGCGC GCGCGCGCTT TGCCCGCGCG GCCGGGTACA GCGACGCGCT GTTGGCGGCC
GCCGGCGCGG CGCCCGAGCT GGTCGTGTAC GCGGCCGGCG TGGCCCGAGC CGGCGCTGCG
AGCGCGGCCG CCGGGGGCGG TCTGGCGCCG CGGCTGGCCT ACCGGGTCGA GCTGCCCTAC
CACCCGCAGC GCCGGCACAA GCTGCGCGCC TATATCGACG CCGACAGCGG CTTCGCCCTG
GCGGTCGACA ACCTCATCCG CACGCAGAGC GCGCCCGCGT GTCCGGGGGC TCAAGACGCC
TACGTCTACG AGACCAATCC CGCCGACTCG GCGCTCACCT GCGTGTCGCT CTCGCCGTAT
CTCGCCGAGG ACGCGAGCGA GCTGGCCAAC GGCGACGTGC GCGTGCTCAA CTGCCTCGAC
CGCAACGGCT GCTTCAGCGC CGGCGGCACG CTGTACCACT TCTGCGACAT CGAGAGCGTG
GCGGCCGCCG ACCAGGCCGG GCATTTCACG GCCTACCAGT TCGAGAGCGA CACCGCCGCG
GAGGACGCCT TCGCCGAGGT GCAGATGTTC TACCACGTCA ACGTGGTCTA CGAGCTGGCG
CGCACGCTGG GCGGGTTCGA CGACCTCGAC GTCAAGCCGC TCGACGCCAT CGTCAATTTG
CGGCTGCCGA GCTTCGAAAC CAGCTCGCAG TGCAGCTCGC CGAGCTACAC CGGCGACGAG
GCCCTGGAGG CGTTTAGCAA CGCCGCTTTC GTGCCCAAGG ACGGCTTCTT TCCCGGTTTT
CCCGAGGACG ATTCGATCAT GTTCGGACAG GGGCCGGAGC GCGACTACGC GTACGACGGC
GACGTCATCT ATCACGAATT CGGACACGCG GTCATGGCCA AGATCGCGCC CGACCTGCCG
AGCTTCTTCA TCGATGGTCT GGGCCTCAAC TCGATGCCCG GCGGCATGCA CGAGGGTTAT
GCGGACCTCA TGACCATCTT CGTGACCGAC GACCCCGAGG TCGGCGAGTA CGTGGCCGGC
GAGTTCGGAT CCGATGCGGT GCGCGATGTC GAGAACGGCT ACACGTGTCC CTTGGGCCTC
ACCGGCGAGG TTCACGACGA TTCGCTGCCG TTTACCGGGG CGATGTGGGA GGCGCGCGAG
GCGGTGGCGT CCACGGCCGA GCGCAAGCGC AGCTTCGACC AGGCGGTGTT CGCGGCGCAG
GCGACGCTGG GCTCGGGCGA TGATTTTGGC GACGCGGCCG AGCGCACGGT CGCCGAGGTC
GCGCAGGCGC TGGACGCACA GGCCGCGGCC ACGGTCGAGT CGGTGTTCAC GGCGCGCGGC
CTGCTGGGCG ACGACACACG CGGCCCGTGC TCGGATCGCG TCATCGACGC CGCCGAGCTC
ACCGATCTGC CGTATCTGTA CCTGGTCGGC ATCGAGTACT TCGGCGGCAT CAACCAGGTG
CCGGGGCCGG TGCAGTTTCG CTACGAGCTG AGCGAGCGCG CCGGCGCGAT CCTGCTCGAC
ATCGCCGTGG CCGCGCCCGA CGCGGGCCTG GTGGGGCCGG AGGGGCAGTT CGAGCCTCTG
CTCGAGGTGC TGGTCAATGA GAGCGAGACG CCGATCATCT GGGGCGGCGT GCCGACCGGT
GTGGCGGCGG CGACCAGCAG CGAGCCCGAG CCCGTGGAGT TCGAGGACAT CCCCGACCGT
CCCGGGTTGA CGAGCGCGAC GGTGAAGATT CCGGGCCCCT TCGAGCCCGG CGTGTATCAC
CTGCAGTTCA CCAACCGGGG CCGGACCTGG CTGGTCGCCG GTCTGCACGT GTACTCGACG
CCGCCGAAGG AGGGCGGCTG TCAAGTCTCG CCCGGCGGCC GCAGCCGCGA CATCGGCATG
GGGATTCTGT TTCTGCTCGG CGTGATCGCG ATGTTCGTGC GCTGGCGGCG TTCGGCCGGG
CGCATGCGCC GGCTGCGCCG CGGTTCGGAC GAGCGCGAGC GCTGA
 
Protein sequence
MTMNSRWCFR LAPVALLTVW GALAGEAQAQ VPSPVQAPVA QVPVAAGPVA KDEARMVRGL 
LSAPSQAGPA AVAQAFVASR PGPLAAVDPS TLEAPELRAQ ADGHLVRYGQ RYRSLRVVGG
DTQVRVDALG RVRWLKSDAR PATELRALDE QIARGPALDG AAARARFARA AGYSDALLAA
AGAAPELVVY AAGVARAGAA SAAAGGGLAP RLAYRVELPY HPQRRHKLRA YIDADSGFAL
AVDNLIRTQS APACPGAQDA YVYETNPADS ALTCVSLSPY LAEDASELAN GDVRVLNCLD
RNGCFSAGGT LYHFCDIESV AAADQAGHFT AYQFESDTAA EDAFAEVQMF YHVNVVYELA
RTLGGFDDLD VKPLDAIVNL RLPSFETSSQ CSSPSYTGDE ALEAFSNAAF VPKDGFFPGF
PEDDSIMFGQ GPERDYAYDG DVIYHEFGHA VMAKIAPDLP SFFIDGLGLN SMPGGMHEGY
ADLMTIFVTD DPEVGEYVAG EFGSDAVRDV ENGYTCPLGL TGEVHDDSLP FTGAMWEARE
AVASTAERKR SFDQAVFAAQ ATLGSGDDFG DAAERTVAEV AQALDAQAAA TVESVFTARG
LLGDDTRGPC SDRVIDAAEL TDLPYLYLVG IEYFGGINQV PGPVQFRYEL SERAGAILLD
IAVAAPDAGL VGPEGQFEPL LEVLVNESET PIIWGGVPTG VAAATSSEPE PVEFEDIPDR
PGLTSATVKI PGPFEPGVYH LQFTNRGRTW LVAGLHVYST PPKEGGCQVS PGGRSRDIGM
GILFLLGVIA MFVRWRRSAG RMRRLRRGSD ERER