Gene Hoch_6864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6864 
Symbol 
ID8549283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp9398777 
End bp9400615 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content72% 
IMG OID646391524 
Productoligoendopeptidase, pepF/M3 family 
Protein accessionYP_003271221 
Protein GI262200012 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.176327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGC CAAGCGATGC AGCCGAGGTC CCGCGGCCCA ACTGGGACAT GCGACCGTAC 
TTCACCTCAC CCGCCGGCCC CGACTACGCG GCCTTCTGGG ACATGTTGCG CACCAGCGTG
GCCGAGCTCG GCCACGCCCT CGACGCCCTG CCCGCCCTGA GCGAGGACAG CGCGGCGCAG
GACGCCTGGT CGGCCACCCT GCTCGAGCTC GAGGGCGTGC ACGCGCGCAT GCGGCACCTC
GAGTCGTATC TCGACTGCAT GGCCTCGGCC GACGCCCACG ACGAAGTCAT CCGCGCCGAC
GTCGGTCAAT TCGCGTCGCT ACAGGCGTCC TTCGCCGCGC TCGGCGTCCG GCTGCGCGCC
GCGCTCGGCC ACGCCGATGA CGGCGCCGTG GCCGCGCTGC TCGAGCGCCC CGAGCTGAGC
GCGGCCGAGC ACTGGCTGCG GCGCACCCGC GACAGCGCGC GCGAGTCCAT GAGCCCGGCG
CTCGAAGAGC TGGCCACCGA GCTCGGCGTC GACGGCATCG CCGCCTGGGG GCGGCTCTAC
GATCAGCTCT CGGGCACGCT GAGCTTCTCC CTCGAGGTCC CCGGCCGCGC GGCCGAAGAG
CACCCGGTGT CGATGGCGCG CACGCTGCTC GAAGACCCCG ATCCCGCGGT CCGCCAGGCC
GCGCTCGTGG GCTCGAACGC GGCCTGGGCC CGGGTGTCCG AGCCGGTGGC CGCGTGCCTC
AACGCCATCG CCGGCACCCG GCTCACGCTC TACCGCCGCC GCGGCGTCGG CCATTACCTC
GACCCCGCGC TGTTCGACGC CGCGATCACG CGCCGCACCC TCGACGCCAT GATGTCGGCC
GTGCGCGGCC GGCGCGCGCG CATGCAGCGC TACCTGGGCA TCAAGGCGCG CCTGCTCGGC
CGCGAGCGGC TGGGCTTTCA GGACCTGCTC GCGCCCCTGC CCGAGGACGC CGCGCCGCGC
ATCTCCTGGG ACCAGGCGCG CGAGCGCGTG CTCGCCGCCT TCGGCACCCA CTACCCCGCC
CTGCGCGAGT TCGCGGCCGA CGCCTTCGCC AAGCGCTGGC TCGATCACGA ACCGCGACGC
GGCAAGCGGC CCGGCGGCTT CTGCTCCAGC TCGCCCGTGA TCGGCGAGTC GCGCGTCTTC
ATGACCTATC ACGGCTCCAT GGGCGACCTC GAGACCCTAG CCCACGAGCT CGGTCACGCC
TTCCATAGCT GGGTGATGCG CGATATGCGC CCGTGGGCCC GGGTCTACCC CATGACCCTG
GCCGAGACCG CGTCGACCTT TGCCGAGCAG CTCGTGAGCG AGGCCATGCT CCAGGGCGGC
GACGCCGATC CCGCGACCCA GCGCGCGGTG CTCGACAGCC GGCTGCAAAA GGCCGCGGTG
TTTCTGCTCA ACATCCCCAT GCGCTTCGAC TTCGAGTGCG CGTTCTACGA GGCGCGGCAA
CACGGCGAGG TCGGCGTCAC CCGGCTGTGC GAGCTGATGC GCGCGGCCCA GCGCGACAAC
TACGGCGACG CGCTCGACCC CGATGCCCTC GACCCGTGGT TCTGGGCGTC GAAGCTGCAC
TTCTACATCA CCGAGCTCAG CTTCTACAAT TTCCCGTATA CCTTCGGGTA CTTGTTCAGC
CTCGGCATCT TCGCCCGGAC GCTCGGACAA GGACCGGAGG CGCTCGCGCG CTACGAGACG
CTGCTGCGCC GCACCGGCAG CGCCAGCGCC GAGCAGGTAG CCCGCGAAGG CCTCGGCGTC
GATCTCGAGA GCCCCGACTT CTGGAACGCC TCGCTCGATG TCATCGAGGC CGACCTGGGC
CGCTTCGAGC AGCTCAGCGG CAAAGGCAGC GGCTCGTAA
 
Protein sequence
MNQPSDAAEV PRPNWDMRPY FTSPAGPDYA AFWDMLRTSV AELGHALDAL PALSEDSAAQ 
DAWSATLLEL EGVHARMRHL ESYLDCMASA DAHDEVIRAD VGQFASLQAS FAALGVRLRA
ALGHADDGAV AALLERPELS AAEHWLRRTR DSARESMSPA LEELATELGV DGIAAWGRLY
DQLSGTLSFS LEVPGRAAEE HPVSMARTLL EDPDPAVRQA ALVGSNAAWA RVSEPVAACL
NAIAGTRLTL YRRRGVGHYL DPALFDAAIT RRTLDAMMSA VRGRRARMQR YLGIKARLLG
RERLGFQDLL APLPEDAAPR ISWDQARERV LAAFGTHYPA LREFAADAFA KRWLDHEPRR
GKRPGGFCSS SPVIGESRVF MTYHGSMGDL ETLAHELGHA FHSWVMRDMR PWARVYPMTL
AETASTFAEQ LVSEAMLQGG DADPATQRAV LDSRLQKAAV FLLNIPMRFD FECAFYEARQ
HGEVGVTRLC ELMRAAQRDN YGDALDPDAL DPWFWASKLH FYITELSFYN FPYTFGYLFS
LGIFARTLGQ GPEALARYET LLRRTGSASA EQVAREGLGV DLESPDFWNA SLDVIEADLG
RFEQLSGKGS GS