Gene Hoch_5061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5061 
Symbol 
ID8547472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6979771 
End bp6981738 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content67% 
IMG OID646389737 
ProductThimet oligopeptidase 
Protein accessionYP_003269442 
Protein GI262198233 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCAG CGACTGATAC TCCCGCCGCG TCCCCGTCCG CCGAGGCATT CGCCGCGCGC 
TGCGACGAGG GCTTGCAGCG CGCCAAGGCG CTGCTGCCGA CGATCATCGA CGTCTCGGGT
CCGCGCACCC TGGAGAACAC GCTCGAGCCG TACAACGACA TGCTGATGCA TGCCGAGCGC
ACGGCCGGCC TGGCCGGTCT GATGTCATCC GTCCACCCGG ACGAAGTCAT CCGAGAGGCC
GCGCGCGCCA GCGAACAGGC CGTGGACGCC TTCATGTCGG GCCTCAGCCG CGACCGCCGC
CTGTACGAGG CCATCGCCGA GCTGGCCAAA TCGCCCGCGC ACGACTCCTT CGACGCCGAG
ACCCAGCGGC TCATCGAGCG CGGCCTGCGC GACTTCCACC GCAGCGGCGT CGACCGCGAC
GAGGCCACCC GCGAGCGCTT GCGCGCGATC GACGACCAGC TCACCAAGCT CGGACAGGCG
TTCCAAAAAG CCATCGTCGA CGACGTCCGC CACGTCGATG TCAACGATGT CTCGGAGCTG
GCCGGATTGC CCGAGGACTT CGTGCGCTCG CATCCGCCGC AGGAGGACGG CCGCATCCGC
ATCTCGACCG ACTTCCCCGA CTACCTGCCG GTGATGGCCT ACGCCGAGAA CGGCGACCTG
CGGCGCGCGC TGTACGTCTG CTACAAGGCC CGCGGCGGCG CGGCCAACGA GGCCACGCTC
AAGCAGATCC TGGTGCTGCG CGCGGAAAAA GCCAAGCTGC TCGGCTACGA GAGCTGGGCC
GACTACGCCA CCGAGAACAA GATGATGAAG AGCGGCGCCA ACGCCGCCGC GTTTATCGAC
CGCGTGGCCA AGATCGCTCG CCCGCGCGCC GAGCGCGACT ATCAGGAGCT GCTCGCGCGC
AAGCGCAAGC AGCACCCCGA TGCCCAGGGC GTGGACGACT TCGAGAAATA CTTCTACGAG
ACCAAGGTCA AAACCGAGTC GTACGCCTTC GACCCGCAGT CGGTGCGCCC GTACTTCGAG
TACGCTCGCG TCGAGCGCGG CCTGCTCGAT ATCACCGCGC GCATCTACGA CATCGCCTAC
GAGCCGGTGG CGCCCGAGCA GGTCGCCGAG CTGTGGCACC CCGACGTCAA GGTCTTCGAC
GTCACCCGCG AGGGCGAGGT GCTCGGCCGC ATCTTCCTCG ACATGCACCC GCGCGAGGGC
AAATACAAAC ACGCGGCCCA GTTCCCCTAC CGCTCGGGCG TACGCGGCAA GCAGATGCCC
GAGGGCGTGC TGGTGTGCAA CTTCCCCAAC CCGCGCACCA CCGACGGTCC GGCGCTGATG
GAACACCCCG ACGTGGTCAC CATGTTCCAC GAATTCGGAC ACCTGATGCA CCACGTGCTC
GGCGGCCACA AGCGCTGGAT CGAGCAGAGC GGCGTGGCCA CCGAGTGGGA CTTCGTCGAG
GCGCCCTCGC AGATGTTCGA GGAGTGGGCG TGGTCCCACG AGACCCTGGC CACCTTTGCC
CTGCACCACG AGACCGGCGA GCCCCTGCCC GAGGAGACCG TGCAGCGCAT GCGCCGGGCC
GACACCTTCG GCCTGGGGCT CAACACCATG CAGCAGATGT TCTACGCGTC CATCTCGCTC
GAGTTCCACA CCGTGGCTCC CGAAGAGCTC GACATGAACG CGGCCGTACA GCGGCTACAG
AGCGCGTACA CGCCCTTCCC CTTCGTCGAG GGCACCTGCT TCCACGCCAA CTTCGGACAC
CTGAACGGTT ACTCGGCCCT GTACTACACG TACATGTGGT CGCTGGTGAT CGCCAAGGAC
CTGCTCACGC CATTCAAAGA GCACGGCCTG CTCAATCTGG AGTGGACGCA CCGCTATCGC
GACCACGTCC TGGCGCCGGG CGGCAGCAAG GACGCCGCCG TGCTGGTCGA GGAGTTTCTC
GGCCGCACGT ACAAATTCGA CGCCTTCGAA GCCTATCTCG CCGGCTGA
 
Protein sequence
MPPATDTPAA SPSAEAFAAR CDEGLQRAKA LLPTIIDVSG PRTLENTLEP YNDMLMHAER 
TAGLAGLMSS VHPDEVIREA ARASEQAVDA FMSGLSRDRR LYEAIAELAK SPAHDSFDAE
TQRLIERGLR DFHRSGVDRD EATRERLRAI DDQLTKLGQA FQKAIVDDVR HVDVNDVSEL
AGLPEDFVRS HPPQEDGRIR ISTDFPDYLP VMAYAENGDL RRALYVCYKA RGGAANEATL
KQILVLRAEK AKLLGYESWA DYATENKMMK SGANAAAFID RVAKIARPRA ERDYQELLAR
KRKQHPDAQG VDDFEKYFYE TKVKTESYAF DPQSVRPYFE YARVERGLLD ITARIYDIAY
EPVAPEQVAE LWHPDVKVFD VTREGEVLGR IFLDMHPREG KYKHAAQFPY RSGVRGKQMP
EGVLVCNFPN PRTTDGPALM EHPDVVTMFH EFGHLMHHVL GGHKRWIEQS GVATEWDFVE
APSQMFEEWA WSHETLATFA LHHETGEPLP EETVQRMRRA DTFGLGLNTM QQMFYASISL
EFHTVAPEEL DMNAAVQRLQ SAYTPFPFVE GTCFHANFGH LNGYSALYYT YMWSLVIAKD
LLTPFKEHGL LNLEWTHRYR DHVLAPGGSK DAAVLVEEFL GRTYKFDAFE AYLAG