Gene Hoch_5038 details

Gene Information

Locus tag: Hoch_5038
Symbol:
ID: 8547448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6950940
End bp: 6953774
Gene Length: 2835 bp
Protein Length: 944 aa
Translation table: 11
GC content: 71%
IMG OID: 646389713
Product: peptidase M28
Protein accession: YP_003269419
Protein GI: 262198210
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
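
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 6950940
END_BP = 6953774
GENE_LENGTH_BP = 2835
PROTEIN_LENGTH_AA = 944


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced CDS whose last codon is the stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions both checks hold: 6953774 - 6950940 + 1 = 2835 bp, and 2835 / 3 - 1 = 944 aa.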


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.705528
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACACG TACATCTTGA GCGACGCCCT GCGGCGTCCA TCCTGCCCGT CCTGGGCCTG 
CTGCTCGGCG CTTGCGCGTC CGCGCCGGCC GGAGATCCGC AAGAGCCGGC GGCGCCCGCC
GACCAGGCTC AGGCACCAGC CCAGCCCAAC CCGGCGCTGC GCGACGCCCG CGAGGTGCAC
ATCGCCGATA TCGTGCAGCT CACCGACGGC GGCGAGAACG CCGAGGCGTA CTGGTCCTTC
GACGGCGACG AGCTGATCTT CCAGAGCAAG CGGCCGCCCT ACGAATGCGA TCAGATCATG
CGCATGCCGG CCGATGGCTC GGCCGAGCCG AGGGTGGTGT CCACGGGCAA GGGGCGCACG
ACCTGCGCCT ACTTCCTGCC CGGCGATGAC GACATCGTCT ACTCCTCGAC CCACGAGCTC
GGCCCCGAGT GCCCGCCCGA GGCCGATATG TCGCAGGGCT ACGTGTGGTC GCTCTACGAC
TACGATATCT ATCGCGCCAA GGCCGATGGC TCGGAGCTGG TCAATCTCAC CCAGCGCGAC
GGCTACGACG CCGAAGCCAC GGTGTGCGCC AAGGACGGGA CCATCCTCTT CACCTCCGAC
CGCGACGGCG ACCTGGAGCT GTATCGGATG GATCCCAACG GCGAAAACGT CGTGCGACTC
ACCAACACGC CCGGCTACGA CGGCGGCGCG TTCTTCTCGC AGGACTGCTC GCAGATCGTG
TGGCGGGCCT CGCGGCCGCA GGGCGAGGCG CTGGCCGACT TTCAGCGGCT GCTCGGCGAG
GGCCTGGTGC GCCCCAGTCT GCTCGAGATC TACGTCGCCG ATGCCGACGG CCAGAACGCG
CGCCAGATCA CCTATCTCGG CGCGGCCGCG TTCGCGCCCT ATTTCCATCC CTCGGGCAAG
CGGGTGCTGT TTTCGACCAA CTATCCCAAC CCGCGCGGAC GCGAGTTCGA TATCTGGTCC
GTGAACACCG ACGGCACCGG TCTCGAGCAG ATCACGTTCA GCGAGGGCTT CGACGGCTTC
CCCATGTTCT CGCCCGACGG CACCAAGCTG GCGTTCGCGT CCAACCGCAA CAACAGCAAG
CCGGGCGAGA CCAACGTCTT TGTCGCCCGC TGGGTCGAAG ACGCCGCCCC GGTCGAGAAC
GCCCGCCAGC CGGGCGCGGC CGATCGTTTC CGCGCCGACG TCGCCTGGCT CGCCGACGAC
GCTCGCGAGG GCCGCGGGAT CGGCACCGGC GGCCTCGACG CCGCGGCCGA TTGGTTGGTC
GAGCAACTGT CCGAAATCGG CGCCGAGGGC GCGGCCGATG ACGAGGGCGG TTACCGCCAG
GGCTTCGAGG TCACGACCGC GATCACGCCG GGCGCGGCCA CGGCGCTGCG CATCGACCGC
AAGCCGGTGC CGGCCGAGGC CTTTGTGCCG CTGTCGCAGT CGGCGCCCGG GCGGGTGCGC
GGCCAGACCG TGTTCGTCGG CTATGGCATC GTCGCCGACG AGCTCGGCGT GGACGACTAC
AAGGGCAAGA ACGTGCGCGG CAAGATCGCG GTCGTGCGCC GCTTTGCGCC CGCGGGCGCG
CCCTTTGACG ACGAGGCCGT GCAGCGGCGC TACAGCGACC TGGCGTACAA GGCCTTCATC
GCCCGGCAGA AGGGCGCGCG CGGCCTGATC ATCGTCGACG CCCCGCCGGC TGCGCAGGAC
GGGGCCGAAT TGCCGCCCGA CGCGCCGCTG CCGCTGCTCG CGCCCACGGG CTCGGGTGAC
GCCGGCATTC CCATCGTCGT GCTCACGCGC GCGGCCGGCG CCGCGCTCAC CAGCGGCCGT
CACCAGGTCG AGCTGAGCGT GGCCCTCGAG GCCGAGAAGC GCCGGGTCGA CAACATCGTC
GCCAAGATCC CGGCCGGCAA CCCCGACGGC GGCGGCGCGG TGCTGGTCGG CGCGCACTAC
GATCACCTCG GCATGGGCGG CGCCGGTTCG CTCGAGGTCG GGGCCACGGT GGTGCACAAC
GGCGCTGACG ACAACGCCTC GGGGACCGCG GGTCTGCTCG AGGTCGCGCG CCAGCTCCAC
GCGCGCCGGG CCGAGCTGCG CCGCGACGTC TACCTGGTCG CGTTCACGGC CGAGGAGAGC
GGCATCATCG GCTCGCGTTA CTTCACCGAG CACCCGCCCG CGGGCCTGCG CATGGACGGC
CTCACCGCCA TGCTCAACAT GGACATGATC GGCCGCATGC GCGGCAATCG GGTGTCGGTC
ATGGGCGTGC AGACCGCGGC CGAGTGGGAG GCCACCGTGG CGCCGCTGTG CGCGGCCGCG
CGCGTCGATT GCACCCTGGG CGGCGACGGC TACGGCCCCT CCGACCACAT GCCCTTCTAC
ACTTCGGGCG TGCCGGTGTT GTTCTTCTTC ACCGGCGCGC ATCCTGACTA CCACCGCGCC
AGCGACGACA TCGCGCACAT CAACGCCGGC GGCGGTGCGC GCATCGCCCA GCTTGTCGGC
GAGGTCGCGG TGGCCGCCGC CACCGGGCCG GCCAAGCTCA GCTACCAGCG CGCGCCCGTG
CCCGAGCGCC AGGGCGACGT GCGCGCCCAG GGCGGCTCGC TCGGCACCGT GCCGGCCTAC
GGCGAAGAGG GCAAGATCCC CGGCGTGTTG CTCAGCGACG TGCGACCCGA GGGCCCGGCC
GCGCGAGCCG GCCTGCGCGC CGGCGACCGC ATCGTGGCCA TCGGCGAGGT CGATGTGCGC
AACATCCGCG ACCTGATGTT CGTGCTGCGC GCCGCCGTCC CCGGGCAGAA GGCGACTATC
GTCGTGTCGC GCGACGGCGA GCGGGTGTCG CTCGAGGCGA TCTACGGCGC GCCCAGCCCG
CGCATCTCCC GCTGA
 
Protein sequence
MRHVHLERRP AASILPVLGL LLGACASAPA GDPQEPAAPA DQAQAPAQPN PALRDAREVH 
IADIVQLTDG GENAEAYWSF DGDELIFQSK RPPYECDQIM RMPADGSAEP RVVSTGKGRT
TCAYFLPGDD DIVYSSTHEL GPECPPEADM SQGYVWSLYD YDIYRAKADG SELVNLTQRD
GYDAEATVCA KDGTILFTSD RDGDLELYRM DPNGENVVRL TNTPGYDGGA FFSQDCSQIV
WRASRPQGEA LADFQRLLGE GLVRPSLLEI YVADADGQNA RQITYLGAAA FAPYFHPSGK
RVLFSTNYPN PRGREFDIWS VNTDGTGLEQ ITFSEGFDGF PMFSPDGTKL AFASNRNNSK
PGETNVFVAR WVEDAAPVEN ARQPGAADRF RADVAWLADD AREGRGIGTG GLDAAADWLV
EQLSEIGAEG AADDEGGYRQ GFEVTTAITP GAATALRIDR KPVPAEAFVP LSQSAPGRVR
GQTVFVGYGI VADELGVDDY KGKNVRGKIA VVRRFAPAGA PFDDEAVQRR YSDLAYKAFI
ARQKGARGLI IVDAPPAAQD GAELPPDAPL PLLAPTGSGD AGIPIVVLTR AAGAALTSGR
HQVELSVALE AEKRRVDNIV AKIPAGNPDG GGAVLVGAHY DHLGMGGAGS LEVGATVVHN
GADDNASGTA GLLEVARQLH ARRAELRRDV YLVAFTAEES GIIGSRYFTE HPPAGLRMDG
LTAMLNMDMI GRMRGNRVSV MGVQTAAEWE ATVAPLCAAA RVDCTLGGDG YGPSDHMPFY
TSGVPVLFFF TGAHPDYHRA SDDIAHINAG GGARIAQLVG EVAVAAATGP AKLSYQRAPV
PERQGDVRAQ GGSLGTVPAY GEEGKIPGVL LSDVRPEGPA ARAGLRAGDR IVAIGEVDVR
NIRDLMFVLR AAVPGQKATI VVSRDGERVS LEAIYGAPSP RISR
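
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a hedged example, not the database's own tooling: it builds the codon table from the usual TCAG ordering and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in the permitted start codons). The `translate` helper is a hypothetical name introduced here.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code, so this one string covers both.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored, and
    any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGAGACACG TACATCTTGA GCGACGCCCT"))  # MRHVHLERRP
print(translate("CGCATCTCCC GCTGA"))                  # RISR*
```

The first call reproduces the opening ten residues of the protein sequence, and the second reproduces its final four residues followed by the stop codon.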