Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_5038 |
Symbol | |
ID | 8547448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 6950940 |
End bp | 6953774 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646389713 |
Product | peptidase M28 |
Protein accession | YP_003269419 |
Protein GI | 262198210 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.705528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACACG TACATCTTGA GCGACGCCCT GCGGCGTCCA TCCTGCCCGT CCTGGGCCTG CTGCTCGGCG CTTGCGCGTC CGCGCCGGCC GGAGATCCGC AAGAGCCGGC GGCGCCCGCC GACCAGGCTC AGGCACCAGC CCAGCCCAAC CCGGCGCTGC GCGACGCCCG CGAGGTGCAC ATCGCCGATA TCGTGCAGCT CACCGACGGC GGCGAGAACG CCGAGGCGTA CTGGTCCTTC GACGGCGACG AGCTGATCTT CCAGAGCAAG CGGCCGCCCT ACGAATGCGA TCAGATCATG CGCATGCCGG CCGATGGCTC GGCCGAGCCG AGGGTGGTGT CCACGGGCAA GGGGCGCACG ACCTGCGCCT ACTTCCTGCC CGGCGATGAC GACATCGTCT ACTCCTCGAC CCACGAGCTC GGCCCCGAGT GCCCGCCCGA GGCCGATATG TCGCAGGGCT ACGTGTGGTC GCTCTACGAC TACGATATCT ATCGCGCCAA GGCCGATGGC TCGGAGCTGG TCAATCTCAC CCAGCGCGAC GGCTACGACG CCGAAGCCAC GGTGTGCGCC AAGGACGGGA CCATCCTCTT CACCTCCGAC CGCGACGGCG ACCTGGAGCT GTATCGGATG GATCCCAACG GCGAAAACGT CGTGCGACTC ACCAACACGC CCGGCTACGA CGGCGGCGCG TTCTTCTCGC AGGACTGCTC GCAGATCGTG TGGCGGGCCT CGCGGCCGCA GGGCGAGGCG CTGGCCGACT TTCAGCGGCT GCTCGGCGAG GGCCTGGTGC GCCCCAGTCT GCTCGAGATC TACGTCGCCG ATGCCGACGG CCAGAACGCG CGCCAGATCA CCTATCTCGG CGCGGCCGCG TTCGCGCCCT ATTTCCATCC CTCGGGCAAG CGGGTGCTGT TTTCGACCAA CTATCCCAAC CCGCGCGGAC GCGAGTTCGA TATCTGGTCC GTGAACACCG ACGGCACCGG TCTCGAGCAG ATCACGTTCA GCGAGGGCTT CGACGGCTTC CCCATGTTCT CGCCCGACGG CACCAAGCTG GCGTTCGCGT CCAACCGCAA CAACAGCAAG CCGGGCGAGA CCAACGTCTT TGTCGCCCGC TGGGTCGAAG ACGCCGCCCC GGTCGAGAAC GCCCGCCAGC CGGGCGCGGC CGATCGTTTC CGCGCCGACG TCGCCTGGCT CGCCGACGAC GCTCGCGAGG GCCGCGGGAT CGGCACCGGC GGCCTCGACG CCGCGGCCGA TTGGTTGGTC GAGCAACTGT CCGAAATCGG CGCCGAGGGC GCGGCCGATG ACGAGGGCGG TTACCGCCAG GGCTTCGAGG TCACGACCGC GATCACGCCG GGCGCGGCCA CGGCGCTGCG CATCGACCGC AAGCCGGTGC CGGCCGAGGC CTTTGTGCCG CTGTCGCAGT CGGCGCCCGG GCGGGTGCGC GGCCAGACCG TGTTCGTCGG CTATGGCATC GTCGCCGACG AGCTCGGCGT GGACGACTAC AAGGGCAAGA ACGTGCGCGG CAAGATCGCG GTCGTGCGCC GCTTTGCGCC CGCGGGCGCG CCCTTTGACG ACGAGGCCGT GCAGCGGCGC TACAGCGACC TGGCGTACAA GGCCTTCATC GCCCGGCAGA AGGGCGCGCG CGGCCTGATC ATCGTCGACG CCCCGCCGGC TGCGCAGGAC GGGGCCGAAT TGCCGCCCGA CGCGCCGCTG CCGCTGCTCG CGCCCACGGG CTCGGGTGAC GCCGGCATTC CCATCGTCGT GCTCACGCGC GCGGCCGGCG CCGCGCTCAC CAGCGGCCGT CACCAGGTCG AGCTGAGCGT GGCCCTCGAG GCCGAGAAGC GCCGGGTCGA CAACATCGTC GCCAAGATCC CGGCCGGCAA CCCCGACGGC GGCGGCGCGG TGCTGGTCGG CGCGCACTAC GATCACCTCG GCATGGGCGG CGCCGGTTCG CTCGAGGTCG GGGCCACGGT GGTGCACAAC GGCGCTGACG ACAACGCCTC GGGGACCGCG GGTCTGCTCG AGGTCGCGCG CCAGCTCCAC GCGCGCCGGG CCGAGCTGCG CCGCGACGTC TACCTGGTCG CGTTCACGGC CGAGGAGAGC GGCATCATCG GCTCGCGTTA CTTCACCGAG CACCCGCCCG CGGGCCTGCG CATGGACGGC CTCACCGCCA TGCTCAACAT GGACATGATC GGCCGCATGC GCGGCAATCG GGTGTCGGTC ATGGGCGTGC AGACCGCGGC CGAGTGGGAG GCCACCGTGG CGCCGCTGTG CGCGGCCGCG CGCGTCGATT GCACCCTGGG CGGCGACGGC TACGGCCCCT CCGACCACAT GCCCTTCTAC ACTTCGGGCG TGCCGGTGTT GTTCTTCTTC ACCGGCGCGC ATCCTGACTA CCACCGCGCC AGCGACGACA TCGCGCACAT CAACGCCGGC GGCGGTGCGC GCATCGCCCA GCTTGTCGGC GAGGTCGCGG TGGCCGCCGC CACCGGGCCG GCCAAGCTCA GCTACCAGCG CGCGCCCGTG CCCGAGCGCC AGGGCGACGT GCGCGCCCAG GGCGGCTCGC TCGGCACCGT GCCGGCCTAC GGCGAAGAGG GCAAGATCCC CGGCGTGTTG CTCAGCGACG TGCGACCCGA GGGCCCGGCC GCGCGAGCCG GCCTGCGCGC CGGCGACCGC ATCGTGGCCA TCGGCGAGGT CGATGTGCGC AACATCCGCG ACCTGATGTT CGTGCTGCGC GCCGCCGTCC CCGGGCAGAA GGCGACTATC GTCGTGTCGC GCGACGGCGA GCGGGTGTCG CTCGAGGCGA TCTACGGCGC GCCCAGCCCG CGCATCTCCC GCTGA
|
Protein sequence | MRHVHLERRP AASILPVLGL LLGACASAPA GDPQEPAAPA DQAQAPAQPN PALRDAREVH IADIVQLTDG GENAEAYWSF DGDELIFQSK RPPYECDQIM RMPADGSAEP RVVSTGKGRT TCAYFLPGDD DIVYSSTHEL GPECPPEADM SQGYVWSLYD YDIYRAKADG SELVNLTQRD GYDAEATVCA KDGTILFTSD RDGDLELYRM DPNGENVVRL TNTPGYDGGA FFSQDCSQIV WRASRPQGEA LADFQRLLGE GLVRPSLLEI YVADADGQNA RQITYLGAAA FAPYFHPSGK RVLFSTNYPN PRGREFDIWS VNTDGTGLEQ ITFSEGFDGF PMFSPDGTKL AFASNRNNSK PGETNVFVAR WVEDAAPVEN ARQPGAADRF RADVAWLADD AREGRGIGTG GLDAAADWLV EQLSEIGAEG AADDEGGYRQ GFEVTTAITP GAATALRIDR KPVPAEAFVP LSQSAPGRVR GQTVFVGYGI VADELGVDDY KGKNVRGKIA VVRRFAPAGA PFDDEAVQRR YSDLAYKAFI ARQKGARGLI IVDAPPAAQD GAELPPDAPL PLLAPTGSGD AGIPIVVLTR AAGAALTSGR HQVELSVALE AEKRRVDNIV AKIPAGNPDG GGAVLVGAHY DHLGMGGAGS LEVGATVVHN GADDNASGTA GLLEVARQLH ARRAELRRDV YLVAFTAEES GIIGSRYFTE HPPAGLRMDG LTAMLNMDMI GRMRGNRVSV MGVQTAAEWE ATVAPLCAAA RVDCTLGGDG YGPSDHMPFY TSGVPVLFFF TGAHPDYHRA SDDIAHINAG GGARIAQLVG EVAVAAATGP AKLSYQRAPV PERQGDVRAQ GGSLGTVPAY GEEGKIPGVL LSDVRPEGPA ARAGLRAGDR IVAIGEVDVR NIRDLMFVLR AAVPGQKATI VVSRDGERVS LEAIYGAPSP RISR
|
| |