Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3870 |
Symbol | |
ID | 8546265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5324900 |
End bp | 5326075 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646388541 |
Product | Peptidase M75, Imelysin |
Protein accession | YP_003268262 |
Protein GI | 262197053 |
COG category | [R] General function prediction only |
COG ID | [COG3489] Predicted periplasmic lipoprotein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.755425 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.516703 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACTCG CCACCCGCTC TGTGCTCGCC CCGTATCTGC TGCCCCTGGC GCTGTTGGTG TCGGCTGCGC CCGCGTGCTC GGACGAGGGC TCGCCCAGCA AGGGCCCCGA CGCCAACCGC GGCCCCGACG TCCAGGACGT GCTGCGCGAC CTGGCCAACG TCGTGATCGT GCCCGCCTAC GACGACTTCC GCGCCAGCGC CGAGCAGCTC GAGGCCGCCA CCCGCAGCCT GTGCGCGGCC CCGGACGCGG CCCAGCTCAC CGCCCTGCGC GGCCAGTGGC GCGACACCCG GGCGCTGTGG AAGCGCGCCG AGGCGCACGA ATTCGGACCG GCGGCCGATC TCCGCATCGA CACCGCCGTG GACTTCTGGC CGGTGCGCGC CAGCTCCGTG GACATCGAGC TGGCCAAGAC CGACCCGGTG CCCGAGGACT ACGCCACCAC CCTGGGCGAC ACGCTCAAGG GCCTGCCGGT GATGGAGTAC ATCCTCTACG ACGGCGCCAG CGCGGCCGAC GACGCCGACA CCGAGAGCGT GCTGGCGCGC CTGGTCGACG CCGAGACCGG CGAACCCACG CGCACCTGCG CGTATCTGGT GGCGCTCAGC GTGGATGTGC ACGCCAAAGC CACGACCCTG TACCAGGCCT GGGCGCCCGA GGGCGAAAAC TTCGCCGCCG AGCTGGCCAC CGCCGGCCAG GGCAGCACCG CCTACCCCGA CCGCGCCAAG GCCGTGAGCG CGGTGGTCAA CGACTTCGTC TACCTCATCC AGGAAGTCGA GAGCGTCAAG CTCGCCGAGC CCCTGGGCAA ACGCGCCGGC GACGTGCCGC AGCCAGACGC GGTCGAGTCA GCCCGCAGCG ACAGCTCGCG CGCCGACATC GCCGCCAACC TCGCCGGCGT GCGCGCGGTC TACACCTGCA CCCGCGGCGA CGCCACGGGC GCCAGCTTCC AGGCCGCCGT GGCCGCGCTC AATCCCGAGC TCGACGCCGC CATCATGGCG CAGCTCGACG ACGCCGACGC CAAGGTCGCC GCCATCGCCC TGCCGCTCGA GCAGGCCGTG GTCGACGACC CGGCCCCGGT CGAGGCCGCC TTCGAGAGCA CCAAGGAGCT GTTCCGGCTG ATGGCCGTGG ACATGGTCAA CCTGCTCGGG GTGACCTTGA ACTTCAGCGA CAACGATGGC GACTGA
|
Protein sequence | MPLATRSVLA PYLLPLALLV SAAPACSDEG SPSKGPDANR GPDVQDVLRD LANVVIVPAY DDFRASAEQL EAATRSLCAA PDAAQLTALR GQWRDTRALW KRAEAHEFGP AADLRIDTAV DFWPVRASSV DIELAKTDPV PEDYATTLGD TLKGLPVMEY ILYDGASAAD DADTESVLAR LVDAETGEPT RTCAYLVALS VDVHAKATTL YQAWAPEGEN FAAELATAGQ GSTAYPDRAK AVSAVVNDFV YLIQEVESVK LAEPLGKRAG DVPQPDAVES ARSDSSRADI AANLAGVRAV YTCTRGDATG ASFQAAVAAL NPELDAAIMA QLDDADAKVA AIALPLEQAV VDDPAPVEAA FESTKELFRL MAVDMVNLLG VTLNFSDNDG D
|
| |