Gene Hoch_3870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3870 
Symbol 
ID8546265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5324900 
End bp5326075 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content72% 
IMG OID646388541 
ProductPeptidase M75, Imelysin 
Protein accessionYP_003268262 
Protein GI262197053 
COG category[R] General function prediction only 
COG ID[COG3489] Predicted periplasmic lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.755425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.516703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTCG CCACCCGCTC TGTGCTCGCC CCGTATCTGC TGCCCCTGGC GCTGTTGGTG 
TCGGCTGCGC CCGCGTGCTC GGACGAGGGC TCGCCCAGCA AGGGCCCCGA CGCCAACCGC
GGCCCCGACG TCCAGGACGT GCTGCGCGAC CTGGCCAACG TCGTGATCGT GCCCGCCTAC
GACGACTTCC GCGCCAGCGC CGAGCAGCTC GAGGCCGCCA CCCGCAGCCT GTGCGCGGCC
CCGGACGCGG CCCAGCTCAC CGCCCTGCGC GGCCAGTGGC GCGACACCCG GGCGCTGTGG
AAGCGCGCCG AGGCGCACGA ATTCGGACCG GCGGCCGATC TCCGCATCGA CACCGCCGTG
GACTTCTGGC CGGTGCGCGC CAGCTCCGTG GACATCGAGC TGGCCAAGAC CGACCCGGTG
CCCGAGGACT ACGCCACCAC CCTGGGCGAC ACGCTCAAGG GCCTGCCGGT GATGGAGTAC
ATCCTCTACG ACGGCGCCAG CGCGGCCGAC GACGCCGACA CCGAGAGCGT GCTGGCGCGC
CTGGTCGACG CCGAGACCGG CGAACCCACG CGCACCTGCG CGTATCTGGT GGCGCTCAGC
GTGGATGTGC ACGCCAAAGC CACGACCCTG TACCAGGCCT GGGCGCCCGA GGGCGAAAAC
TTCGCCGCCG AGCTGGCCAC CGCCGGCCAG GGCAGCACCG CCTACCCCGA CCGCGCCAAG
GCCGTGAGCG CGGTGGTCAA CGACTTCGTC TACCTCATCC AGGAAGTCGA GAGCGTCAAG
CTCGCCGAGC CCCTGGGCAA ACGCGCCGGC GACGTGCCGC AGCCAGACGC GGTCGAGTCA
GCCCGCAGCG ACAGCTCGCG CGCCGACATC GCCGCCAACC TCGCCGGCGT GCGCGCGGTC
TACACCTGCA CCCGCGGCGA CGCCACGGGC GCCAGCTTCC AGGCCGCCGT GGCCGCGCTC
AATCCCGAGC TCGACGCCGC CATCATGGCG CAGCTCGACG ACGCCGACGC CAAGGTCGCC
GCCATCGCCC TGCCGCTCGA GCAGGCCGTG GTCGACGACC CGGCCCCGGT CGAGGCCGCC
TTCGAGAGCA CCAAGGAGCT GTTCCGGCTG ATGGCCGTGG ACATGGTCAA CCTGCTCGGG
GTGACCTTGA ACTTCAGCGA CAACGATGGC GACTGA
 
Protein sequence
MPLATRSVLA PYLLPLALLV SAAPACSDEG SPSKGPDANR GPDVQDVLRD LANVVIVPAY 
DDFRASAEQL EAATRSLCAA PDAAQLTALR GQWRDTRALW KRAEAHEFGP AADLRIDTAV
DFWPVRASSV DIELAKTDPV PEDYATTLGD TLKGLPVMEY ILYDGASAAD DADTESVLAR
LVDAETGEPT RTCAYLVALS VDVHAKATTL YQAWAPEGEN FAAELATAGQ GSTAYPDRAK
AVSAVVNDFV YLIQEVESVK LAEPLGKRAG DVPQPDAVES ARSDSSRADI AANLAGVRAV
YTCTRGDATG ASFQAAVAAL NPELDAAIMA QLDDADAKVA AIALPLEQAV VDDPAPVEAA
FESTKELFRL MAVDMVNLLG VTLNFSDNDG D