Gene Hoch_6572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6572 
Symbol 
ID8548989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp9020282 
End bp9022285 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content67% 
IMG OID646391232 
Productpeptidase M19 renal dipeptidase 
Protein accessionYP_003270931 
Protein GI262199722 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.595868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGC AATCGATTTT ATTCTTCGCC ACCTGCGCGG CAGCCGCGGT GGCGGCGCCC 
CAGGCGGCCC ACGCCGCCGA CTGTGGCGAC ACCGTGGTCA TCTACAACTT GTTCGACATG
TCGATGGACG AGATGTTCGG CAACGACGAG GGCCACCTCG GTCCGCGCCG CCTGGTGACC
GAGCAGTACA GCACCATCCC GAGCTACGCC ACCAAGCGCT TCATCCAGGC CACGCCCAGC
GACATCGACT CGCTCAAAGT CGTGGTCACC AAGACCGACA ACGGCGGCAT CGGCGGCAAG
ACCGCCTTCG TGGTGTGCTC GACCGACGCC AACGACAACG TGGTCAAGCT CGAGGAGTTC
AGCATCTCGG GCGGCAGCAG CAACATCGGC ACCACGGTGC AGCGCACCTA CAGCAACCTG
CGCGACAAGC GTCTGTCGGT GCGCCTGGTG GGCAAGAGCC CGTTTGGTTC GGCGCGTTTC
AACATCGATA TCCGCCGTCC CGGTGTCGAG GGTCAGCCCT GGACGCCGGT GCAGAGCAGC
CACAGTCAAC CGCTGTCCGG CTTCGCCGAT CTGCACGTGC ATCAGGCCGC CGACCTGGCC
TTCGCGGGCG GCTGGTACTG GGGCTCGCAT CGCGAGGGCT CCGAGGCCAC GCGCCTGGCC
GAGTGCGGGG GTGACAACCA CGCCACCATC GAGATCTTCG GCGGCAACAC CGGCGTGGAC
TACATCGATC CCCACACCGG CGAGACCAAC GGCTATCCGA GCTTCGAGGA CTGGCCGCGC
TGGGACGACA TCAAGCACCA GCAGGTCGGC CTGCGCTGGC TGCAGCAGGC GCACGAGAAC
GGCCTCAACG TGATGGTCGT GTCGGTGGTC AACAACCAGT GGCTGTCGGC GGCCACGATC
GCGTCGGGTC ACAATGACAA CCGCATGTCG CCTTCGGACA TGGAGTCGGT GAAGCGCCAG
ATCCTGTCGA TCACGCGCCT GGCCGAAGTC ACGCCCTGGT ACACGATCGT GCGCGATCCC
TGGGAGGCGC GCCGCGCGAT CGAGGCCGGC CAGCTCGCCG TGGTGCTCGC GGTCGAGGTG
AGCGACGTGC TCCCGCCCAG CGATGGCCCG TGGATCCAGC AGCTCCACGA CCTCTACGAT
ATGGGCGTGC GCACGGTGCA GCTCGCCCAC CAGACCAACT CGCTGTTCGC GGGCGCCGCC
TTCCACCGCG AGATCCTCGA GTTCCTCGGC ATGATCAAGG CCTGGTTCGA TCCCGACATC
GAGTACGCCA CCACCGGCGA CGGCAACAAC AACCCCATCG GCCTGAGCGC CGACGGCGAG
GCCCTGCTGC GCGAGATGGT GCGCCTGGGC ATGCTCATCG ACATCGCCCA CCTGTCGCTG
GAGACGCAGC GCACGGTGTT CGACATGATG TCCGAAGACT ACGGCTACTA CCCGCTGTAC
GTGTCGCACA CGCGCGCCGA CGCCACCCTG CTGCCCGAGC AGGCGGACGT GTACCGCGAG
CTGGTGACCA CCGACGAGGT GCTCGAGTAC GTGCGCCAGA CCGGCGGCCA GATCGGCCTG
CGCACGGCCG AGGATCCCAT GCTCGACTAC GGCACGCCCA ACACGGGCGC GTACGTGGCC
AACAACTGCG ACGGCTCGAC GCGCTCGTTT GCCCAGAACT ACCAGTACGC GGCTGACCGC
GGCGTGAACA TCGCGCTGGG CTCGGATTTC AACGGCTTCA TCACCCAGAC CGTGCCGCGC
TTCGGCCCCG GCGCCTGCGC GGGCGCGCCC GACGAGGCCA CCCGCCTGCA GCAGGCCGCG
GCCCAGGGGA CGCCGCGCTC GAACGCCCCC GCCTACCTGC AGGAGTACTG GACCAAGGGC
ATGGCCCACA TCGGCCTGCT GCCCGCGATC ATCGATGACA TGGACGAGCT CGGCGTCGAT
ACCTCCAACG TCCGCAACTC GGCCGAGTCC TTCGTGCAGA TGTGGGAGCG CGTCTACGAT
CCCGCTCGCG GCCGCGTGAA CTGA
 
Protein sequence
MKLQSILFFA TCAAAAVAAP QAAHAADCGD TVVIYNLFDM SMDEMFGNDE GHLGPRRLVT 
EQYSTIPSYA TKRFIQATPS DIDSLKVVVT KTDNGGIGGK TAFVVCSTDA NDNVVKLEEF
SISGGSSNIG TTVQRTYSNL RDKRLSVRLV GKSPFGSARF NIDIRRPGVE GQPWTPVQSS
HSQPLSGFAD LHVHQAADLA FAGGWYWGSH REGSEATRLA ECGGDNHATI EIFGGNTGVD
YIDPHTGETN GYPSFEDWPR WDDIKHQQVG LRWLQQAHEN GLNVMVVSVV NNQWLSAATI
ASGHNDNRMS PSDMESVKRQ ILSITRLAEV TPWYTIVRDP WEARRAIEAG QLAVVLAVEV
SDVLPPSDGP WIQQLHDLYD MGVRTVQLAH QTNSLFAGAA FHREILEFLG MIKAWFDPDI
EYATTGDGNN NPIGLSADGE ALLREMVRLG MLIDIAHLSL ETQRTVFDMM SEDYGYYPLY
VSHTRADATL LPEQADVYRE LVTTDEVLEY VRQTGGQIGL RTAEDPMLDY GTPNTGAYVA
NNCDGSTRSF AQNYQYAADR GVNIALGSDF NGFITQTVPR FGPGACAGAP DEATRLQQAA
AQGTPRSNAP AYLQEYWTKG MAHIGLLPAI IDDMDELGVD TSNVRNSAES FVQMWERVYD
PARGRVN