Gene Hoch_6664 details

Gene Information

Locus tag: Hoch_6664
Symbol:
ID: 8549081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 9135773
End bp: 9137131
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 67%
IMG OID: 646391324
Product: peptidase M16 domain protein
Protein accession: YP_003271023
Protein GI: 262199814
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
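
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 9135773
end_bp = 9137131

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1359 452
```

Under these assumptions the computed values match the reported gene length (1359 bp) and protein length (452 aa).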


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCCC TATCTGTATT GGCGCTCGGC GCCGCCCTGC TCCTGCCCGG CGCGGCACTG 
GCCGAGGACA CGTCGCCGGG CCAGGACGCG CTGGCCACGA TGTTCCCCTT CCCGGTGCAT
CAGGAGACCC TCGACAACGG CCTGCAGGTC ATCGTCGTGC CCATGGAGAG CGATGAGCTG
GTGGCCGTGC GCATGGCCGT GCGCACCGGC GCGCGCGACG AGTACGAGCC CGGGCGCACC
GGCTTCGCGC ACTTCTTCGA GCACATGATG TTCCGCGGCA CCGAGAAGTA CCCGGCCGAG
GTGTACAACA AGCTGATCAC GCAGATGGGC GCCGACACCA ACGCCTACAC CTCGGACGAC
GTGACCGTGT ACCAGCTCAA CGTGGTCGCC GACGACCTCG CCCAGGTCAT GGAGCTCGAG
AGCGACCGCT TCAAGAACCT GTCGTATCCG CCGCAGGCCT TCGAGACCGA GGCCGGCGCG
GTCTACGGCG AGTATCGCAA GAACCGCACC AGCCCGTTCT TCACCCTCTA CGAGGCCATG
CGCAAGGCCG CGTATACCGT CCACACGTAC GGACACACCG CGATGGGCTA CGAGGCCGAC
ATCAAGAACA TGCCCAAGAT GTTCGACTAC TCGCGCACCT TCTTCTCGCG CTACTACCGC
CCGGACAACG CGATCCTGGT GGTCGCCGGC GACGTCGAGC CGCAGGCCAC CATCGCCATG
GCTCGCAAAT ACTACGGCGA CTGGGAGCGC GGCTACGTCA AGCCCAAGGT CAAGCCCGAG
CCCGAGCAGA AGCAGGAGCG CCGCATCGAG GTGAGCTACG AGGGCCGCAG CCTGCCGCTC
GTGGCCATCG CCTACAAGAG CGACGCCTAC TCGCCCAGCG ACCGCATCTA CGCCGCCTCG
CACGTGCTGG CCTCGCTGGC CTTTGGCGAG ACCAGCGACA TCTACCGCCA GCTCATCATC
GAGCAGCAGG CGGTGCAGTT CCTCGAGGCC GAGGCCTCCG ATTCGCGCGA CCCGGGCCTG
TGGGGCATCT GGACCATGGT CAAGGACCCG AGCAAGGTCG ACGAGGTTAT CGGACAGATC
GACGAGACCG TGGCCCGCTT CCGCAATGAG GCGCCCGACG CCGAGCGCCT CGAGGCGGTC
AAGTCGAACC TGCGCTACTC GCTGCTCATG GAGCTCGACT CGCCGGCCGC GGTGGCCGGA
ACCGTCGCGC ACCAGGCCGG CGTCGCCGGC AGCCTCGAGA ACGTCGCCCT GTTCCTGCAA
ACCCTCACCG AGGTGACTCC CGAGGACGTG CGCGCCGCCG CGCAGAAGTA CCTCGCCCCC
GAGCGCCGCA CCATCGCCAT CCTGCGGGAG AAGAACTGA
 
Protein sequence
MKSLSVLALG AALLLPGAAL AEDTSPGQDA LATMFPFPVH QETLDNGLQV IVVPMESDEL 
VAVRMAVRTG ARDEYEPGRT GFAHFFEHMM FRGTEKYPAE VYNKLITQMG ADTNAYTSDD
VTVYQLNVVA DDLAQVMELE SDRFKNLSYP PQAFETEAGA VYGEYRKNRT SPFFTLYEAM
RKAAYTVHTY GHTAMGYEAD IKNMPKMFDY SRTFFSRYYR PDNAILVVAG DVEPQATIAM
ARKYYGDWER GYVKPKVKPE PEQKQERRIE VSYEGRSLPL VAIAYKSDAY SPSDRIYAAS
HVLASLAFGE TSDIYRQLII EQQAVQFLEA EASDSRDPGL WGIWTMVKDP SKVDEVIGQI
DETVARFRNE APDAERLEAV KSNLRYSLLM ELDSPAAVAG TVAHQAGVAG SLENVALFLQ
TLTEVTPEDV RAAAQKYLAP ERRTIAILRE KN
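
The sequence blocks in this record are laid out in space-separated groups of ten characters, so they need cleaning before any computation. Below is a minimal Python sketch of how one might strip that layout, compute GC content, and translate the gene sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and are not taken from any library. The sketch assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ only in permitted start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the gene sequence in this record.
fragment = clean_sequence("ATGAAATCCC TATCTGTATT GGCGCTCGGC")
print(translate(fragment))  # MKSLSVLALG
```

Passing the full gene sequence block through `clean_sequence` and then `gc_fraction` or `translate` gives values that can be compared against the reported GC content and the listed protein sequence.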