Gene Hoch_0092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0092 
Symbol 
ID8542463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp144664 
End bp146112 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content70% 
IMG OID646384880 
Productpeptidase M16 domain protein 
Protein accessionYP_003264626 
Protein GI262193417 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGC CCGATTCGAT GACGAACGGC TCGGGGCGCG CCCGTCGGCG CATCCCGGGC 
CCCGCATTGC CGCGCCGCGC GCTGCTGGCC GCCGCGCTGG CGCTGGCGCT GACCGCCACG
GCGCTGGCGC TACCCGCGCA GGCACAACCG GCCGCGCGCG CAGGTGCTCA GGCGAGCGCC
GAGAGCCAGC TCGCCCTGCC CGCGCTCGAC ACCTGGCAGC TCGACAACGG CATGCAGGTG
GCGTTTTTGC GCATCGCCGA TGCCCCGGTG TTGTCGGTGC AGGTCTGGTA TCACGTGGGC
TCGAAAGACG AGCCGCGCGA TCGCCGCGGC CTGGCCCACA TGTTTGAACA CATGATGTTC
AAGGGTACCG AGAACCTGCG CTCCGAGGAG CACGCACGCT TCATCGACTC GCTCGGCGGA
TACACCAACG CGGTCACCTC GGAAGACGCC ACTCGCTATA TCAACGTCAT TCCAAAACAG
TATCTCGATT TCGTCTGCCA GCTCGAGTCC GAGCGCATGC GCAAGCTGCT GTTTCGCGAC
AGCATGATCC GCACCGAGCG CGAGGTGGTG AAAGAGGAAG TCCGGCAGCA GGAGAACAAC
CCGCTCACCG TGGGTCTGCT GCGCTTTCTC GCCACCGCGT ACACCAAGCA TCCCTACGCG
TGGACCGCGG GCGGCACCAT CGCCGACCTC GACGCCGCCA GCACCGCCGA CCTCAAGCGC
TTCTACGACA CCTATTACGT GCCCAATAAC GCCATGCTGG TGGTCGTCGG CGACGCCTCT
GCCGACGCGG TCAAAGCCGC CGCCGAGCGC TGGTTCGCGC CCATCCCGCG CGGCCAGGAG
CCGCCGCGAC CGGCCGACGA CGCCACCGAG CCCAAGCAGA CCAGCAAGCG CCGCGAGGTG
GTCGCCCCCG GCCAGGTCGG CGTGCTGCTC GCCGGCTATC ACGTGCCCAG CGCGTCGGAC
GACGACTCCT ACCCGCTGCA GGTCGCCAGC CTGGTGCTCG GCGCGGGCGA GTCCTCGCGC
CTCACCCAGC GCCTGGTGCG CGGCGACGAG CTGGCCGTGC AGGCCGGCGC CCTGCTGCTG
GCGCGCGAGC ATCCCGGCAT GCTGTGGACC TTCGCCATCT TCCTGTCGCC CGGAGCCGCC
GACGACATCG AGAGCGCGCT CGCGGCCGAG GTCGCGCGCC TGGCCAGCGA GGGCCCCAGC
GCCGACGAGC TGCGCAAAGC CAAGCATCAG CTCCAGGCCG GGCTGGCGTT CTCACTCGAG
AACGTCGCCG GCCTGGCTGA GCAGATCGGC ATGTCGTGGA TCCTGTCCGG CGACCCCGGC
CGCTGGCGCA ACGATCTCGC CCGCTACCGC GCGGTCACGG CCGATGAGGT CAAACGCGCG
GCCGCCGCCT ACCTGGTCGA CAGCAACCTC ACCGTGGTGG TGGTACCGCC CGCCGGAGCG
CCGCAATGA
 
Protein sequence
MTMPDSMTNG SGRARRRIPG PALPRRALLA AALALALTAT ALALPAQAQP AARAGAQASA 
ESQLALPALD TWQLDNGMQV AFLRIADAPV LSVQVWYHVG SKDEPRDRRG LAHMFEHMMF
KGTENLRSEE HARFIDSLGG YTNAVTSEDA TRYINVIPKQ YLDFVCQLES ERMRKLLFRD
SMIRTEREVV KEEVRQQENN PLTVGLLRFL ATAYTKHPYA WTAGGTIADL DAASTADLKR
FYDTYYVPNN AMLVVVGDAS ADAVKAAAER WFAPIPRGQE PPRPADDATE PKQTSKRREV
VAPGQVGVLL AGYHVPSASD DDSYPLQVAS LVLGAGESSR LTQRLVRGDE LAVQAGALLL
AREHPGMLWT FAIFLSPGAA DDIESALAAE VARLASEGPS ADELRKAKHQ LQAGLAFSLE
NVAGLAEQIG MSWILSGDPG RWRNDLARYR AVTADEVKRA AAAYLVDSNL TVVVVPPAGA
PQ