Gene Hoch_3323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3323 
Symbol 
ID8545711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4588739 
End bp4590808 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content72% 
IMG OID646387990 
Productpeptidase M61 domain protein 
Protein accessionYP_003267718 
Protein GI262196509 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0788147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00902527 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCAGCGCC ACCCAGGCGG GAGAAGTAAG CGATCGGCCG CGCCCGCGCG CAGCCGGCTA 
CCCCGGGTCG CGATCCTCGG CATCGCGTTC GCGCTGAGTC TCGGCACAGC CTGCCGCGGC
GGCGGCGCCA GCGCACCGCC GGGCGACGAG GGCGCCGAGC CCGGCGCCAG CCGGCGCGCG
CCCTCCGACG ATCCCGCGAG CCTCGACGCG CCGCCGCTGC GCATGCCGGC GCCCGGCGAT
CCGCGCGTGC TCTACCAGCT CTCGTTTCCC GCGCCGCAGA CCCATTACGT CGAGGTCGAG
GCCGTGATCC CGGTGCCCGC GGCCGCGACC GAAGCCGAAG CCGAAGCCGG GGAGGGCGAC
GGCGCCGCCG CGGCCGATAG CGCACCGGGC GACGCCATGG ATCTGTTCAT GGCGGTGTGG
ACGCCGGGTT CGTATCTGGT GCGCGAGTTT TCGCGTCACG TCGAGGACCT GCGCGTGAGC
ACGCCCGCGG GCGCGACCCT GGCGGTCGAC AAGGTGCGCA AGAACCGCTG GCGGGTGCGT
CTGCTGGGCA ACAAGGTCAA GAAGCGGCCC GAGCATGTGG TGGTTCGCTA CCGCGTGTAC
GCGCGCGAGA TGAGCGTGCG CACCAGCTTT GTCGACGCCG ACATCGCCGT GCTCAACGGC
GCCTCGCTGT TCGTGAGCGC GCTCGGCGGG CAAGCGCTGC CGCACGAGGT CCGGCTCAGC
TTGCCGGCGG CTTGGTCGGA CAGCGTGACC GGCATGCCGG CGCACCCCGA GGGCGCGCCG
CATCACTATC TGGCCGAGGA CTACGACGCG CTGGTCGATG CGCCCATCGT CGCCGGCAAC
CCCACGCTGC ACAGCTTCGA GGCCGGCGGC GCCAACCACC GCATCGCCAC CTTCCTGAGC
GACGAGCGCT GGGACGGCGA GCGCGTGGCG GCCGACATCG AGACCCTGGT GCGCGCTCAG
ATCGACTTCT GGCGCCAGGT GCCGTATCGC GACTACGTGT TTCTCGCCGT GCTCTCGGGC
ACCGGCGGCG GGCTCGAGCA CGGCAACTCG ACCCTGATGA TGGGCGACCC CTGGCTCACG
CGCGAGCGCG ACAGCTACCT GCGCTGGCTC GGGCTGGCGA GCCACGAGTT CTTTCACACC
TGGAACGTCA AGCGGCTGCG GCCGGTGGCT CTGGGCCCCT TCGACTACGA GAGCGAGGTC
TACACCGAGT CGCTGTGGGT CGCCGAAGGC ATCACCTCGT ATTACGCGGA CCTGCTGCTG
CGCCGCGCCG GCCTCATCGA TGACGCCGGC TACCTGCGCA ATCTGTCGCG GCGGCTGGGG
CAGGTGCAGC GCGTGCCCGG GCGCCTGGTG CAGCCGCTGG CGGCCAGCTC CTACGACGCC
TGGATCAAGT TCTATCGCAA CGACGAGAAC AGCGACAACA GCAGCGTGAG CTACTACAGC
AAGGGCGCCC TGGTCGGCTT CCTGCTCGAC GCCGAGATCC GCCGCCAGAC CCGGGGCACG
CGCAGCCTCG ACGACCTCAT GCGCCTGGCC TACGCGCGCT ACGCGGGCGA GCGCGGCTTC
ACCGAGGCCG AGTTCCGGGC GCTGGCCGGC GAGATCGCGG GCGCCGATCT GTCGGCGTTT
TTCGACCAGA CGGTCGACAG CGCGGCCGAG CTGTCCTTCG AGCCCGCGCT CGAGTACTTC
GGCCTCACCA TGGGCAGCGG CGACGACGCC GCCGCCGGCA GCGCCGACGA GCCCGCGGGC
TGGCTGGGCG CGAAGCTGCG CGAGGACGGC GGACGCTCGC TGGTGAGCGA GGTGCCGCGC
GATACCCCCG CGCACCGCGC CGGCGTCAAC GTCGGCGACG AGCTGCTGGC CATCGACGAG
CGCCGCATCT CCAGCGACGG CCCCGATGAT GTATTGCGCT ACCTGCGGCC GGGCGAGCGA
GTCGAGCTGC TGGTCGCGCG CCGCGAGCGG CTGCGGCGCC TGGCTGTGAC GCTGGGCGAT
AAGCCCGATG AGGAGTGGAA GCTGGCGCTG GCGCCGCGTC CCAGCAACGA TCAGGCGCGC
CGGCGCGGCC GCTGGCTGTC GGGGTTATGA
 
Protein sequence
MQRHPGGRSK RSAAPARSRL PRVAILGIAF ALSLGTACRG GGASAPPGDE GAEPGASRRA 
PSDDPASLDA PPLRMPAPGD PRVLYQLSFP APQTHYVEVE AVIPVPAAAT EAEAEAGEGD
GAAAADSAPG DAMDLFMAVW TPGSYLVREF SRHVEDLRVS TPAGATLAVD KVRKNRWRVR
LLGNKVKKRP EHVVVRYRVY AREMSVRTSF VDADIAVLNG ASLFVSALGG QALPHEVRLS
LPAAWSDSVT GMPAHPEGAP HHYLAEDYDA LVDAPIVAGN PTLHSFEAGG ANHRIATFLS
DERWDGERVA ADIETLVRAQ IDFWRQVPYR DYVFLAVLSG TGGGLEHGNS TLMMGDPWLT
RERDSYLRWL GLASHEFFHT WNVKRLRPVA LGPFDYESEV YTESLWVAEG ITSYYADLLL
RRAGLIDDAG YLRNLSRRLG QVQRVPGRLV QPLAASSYDA WIKFYRNDEN SDNSSVSYYS
KGALVGFLLD AEIRRQTRGT RSLDDLMRLA YARYAGERGF TEAEFRALAG EIAGADLSAF
FDQTVDSAAE LSFEPALEYF GLTMGSGDDA AAGSADEPAG WLGAKLREDG GRSLVSEVPR
DTPAHRAGVN VGDELLAIDE RRISSDGPDD VLRYLRPGER VELLVARRER LRRLAVTLGD
KPDEEWKLAL APRPSNDQAR RRGRWLSGL