Gene Hoch_4981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4981 
Symbol 
ID8547389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6867384 
End bp6868538 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content71% 
IMG OID646389655 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003269363 
Protein GI262198154 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.389941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTTGG TGGCGTGTGG TGGTGATTCG AAAGGCGCCG AGCAGCCCGA GCAGCCCGAC 
CCCGCGGCTC AGGCGGAAGA GGCCTTGTCC CAGGCCGATG AGGCCCGCGA CAGCGGACAG
GCCGCCGCCG CGGAAGGCCA CTACGAGCGC GCCCGCGAGC TGCGCCCCGA GCACTACGAG
ACCGCCGAGC GCTACGTCGG CTTCCTCATC GCCGAGGGCC GCGCCGATGA CGCGGTCGCC
GAGGCGCAGG AGTACCTCGA GCAGGCCATC GGCGAGCTCA AGGGATACCA CCTGCTCGCC
GAAGCGCAGA TGGCGGCCAA GGACTACGAG GGCGCGCGCA GCACCCTGTC GCAGCTCCTC
GGGCTCGACG AGACCGACGC CGCGGCCTAC GCCAAGCGCG GCGAGGCCGC GATCGCCCAG
AAGGACTACG AGTCCGGCCT CGAGGACATC CGCAAGGCCA TGGAGCTCGA GCCGCAGAAC
CTCGAGTACC GGGTGACCCT GGGCAAGGGG CTGCAGGAGA CCGGGCAGAA CGGCGAAGCC
GCCGAGGTGC TCGCGGCCGT GGTCGAGGAG AACCCGGCGT ATCTCGACGG CCTGCTGGTC
TACGGCGCGC TGCAGCGCTC TGCCGGCAAG CTCCAGGACG CGCGCGAGCT GCACCAACGG
GCCGTGGAGA CCAGCCCCGA GTCGGCGCTG GCGCACTACG AGCTGGGTAT CACGCAGTTC
TACATGGGCG ACCGCGACGA CGCGCTCAGC AGCCTGCAGC AGGCCACCGA GCTCGACGCC
AGCGACGCGC AGATCCGCTA CGTGCACGGC GAGCTGCTGC GCAACATGGG GCGCTTCGAA
GAGGCGGCCG AGCGCTATCG CGATGCGCTC GACCGGCAGA AGGATCACGA CAAGGCCGCC
GCCAAGCTGG GCCTCATGCT GACCAAGCTC GAGCGCTTCG ACGAGGCCGC CGAGGTGCTG
AGCGCCCGCG TCGAGCGCGA GCCCCAGGAC GCCGACGCGC TGCTCTACCT GGGCCAGCTC
CACGAGTCGC AGGAGCAGTT CGCCGAGGCG GTCGCCGCTT ACGAGCGCTT CCTCGAAGTC
GCCGGGCCCG ATGAGCAGGC CAGCGTCCCC GAGGTCAAGC GCAAGGTCCG CATCCTCAAG
CGCAAGGTGC GCTGA
 
Protein sequence
MGLVACGGDS KGAEQPEQPD PAAQAEEALS QADEARDSGQ AAAAEGHYER ARELRPEHYE 
TAERYVGFLI AEGRADDAVA EAQEYLEQAI GELKGYHLLA EAQMAAKDYE GARSTLSQLL
GLDETDAAAY AKRGEAAIAQ KDYESGLEDI RKAMELEPQN LEYRVTLGKG LQETGQNGEA
AEVLAAVVEE NPAYLDGLLV YGALQRSAGK LQDARELHQR AVETSPESAL AHYELGITQF
YMGDRDDALS SLQQATELDA SDAQIRYVHG ELLRNMGRFE EAAERYRDAL DRQKDHDKAA
AKLGLMLTKL ERFDEAAEVL SARVEREPQD ADALLYLGQL HESQEQFAEA VAAYERFLEV
AGPDEQASVP EVKRKVRILK RKVR