Gene Hoch_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1839 
Symbol 
ID8544221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2530923 
End bp2533883 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content74% 
IMG OID646386545 
ProductMJ0042 family finger-like protein 
Protein accessionYP_003266280 
Protein GI262195071 
COG category 
COG ID 
TIGRFAM ID[TIGR02098] MJ0042 family finger-like domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0213586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.108328 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTC GCTGCGAGAA GTGCGGAACC GAATATGAGC TTGAAGAAGA TCGCCTCAAG 
CCGGGTGGTG TATCGGTCAA GTGCACCGAG TGCGGGCACA TCTTCCGGGT GCGCCGGCAG
GCGATCACGA GCGTCGGCTT CGGCGGTCTG GACAGCGGTC CGACGCGCGT AAGCGAGTCA
TCGTTCGACG ACGAGCGCAC CCAGGTCGAG AACAGGCCCA AGGGCGAGCG CGGCGAGGAG
GACGAGAGCC GCGACACCAT CCCGGTGCCG ACGCTGTCGC CCGCGGATGC CGAAAAAGCG
CGCCGCGCCA TGCGCATGAC GCCCTCGAGC GCGCGCTCGC AGCGGCGCTC GCGCGAGCGC
AACTGGCTGG TGCGGCTGCC CGGCGGCACC ATCGAGGCGT GCCGCGAGCT GGCCACGCTG
CACGAGTGGA TCGTGCTCGG CAAGGTGACG CGCAGCTCGG GGATTTCGCG CACCGGCAAG
ACCTGGAAGC GCCTGGGTGA CATCAAGGAG CTCGGCGCTC TGTTCGACGC CGCCGAAGAC
GCCCGGCGCG CGCGTCGCTC GGGGCAACCC GGACAGCCGG TCGCGCGACC CGGCGACGGC
ATCCCGCGCG AGCATATCCC CGACCCGGCC TCGGGGCCCA TCCCGGATTC GTTCTCGGCC
AGCGGCGCCG TGGCCCAGTC CGATGCGACG CGCCGCTCCG GGCCGCTGGT GGCGCCACCC
GAGAGCGCGT TGCCGGCGTC GTCGCTGTTA GACGGCGCCT TTGCCGACGA CGACGACGAC
GAGCTGGACG ACGACGGCAG CGAGCCGCTC GCGCTGCGCA ACCGGGCTCG CGACGACGAT
GACGGCGCGC TGATGGCCTT TGACGAGGCC GGCGAGAGCG TGACCACCGA GGGCGAGCGT
CCGGTGCTCG CCGAGGACGC GCCCGCACGC GCGCCCGGTG CCGCTCGCGC TGCGGCGGCA
GTCGCGCCAG AGCCCGACGA TGACGACATC GACGATGCCG ACATCGACGA TACCGCGGAC
GCGCCCGATC ATGATGTCGA TGCCGATGCC GCGGACGCCG CCGCCGACGC CGCCGCGCTC
GACGAGGACG ATATGCCCAA GGATACCTCC AACGCCGATT CACCGACCGC GGACGCGGCC
GCCGAGGGCG ACGCCAACGC CGCTGAGCCG CGCCAGCGCC CGGCGTGGGC GGCCCGCGCC
AGCGCCGACG GACCCGAAGG ACCGCGCGCG CTCGACCGCG CCGGCCTGGC CAAGGTCGCG
GCCGAGGATG GCCGCACCGG GCCCTCGGGC GGGCTGTCGC GCCGCAGCAG GGTGCAGGAC
GTGGCCTTTG GCCCCGGCCG CGTGCGCGCG CTGTCCGAGA GCGAAGACAG CGAGGACGGC
GAGGGCGCGG CGGCGCGCGA CGAGGAGCGA GACCAGCCCT CGAGCGGCGT GGGCCGCTGG
GTGGTGCTGG TCGCGTTGCT GCTGATGGCG GCGTCGGCCG GCGTGGTCTA CATGCTGGTG
TTCCGGCCCA CGGGCGAGAC CATCGAGGGC GTGCTGGTGG GCGGCGACGC GGGCCTGGCC
GAGCTGGCCG GAGGCGCCGA TGGCGGCCTC GACACGGTCG CGCTGGCCGA TCAACTGGGC
GCCGCCCTGG CGCGCGATAC CGACGCCGGA CTCGACGCCT TTGCCCGGCA GCTCGAGGCG
CTCGAGGCCG AGCTCGGCCA GAGCGAGGAG CTGCTGGTGG CCCGGGCCCG GGTGCGGGCG
GCGCGCGCGC AGCATCTCTT CGACCGCGCG GCCTTGGCCG ACCAGGCGGA CAAAAACCAG
CAGGCGAGCA CGCTGCAACG CGAGGCCGAC GAGATGGTGC TCGAGGCCCT CACGCTGGCT
CAGCGCGCGC TGCAGAAGCA GCGCAGCAAC CCCGAGGCGC TGGTGGCCAT GGCCGACGTG
CAGCGCCTCC AGGGGCGCAA CGCGCGCCAG ATCGACAGCT ATCTCGGCGA GGTCGGCAAG
GACACCGAGG CCCACCGCGA GGCCCGCCTG GTGCGCGCCA TGGCGCTGGC CCAGAACAAG
CGCGAGCGCG AGCAGGCGGT CAAGATCTTC GAGGCGCTCG AGGCCGAGGG CGAGGAGGAG
GGCTGGGGCG ACCTGCGGCC GCGCACGCGC CTGGCCATGC TGGCGTTTAT CAGCGAGCGC
TACGAGGCCG CCGAGGCCCA CGCCAACGCC GTGCTCAAGC GCGAACCCGC GCACGAGATC
GCGCGCGCGC TGCTCGACCG CCTCGAGGCC AGCGCCGGCG TCGATACCAG CGATCCCATG
CCGGTGGAGG AGGGGCCGGA CGATGGCGAT GGCGATGGCG ATACCGGCGC GGGCCGCGAC
ACCACGCCCA GCAAGCCGAC GACGCCGACC CCGACGCCGT CGCCGACGCC GACCCCGCCG
GCCAAGGATC CGCCGAGCAA GCCGGCGACC TACGCCGAGC TGCTGGCCCA GGCCAAGAGC
AAGGCGCAGG CCGGACAGTG CGGCGACGCG ATCCTGCTGT TCGAGCGCGC GCTCGACGAG
AACCCCATCG GCGTCGAGGC GCTCAACGGC ACCGGCTCCT GCCACCTGAG CCGGCGCGAA
TACTCGACCG CGCGCGGCAA GTTCCGCGCG GTGCTCGGCA TCGCCTCGGG CAACGCCGAC
GCCATCTGGG GCATGGCCGA GTCGTATCGC CAGCAGGGCA ACGGCGGCGA GGCCGTGCAG
TGGTATCGCC GCTACCTCGA CGCCCACCCG GCCGGCGGCC GCGCCGACGA AGCGCGTCGC
CGCGTCGACG AGCTGGGCGG CAGCGGCGGT GGCGGCTCGG ACGATGGCGC GGACGACGAC
GACGCGCCGG CTCCCAGCGC GCCGGCTCCT GCGCCAGCTC CGACCCCGGC CCCGACGCCG
GCTCCGGCTC CGGCGCCAGC GCCGAGCCCG GCTCCGGGCG GCGACGGTAC GGCTCCGGGC
GCGGCCGAAC AGATTCCCTG A
 
Protein sequence
MDIRCEKCGT EYELEEDRLK PGGVSVKCTE CGHIFRVRRQ AITSVGFGGL DSGPTRVSES 
SFDDERTQVE NRPKGERGEE DESRDTIPVP TLSPADAEKA RRAMRMTPSS ARSQRRSRER
NWLVRLPGGT IEACRELATL HEWIVLGKVT RSSGISRTGK TWKRLGDIKE LGALFDAAED
ARRARRSGQP GQPVARPGDG IPREHIPDPA SGPIPDSFSA SGAVAQSDAT RRSGPLVAPP
ESALPASSLL DGAFADDDDD ELDDDGSEPL ALRNRARDDD DGALMAFDEA GESVTTEGER
PVLAEDAPAR APGAARAAAA VAPEPDDDDI DDADIDDTAD APDHDVDADA ADAAADAAAL
DEDDMPKDTS NADSPTADAA AEGDANAAEP RQRPAWAARA SADGPEGPRA LDRAGLAKVA
AEDGRTGPSG GLSRRSRVQD VAFGPGRVRA LSESEDSEDG EGAAARDEER DQPSSGVGRW
VVLVALLLMA ASAGVVYMLV FRPTGETIEG VLVGGDAGLA ELAGGADGGL DTVALADQLG
AALARDTDAG LDAFARQLEA LEAELGQSEE LLVARARVRA ARAQHLFDRA ALADQADKNQ
QASTLQREAD EMVLEALTLA QRALQKQRSN PEALVAMADV QRLQGRNARQ IDSYLGEVGK
DTEAHREARL VRAMALAQNK REREQAVKIF EALEAEGEEE GWGDLRPRTR LAMLAFISER
YEAAEAHANA VLKREPAHEI ARALLDRLEA SAGVDTSDPM PVEEGPDDGD GDGDTGAGRD
TTPSKPTTPT PTPSPTPTPP AKDPPSKPAT YAELLAQAKS KAQAGQCGDA ILLFERALDE
NPIGVEALNG TGSCHLSRRE YSTARGKFRA VLGIASGNAD AIWGMAESYR QQGNGGEAVQ
WYRRYLDAHP AGGRADEARR RVDELGGSGG GGSDDGADDD DAPAPSAPAP APAPTPAPTP
APAPAPAPSP APGGDGTAPG AAEQIP