Gene Hoch_1301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1301 
Symbol 
ID8543683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1712388 
End bp1713596 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content60% 
IMG OID646386017 
ProductTranscription factor jumonji 
Protein accessionYP_003265752 
Protein GI262194543 
COG category[S] Function unknown 
COG ID[COG2850] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.140521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATC CGAAACTTCG TTTATCAGAT ACGCTCGCCA CGGTGACACG CCGCGTTAAG 
CAGTTCGAGC CCGAGGTGTG TCGTGCATTG AAGGGCCATT ATCACCTGTG CCTACACAGC
GAGAGTGGCA GTCAGCGAAA CCTCTTCATT TCCATCGGAG AAGAGTCGGC GCGACTCGTC
GATAGTTCAT CTGCAGAAGT GGACCTTCGG CTCAAGCTCC GCGAGACTGA CTTCGTCTCA
CTCATCGCGG GCTCGACAGA CCCCTTCACT CTCTTCACGA ATGAGCGCCT TTTGCTGCAC
GATGGCCGAG GCAACGCGGT GCCATTCGCG GCTGGCCGGC GGCTACTGCA GACCTTGCTG
GTGGAATGGC GGATCGATTC CACGTCTGTC GCTGCGATAG AAGCGCACGC CGCGCGACTT
GCAGGCCGAA CCCATGTCGA CCGGGTTACT GGCTGTGAGG CAGATGAGTT CTTTGCGCGC
TACGCGCGTC CCGGAGTGCC TGTCATTCTG GGTGGAATGG CGCGCGACTG GCCGCTCACC
TCGTTGGATC CGCAGACGTT GGGAGAGCGG TTTGGCGGCT ATCAGGTGGC GGTATTTTCG
GACCTGGTCG AGGACGCCGG TACGAGCAAC GAGGGCAAGC AGGCGGTACA GACCTCGGGA
CGCGTGTTCA CCAATATGCC CTTGCGCGAA TTCATCCAGA TGTCGTTCTC TGGCCCACGC
CGCTTGGGCG ACCTGAGTCG AGTGACCCCG TATATCACCG CCCATTCCTT GCCGGACGAG
CTGCTGGAAT GGATCGAGTA CCCGCCGTTC TTTCCCAGGG AGGCGTTCGT TCGCCCCAAG
ATGTGGATGG GGCCTTCACA CACGGAGACG CCCTTGCATC GGGACCTGAT CGACAACTTC
TTGGCGCAGG TGTGGGGGTT CAAGCAAATG CGTTTGATCT CACCCGCCCA CACGGCAAAA
CTGTACGCGA TAGCCGAGAA TCTCAACCCC TACTATCAGC CCTCTCAGCT CGATGCCGAT
CGACCGGACC TAGCTCAGTT TCCGATGTGC GCAGACGTGC CCTACACCGA CTGCGTGTTG
TCGCCAGGAG ACATTCTCTA TTTGCCGGCG GGGTGGTGGC ATCGCGTGCG CTCGCTCGAG
CCGTCGTTGT CCGTCAACTT TTTTGCCCTC AACCAGGCCC CGTCGAGCAT CTCGTCCACG
TCCGCTTGA
 
Protein sequence
MIDPKLRLSD TLATVTRRVK QFEPEVCRAL KGHYHLCLHS ESGSQRNLFI SIGEESARLV 
DSSSAEVDLR LKLRETDFVS LIAGSTDPFT LFTNERLLLH DGRGNAVPFA AGRRLLQTLL
VEWRIDSTSV AAIEAHAARL AGRTHVDRVT GCEADEFFAR YARPGVPVIL GGMARDWPLT
SLDPQTLGER FGGYQVAVFS DLVEDAGTSN EGKQAVQTSG RVFTNMPLRE FIQMSFSGPR
RLGDLSRVTP YITAHSLPDE LLEWIEYPPF FPREAFVRPK MWMGPSHTET PLHRDLIDNF
LAQVWGFKQM RLISPAHTAK LYAIAENLNP YYQPSQLDAD RPDLAQFPMC ADVPYTDCVL
SPGDILYLPA GWWHRVRSLE PSLSVNFFAL NQAPSSISST SA