Gene Hoch_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0031 
Symbol 
ID8542401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp45148 
End bp46860 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content68% 
IMG OID646384819 
ProductProtein of unknown function DUF1592 
Protein accessionYP_003264566 
Protein GI262193357 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCC GTACCACGAT GTCCAAGCTG GGCTCGGTGA TCTTGACCCC GGTGTTGTTC 
GCGGCCGTGG GATGCAGCGG AGGAGGGGAC TCGACGGGAG GCGATCGACC CGCATCCTGT
ACCAGCGAGA TCTCGCCGGG ACCGGCCCCG ATCCGACGGT TGACCAGAGA GGAATACAAC
AACACGGTAT TTCAGCTACT GGGCGACTCA ACATTTCCAG CAAACGCATT CGCTCCCGAT
GAAGAGGCGA ATGGGTTTTC CAACCAGGCC AGCGCACTTG TGGTGAGCCC GCTGCTCGCC
GAGCAGTACA TGGGCTCGGC GGAGGCGCTC GCGGCCAGCC ACGCCGATAC CTTGCTCCAG
CGCAACATGG CGCAGTGTGC GAGCGGCGCC GATACCCAGA CCTGCGAGGA GGAGGCCGAG
GCGTTTGTCC GCGGCTTTGG CAAGCAGGCC TTCCGGCGGC CGCTGAGCGA GGACGAGGTG
CAGGCGCACG TGGCGCTGTT CCTGATGGGC TCCGAGGGCG GGGCCGATGG CTACGCGCCG
GCGGCCGGTG TCGAGCTGGT GGTGCAGGCG ATGCTGCAGT CGCCGCACTT CCTCTACCGG
GTCGAGTTCG GCCGCGAGGG CACCGAGAAG AACGGCATCG TCGAGCTGAG TTCCTACGAG
GTCGCCTCGC GCCTGTCCTA TCTGTTCTGG GGCACCATGC CCGACGCGGC GCTGTTCGAG
GCCGCCGACC GCGACGAGCT GCGCACGCCC GAGCAGATCG AGGTGCAGGC GCGGCGCATG
GTCAACGCGC CGCGCGCGCG CGACGCGGTC AAGAACTTCC ACCGCCAGTG GCTCAATCTC
GACGAGATCC CGGGCATCGC CGCCATCGGC CGCAACCGCG AGATCTACCC CGACTATCGC
GAGTCGCTGC TGCCGCTGTT GCAGCGCGAG ACCGAGGAGT TCCTCGACTA CGCCATCTTC
GAGGCCAACG CCAGCGTCGA GGACATGTAC ACCGCGCCCT ATACGATGAT GAACGCCGAG
CTCGCCGAGT TCTACGGCAT CGAGAACGGC CCCACCGGCA GCGAGTTCGA GCGCGTGGAC
CTCGACGGCG AGCGCTACAG CGGCTTGCTC ACCCACGCCG GCATCCTGGC GCTGCACTCG
CGCTTCGATT CCTCGTCGCC CGTGCACCGC GGCCTGTTCG TGCGCACGCA GCTCCTGTGC
CAGCCGCCGC CGCCGCCGCC CGACTTCGTG CCCGAGCCGC CCGTGGTCGA CCCCAACGTC
ACCACCCGTG AGCAGTATCG GCAGCACTCG GACGATCCCG GCTGCAGCCG CTGCCACCAG
CTCATGGATC CCCTGGGCCT CGGTTTCGAG CACTTCGACG CCCTGGGCCG CTACCGCGAG
ACCCAGAGCG GACTCGCGAT CGACGACTCC GGCACCATCG TGGACGCCGA CGCCGACGGC
ACGCCGGCGG CCGAGAGCAT CGACGGACCC TTCGACGGTC CGGTCGAGCT CGGCGCGCGC
CTGGGCAACA GCAGCACCGT GCGCGAGTGC GTGAGCGCGC AGTGGTTCCG CTTCGCCTAC
GGTCGCGCGG AGACCGAGGC CGACGACTGC TCCATGGACA CCATCAACCA GCGCTTCGCG
GACTCGGGGT ACGACATCAA AGAGCTGCTC GTTGCGCTCA CCCAGACCGA CGCGTTCCGC
TATCGCCCGA TCACCACGGA GGTCAGCGAA TGA
 
Protein sequence
MKTRTTMSKL GSVILTPVLF AAVGCSGGGD STGGDRPASC TSEISPGPAP IRRLTREEYN 
NTVFQLLGDS TFPANAFAPD EEANGFSNQA SALVVSPLLA EQYMGSAEAL AASHADTLLQ
RNMAQCASGA DTQTCEEEAE AFVRGFGKQA FRRPLSEDEV QAHVALFLMG SEGGADGYAP
AAGVELVVQA MLQSPHFLYR VEFGREGTEK NGIVELSSYE VASRLSYLFW GTMPDAALFE
AADRDELRTP EQIEVQARRM VNAPRARDAV KNFHRQWLNL DEIPGIAAIG RNREIYPDYR
ESLLPLLQRE TEEFLDYAIF EANASVEDMY TAPYTMMNAE LAEFYGIENG PTGSEFERVD
LDGERYSGLL THAGILALHS RFDSSSPVHR GLFVRTQLLC QPPPPPPDFV PEPPVVDPNV
TTREQYRQHS DDPGCSRCHQ LMDPLGLGFE HFDALGRYRE TQSGLAIDDS GTIVDADADG
TPAAESIDGP FDGPVELGAR LGNSSTVREC VSAQWFRFAY GRAETEADDC SMDTINQRFA
DSGYDIKELL VALTQTDAFR YRPITTEVSE