Gene Hoch_0237 details


Gene Information

Locus tag: Hoch_0237
Symbol:
ID: 8542616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 352310
End bp: 355264
Gene Length: 2955 bp
Protein Length: 984 aa
Translation table: 11
GC content: 73%
IMG OID: 646385033
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003264771
Protein GI: 262193562
COG category:
COG ID:
TIGRFAM ID:
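The coordinate and length fields in this table can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 352310
end_bp = 355264
gene_length_bp = 2955
protein_length_aa = 984

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop.
codons, leftover = divmod(span_bp, 3)
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinate span matches the Gene Length field
print(leftover == 0)                  # length is a multiple of three
print(residues == protein_length_aa)  # codons minus the stop match Protein Length
```

Under those assumptions the three fields agree: 2955 bp corresponds to 985 codons, one of which is the stop.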


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATT TTGCCCGCCG ACCCACGATC GATGAGATCG AGCAACTCAG GGCCCAGGTC 
AGGAGAGAGC CGGGCTCGCC CGCCTTCGTC CGCCTCGGCG AGGCCTACCT CGCCCTCGGC
CGTCCCCAGG ACGCGATCGA GGTCGGCGTT CCCGGCCTGC GCGACAACCC CGACAGCACC
GCCGGCCGCA TGATGATCGG CCGCGCCTAC GTCATGCAGC ACCAGTGGAA GGAGGCGCAG
GACGAGCTAC TGCGCCTGGT CAAAGCCGAC CGCAACCACG CCGCTGGCTT TGCGCTGCTC
GGCGAGGTGC TGCTGCGCCG CCAGGATCTC AAGCGCGCGC TGCCGGTGTT GCAGCACGCC
CAGAACCTCA ACCCGACCAA TCCCTACGTC GAGGATCTGC TCAAGCGCGC GCGCACCGGA
CGCGCTCCCG ACCCGCCGCC GCCGCCGCCG GTGCCGCAGG ATCCCCAGCC CGCGGCCGCG
CCCTACGCCA GCGCCAGCGC GCACAACGAT CCCCTGCAGT TCGGCGTCGA CGAGCCCACG
CGCGTGGCCG GCGACATGAG CAGCGGCCTC ACCCCGCCGC CCGCGGCCGC CCCGCACGGG
CACGCGCCGC AGTATCCCGG CCATCAATCC GCGCCCGCGC CCGCCCCGGC GCCGCGGCCC
AGGCAGCAGC AACAGGCGAT CACCGATCCC GGCGCCAAGA CCATCATGGC CGGCGCCGCG
CCCGCGGCCG AGTCCAAGCC GCCGTCGCCG CGTCCGATCC CGCGGGCCCT GGCCCAGCCC
GCCCCCCAAG CGGCGCCCGC GCCCGCGCCC GCGCCCAAGG CCGAGCCCGA AACGAAGAAG
AAGCCCAAGG TCGCGGCCGC CGACATGCCG CCGCCGGGAC CGCCGACGGG CGTGCGTCCT
CGCATCCTGT CGATGGACAA GCCCACCGAC GCCGCGCGCG AGGCCATGCA TCAGGCCGCC
GACGTCGGCG ATTACCTCAA CGCCCTGCTC ACCCAGGGGC TGCTCAACGT GCCCGCCGTG
CAGGCGCAGC ACACGCCCCA CGCCTACCGC AGCGGCAAAC GCTGGGGCCA CTCGGTCACG
CGCACCTTCG TGTTCCTCTT CGTGCTGCTG GCCGTGGGAC TCGGCGGCGG CGCCTTCTGG
ATCTACCAGA CCGAGCAGCA GCGCGAAGAA GAGGTCGCCC GGCACATCGC CAGGGCCGAC
ACCCTGCTGA CCACGGGCAC CTACAACGAC GTGCAGGAGG CGCGCGAGGC CACGGCCAGC
GCGCTCAGGC GCGACCCGGC CAGCGTCAGC GCCATGGCCA AGTTCGCGCG TGTCGAGACC
GCGGCCGCCC TGCTCTACGG CACCCCGGCG GCCAAGGGCA CCTCGGCCAT GCTGCGGGCC
CGCAAGGAGC TCACCGAGGA AGACGCGGCC TGGGCCGACA TCGTCTTCGC CGAGATCGCC
AGCACCCTGG CCACGCTCAG CGACGAAGCC GCGGGCGCGC CTCAGGAGCG GCTCGGCGAC
GCGCGCAAGA CCGCGGACGC CTGGCTCGAG AAGCATCCCG ACGACGCCTG GGTGCGCTGG
CTCAGCGGCG TGGCCATGCT CTACGCCAAC GACCTCAGCG GCGCGGCCGA GGCCTTCGAA
GCCGCCGAGG CCGACGGCGA GGGCCCGGTG GTCGCCAGCA TCTACCGCGC CGACATGCAG
ACCGACGCCG GCGAGCTCGA CGGCGCGTCC GTGCGCTACG AGGCCGCGCT CGAGCGCGCG
CCCAAGCATC CGCTGGCCAT CATCGGCAAC GCCCTGGTGC AGCTCGCGCG CGCCGAAGAG
GCCGCCGTGG TGCTTGGCAA GATCAACACC ACCATGACCG AGGACGAGGG CCCGCGGGCC
AACGCCTACC GCGCCCTGGT GTTTGCCCTG GCCTATCTGT GGGTGGCCTG GGATTACGAT
CAGTTCACCG AGAACCTGGC CAAGGCCGAG GGCGTGGCCG AGCCGCGCTT CCTCGGACGC
GTGGCCCTGG CGCAGCTCGC CAACGGCCAG TTCGCGGCCG CTGGCGACAC CCGCAACCGC
ATCGTCTGGT ACGTCGCCGA GCCCGAGCAG ACCCACCCCG TGGTGGCCGC GGTCGACGCC
GAGTTGCAGT GGAGTTCGGG CCTGTCGGCG CCCGCCATCG AGCAGGTCGG CGACAGCGAG
AACATCCGCG CCCGGCACCT GGTCGGCCGC GCGCTCTTCG ATCTCGGACG CCTGGACGAA
TCCGAGAAGG CCTTCGCCGA GATCCTCGAA TTTGCGCCCG AGGACTGGGA GGCGCAGACC
TGGCACGCGG CCGCGCAGCT CGCCCAGGCC AAGGGTCGGG CGCGCAACGA ATTCGACGAG
ACCCTGCAGC AGATCGCGCG CACCCAGTCC AACCAGGTGG TGCGCTACGT GCGCGGCCTG
GCCTGGCAGC GCTCGGGCGA CATGCGCGAG GCGCGCCGAC ACTTCGAGGA GTCGCTCGAG
GGCGTCGCCG ACGAGATCAC CGAGTTGGCC GAGGAGCGCG CCAACTCGAT GGCGTATCGC
GCGCACGCGG CGCTGGCCGA GCTCGACCTC GACGCTGGCA GCACTGACAA AGCCGTCGCC
CATCTCGAGC GCGCCGTGGC GCTCAACCCC GCGTATCTGC CGGCCATGGC CTTGCTCGGT
CGCGTGCAGG TGCAGCGCGG CCAGCACGCC GAGGCCGCGG CCACGCTCAA GCCGCTGCTC
GAGGAGCCCG AGATCGCCAA CGCCGCCGTG GAGCTGGCCT ACGCCGAAGC CCTGGTCGGC
GGCGGATCGC CCTCGGACGA GGCCCGCGGC CAGGCCCGCG AGGCCGTGTT GCGGGCCAAG
GACAAGGGCG CCTCGGCCGA GGAGCTGGGC CGCGTCGCCG CCCTGGTCGA CGAGACCCTG
GCCACCGAGA TCGGCGCTGG CGGCGCCGAA GAGAAGCCGA GTCGGCGTCG CGGGCGTCGT
CGCGGCCGCC GCTGA
 
Protein sequence
MSDFARRPTI DEIEQLRAQV RREPGSPAFV RLGEAYLALG RPQDAIEVGV PGLRDNPDST 
AGRMMIGRAY VMQHQWKEAQ DELLRLVKAD RNHAAGFALL GEVLLRRQDL KRALPVLQHA
QNLNPTNPYV EDLLKRARTG RAPDPPPPPP VPQDPQPAAA PYASASAHND PLQFGVDEPT
RVAGDMSSGL TPPPAAAPHG HAPQYPGHQS APAPAPAPRP RQQQQAITDP GAKTIMAGAA
PAAESKPPSP RPIPRALAQP APQAAPAPAP APKAEPETKK KPKVAAADMP PPGPPTGVRP
RILSMDKPTD AAREAMHQAA DVGDYLNALL TQGLLNVPAV QAQHTPHAYR SGKRWGHSVT
RTFVFLFVLL AVGLGGGAFW IYQTEQQREE EVARHIARAD TLLTTGTYND VQEAREATAS
ALRRDPASVS AMAKFARVET AAALLYGTPA AKGTSAMLRA RKELTEEDAA WADIVFAEIA
STLATLSDEA AGAPQERLGD ARKTADAWLE KHPDDAWVRW LSGVAMLYAN DLSGAAEAFE
AAEADGEGPV VASIYRADMQ TDAGELDGAS VRYEAALERA PKHPLAIIGN ALVQLARAEE
AAVVLGKINT TMTEDEGPRA NAYRALVFAL AYLWVAWDYD QFTENLAKAE GVAEPRFLGR
VALAQLANGQ FAAAGDTRNR IVWYVAEPEQ THPVVAAVDA ELQWSSGLSA PAIEQVGDSE
NIRARHLVGR ALFDLGRLDE SEKAFAEILE FAPEDWEAQT WHAAAQLAQA KGRARNEFDE
TLQQIARTQS NQVVRYVRGL AWQRSGDMRE ARRHFEESLE GVADEITELA EERANSMAYR
AHAALAELDL DAGSTDKAVA HLERAVALNP AYLPAMALLG RVQVQRGQHA EAAATLKPLL
EEPEIANAAV ELAYAEALVG GGSPSDEARG QAREAVLRAK DKGASAEELG RVAALVDETL
ATEIGAGGAE EKPSRRRGRR RGRR
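The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the method used to generate this record: it builds the codon table from the standard NCBI base ordering (translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons), strips the spacing used in the listing, and translates until the first stop codon. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the sequence listing."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three 10-base blocks and the final line of the gene sequence above.
first_block = "ATGAGCGATT TTGCCCGCCG ACCCACGATC"
last_block = "CGCGGCCGCC GCTGA"

print(translate(first_block))  # MSDFARRPTI
print(translate(last_block))   # RGRR
```

The two printed fragments match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein sequence and the GC content field to be checked in the same way.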