Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_0237 |
Symbol | |
ID | 8542616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 352310 |
End bp | 355264 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646385033 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003264771 |
Protein GI | 262193562 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATT TTGCCCGCCG ACCCACGATC GATGAGATCG AGCAACTCAG GGCCCAGGTC AGGAGAGAGC CGGGCTCGCC CGCCTTCGTC CGCCTCGGCG AGGCCTACCT CGCCCTCGGC CGTCCCCAGG ACGCGATCGA GGTCGGCGTT CCCGGCCTGC GCGACAACCC CGACAGCACC GCCGGCCGCA TGATGATCGG CCGCGCCTAC GTCATGCAGC ACCAGTGGAA GGAGGCGCAG GACGAGCTAC TGCGCCTGGT CAAAGCCGAC CGCAACCACG CCGCTGGCTT TGCGCTGCTC GGCGAGGTGC TGCTGCGCCG CCAGGATCTC AAGCGCGCGC TGCCGGTGTT GCAGCACGCC CAGAACCTCA ACCCGACCAA TCCCTACGTC GAGGATCTGC TCAAGCGCGC GCGCACCGGA CGCGCTCCCG ACCCGCCGCC GCCGCCGCCG GTGCCGCAGG ATCCCCAGCC CGCGGCCGCG CCCTACGCCA GCGCCAGCGC GCACAACGAT CCCCTGCAGT TCGGCGTCGA CGAGCCCACG CGCGTGGCCG GCGACATGAG CAGCGGCCTC ACCCCGCCGC CCGCGGCCGC CCCGCACGGG CACGCGCCGC AGTATCCCGG CCATCAATCC GCGCCCGCGC CCGCCCCGGC GCCGCGGCCC AGGCAGCAGC AACAGGCGAT CACCGATCCC GGCGCCAAGA CCATCATGGC CGGCGCCGCG CCCGCGGCCG AGTCCAAGCC GCCGTCGCCG CGTCCGATCC CGCGGGCCCT GGCCCAGCCC GCCCCCCAAG CGGCGCCCGC GCCCGCGCCC GCGCCCAAGG CCGAGCCCGA AACGAAGAAG AAGCCCAAGG TCGCGGCCGC CGACATGCCG CCGCCGGGAC CGCCGACGGG CGTGCGTCCT CGCATCCTGT CGATGGACAA GCCCACCGAC GCCGCGCGCG AGGCCATGCA TCAGGCCGCC GACGTCGGCG ATTACCTCAA CGCCCTGCTC ACCCAGGGGC TGCTCAACGT GCCCGCCGTG CAGGCGCAGC ACACGCCCCA CGCCTACCGC AGCGGCAAAC GCTGGGGCCA CTCGGTCACG CGCACCTTCG TGTTCCTCTT CGTGCTGCTG GCCGTGGGAC TCGGCGGCGG CGCCTTCTGG ATCTACCAGA CCGAGCAGCA GCGCGAAGAA GAGGTCGCCC GGCACATCGC CAGGGCCGAC ACCCTGCTGA CCACGGGCAC CTACAACGAC GTGCAGGAGG CGCGCGAGGC CACGGCCAGC GCGCTCAGGC GCGACCCGGC CAGCGTCAGC GCCATGGCCA AGTTCGCGCG TGTCGAGACC GCGGCCGCCC TGCTCTACGG CACCCCGGCG GCCAAGGGCA CCTCGGCCAT GCTGCGGGCC CGCAAGGAGC TCACCGAGGA AGACGCGGCC TGGGCCGACA TCGTCTTCGC CGAGATCGCC AGCACCCTGG CCACGCTCAG CGACGAAGCC GCGGGCGCGC CTCAGGAGCG GCTCGGCGAC GCGCGCAAGA CCGCGGACGC CTGGCTCGAG AAGCATCCCG ACGACGCCTG GGTGCGCTGG CTCAGCGGCG TGGCCATGCT CTACGCCAAC GACCTCAGCG GCGCGGCCGA GGCCTTCGAA GCCGCCGAGG CCGACGGCGA GGGCCCGGTG GTCGCCAGCA TCTACCGCGC CGACATGCAG ACCGACGCCG GCGAGCTCGA CGGCGCGTCC GTGCGCTACG AGGCCGCGCT CGAGCGCGCG CCCAAGCATC CGCTGGCCAT CATCGGCAAC GCCCTGGTGC AGCTCGCGCG CGCCGAAGAG GCCGCCGTGG TGCTTGGCAA GATCAACACC ACCATGACCG AGGACGAGGG CCCGCGGGCC AACGCCTACC GCGCCCTGGT GTTTGCCCTG GCCTATCTGT GGGTGGCCTG GGATTACGAT CAGTTCACCG AGAACCTGGC CAAGGCCGAG GGCGTGGCCG AGCCGCGCTT CCTCGGACGC GTGGCCCTGG CGCAGCTCGC CAACGGCCAG TTCGCGGCCG CTGGCGACAC CCGCAACCGC ATCGTCTGGT ACGTCGCCGA GCCCGAGCAG ACCCACCCCG TGGTGGCCGC GGTCGACGCC GAGTTGCAGT GGAGTTCGGG CCTGTCGGCG CCCGCCATCG AGCAGGTCGG CGACAGCGAG AACATCCGCG CCCGGCACCT GGTCGGCCGC GCGCTCTTCG ATCTCGGACG CCTGGACGAA TCCGAGAAGG CCTTCGCCGA GATCCTCGAA TTTGCGCCCG AGGACTGGGA GGCGCAGACC TGGCACGCGG CCGCGCAGCT CGCCCAGGCC AAGGGTCGGG CGCGCAACGA ATTCGACGAG ACCCTGCAGC AGATCGCGCG CACCCAGTCC AACCAGGTGG TGCGCTACGT GCGCGGCCTG GCCTGGCAGC GCTCGGGCGA CATGCGCGAG GCGCGCCGAC ACTTCGAGGA GTCGCTCGAG GGCGTCGCCG ACGAGATCAC CGAGTTGGCC GAGGAGCGCG CCAACTCGAT GGCGTATCGC GCGCACGCGG CGCTGGCCGA GCTCGACCTC GACGCTGGCA GCACTGACAA AGCCGTCGCC CATCTCGAGC GCGCCGTGGC GCTCAACCCC GCGTATCTGC CGGCCATGGC CTTGCTCGGT CGCGTGCAGG TGCAGCGCGG CCAGCACGCC GAGGCCGCGG CCACGCTCAA GCCGCTGCTC GAGGAGCCCG AGATCGCCAA CGCCGCCGTG GAGCTGGCCT ACGCCGAAGC CCTGGTCGGC GGCGGATCGC CCTCGGACGA GGCCCGCGGC CAGGCCCGCG AGGCCGTGTT GCGGGCCAAG GACAAGGGCG CCTCGGCCGA GGAGCTGGGC CGCGTCGCCG CCCTGGTCGA CGAGACCCTG GCCACCGAGA TCGGCGCTGG CGGCGCCGAA GAGAAGCCGA GTCGGCGTCG CGGGCGTCGT CGCGGCCGCC GCTGA
|
Protein sequence | MSDFARRPTI DEIEQLRAQV RREPGSPAFV RLGEAYLALG RPQDAIEVGV PGLRDNPDST AGRMMIGRAY VMQHQWKEAQ DELLRLVKAD RNHAAGFALL GEVLLRRQDL KRALPVLQHA QNLNPTNPYV EDLLKRARTG RAPDPPPPPP VPQDPQPAAA PYASASAHND PLQFGVDEPT RVAGDMSSGL TPPPAAAPHG HAPQYPGHQS APAPAPAPRP RQQQQAITDP GAKTIMAGAA PAAESKPPSP RPIPRALAQP APQAAPAPAP APKAEPETKK KPKVAAADMP PPGPPTGVRP RILSMDKPTD AAREAMHQAA DVGDYLNALL TQGLLNVPAV QAQHTPHAYR SGKRWGHSVT RTFVFLFVLL AVGLGGGAFW IYQTEQQREE EVARHIARAD TLLTTGTYND VQEAREATAS ALRRDPASVS AMAKFARVET AAALLYGTPA AKGTSAMLRA RKELTEEDAA WADIVFAEIA STLATLSDEA AGAPQERLGD ARKTADAWLE KHPDDAWVRW LSGVAMLYAN DLSGAAEAFE AAEADGEGPV VASIYRADMQ TDAGELDGAS VRYEAALERA PKHPLAIIGN ALVQLARAEE AAVVLGKINT TMTEDEGPRA NAYRALVFAL AYLWVAWDYD QFTENLAKAE GVAEPRFLGR VALAQLANGQ FAAAGDTRNR IVWYVAEPEQ THPVVAAVDA ELQWSSGLSA PAIEQVGDSE NIRARHLVGR ALFDLGRLDE SEKAFAEILE FAPEDWEAQT WHAAAQLAQA KGRARNEFDE TLQQIARTQS NQVVRYVRGL AWQRSGDMRE ARRHFEESLE GVADEITELA EERANSMAYR AHAALAELDL DAGSTDKAVA HLERAVALNP AYLPAMALLG RVQVQRGQHA EAAATLKPLL EEPEIANAAV ELAYAEALVG GGSPSDEARG QAREAVLRAK DKGASAEELG RVAALVDETL ATEIGAGGAE EKPSRRRGRR RGRR
|
| |