Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_6872 |
Symbol | |
ID | 8549296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 9408967 |
End bp | 9413346 |
Gene Length | 4380 bp |
Protein Length | 1459 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646391537 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003271229 |
Protein GI | 262200020 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.795402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGGCA AGATGGCGCC GCCCAGCGTC CCGGTGACGC TGGCCCCCCC GGGCGCCAAG CCGGGCGCCA AGCCGGCTCC GCCCGCAGCT CAGGGCAATG ACGACATCGG CCTGGGCGGC TCGGGCAACG AGGCCGCGAG CAGCGCCGCG CCCGAACCCG AGGTCAAGGC GCCCAACCTC GATCTCGGCG CCCCCGGCGA TGCCGATATC GTCCGGCCGA CGCTGCCGCC CAGCGCCGAC GATGTCGATC TCGGGCACTC GGGCAATGAG GATTACGACG ACGAGCCCAA CGAGTTCCTC GACCTCAGCC CCGCGGCCGC CGACGACGAC CTGCTCGATC TCGGCGCCCC GGGCTCGAAC GACGCAGAGC GCAAGCTGGC CCGCGACACC GCCGCGACCA TGGATATCGG CCCGCTCGGC ATCGGCGACC TCGATGTCGG CGCTGCCGAT CTGCCCGTGA GCGCCGGCAA ACGCGGCAGC GAGGGCGCCG ATCTCCCAGC CCCGCTCGAG GATCCCGAGA TCTTCGATCT GCCCGCGCCG CTCGAGGCCG CCGACGACGA TGCCGATCTG CCCGCGCCCA GCCACGTCGC GCGGCAGCCT TCGGCCGATC TGCCCGCGCC CTCCATGCGC CGAGCCTCGG CCGAGTTGCC GACCCCGGCC GCGCGCCCGT CGCTGAGCGA TCTGCCCGCG CCCTCTGCGC GCGGCGGCAG CGACCTCCCG GCTCCGCGCG GCGCCGGCGC GAGCGCTCTC GACCTGCCCG CTCCGGCCGG CGGCGCCGAG CTGCCGCAAG CCGCGGGCAA CGCCATCGAC CTGCCCGGCG ATATCGGCCT CGATCTACCG GCCTCCTCCG CCGAGCTGCC GGCCTCCTCT GCCGAGCTGC CGGCCTCGGC CGACGACTTC TTCGGCGACC TGCCCCAGGC GGCCCAGGAC TCCCTGCCGA CGCCGGCGAG CGAGCTGCCC GCCGCCACCG GCGAGCTGCC CGCGCCCACC GACGACTTCT TCGGCGACCT GCCCCGGCCG GCCGGCACGG GTGCATCGCC CGCTGGCAAC CGCTCTGGGG CCGGCGCTGA GCTGCCGGCC TCGGCCGACG ACTTCTTCGG CGATCTGCCC CAGACCGCGG CCGCGCCCCC GGCCGCGAGC CAGGCGGGCG CCGAGCTGGA CGATCCGCTC GGCGGCGACG ACTTCGACCC GCTGGGCGAC ATCGGCCTGG GCCTGGGCTC GGGCGACGCC GACGCGCGCA GCGACTTCAA CCCGCCCGAC GTGTCCAAGC CGGCCGCGCC CTCGCCCGCC GGTGGCGGCC TCGACGACGA TCCCTTCGGT CTCGGCGACA GCAGCAGCCC GCTGTCGCTG TCCACCGACG ACGGCCCGGC GCCCGCCAAA TCGGCCGACG ACGCCATGTC TCTGGGAGGG CTCGAGCTCG AGCCCTCGCG CGAAGACGAC CCGCTCGGCA TGGCGTCCAT CCCGCTCGAC CACGGCGGCG CCTCGCCATC ACCCGCGCAG GCCAGCGCCG AGAAGAGCGG TATCGACGAC CTCGGCGACA TGGGTCTCGG CGGCGGCGAC GATACCCTTG GCCTGGATCT CGGTCTGCCC GATCCCGGCA GCATGGCGGG CGCGCCGGCG CCGCCCCCGA CCGCGGCCAA AGCGCCGGCT CCGGCGGCCG ACGCCGGCGG CGACGACTTC GGTCTGCCCA GCATCAGCGG CGGCGGCGAC GACTTAGGTC TGCCCAGTAT CCCGGCCTCC TCCGAGGCGC CCAGGGCCAA ACCCAAAGCC AAAGCCAAGC CCAAGACCAA GCCCAAACCC GAGCGCCGTC GCCCGGGCCG GCGCGGCGAG AAGGAGCCAG AGGAGAAGCT CGACCTCGAG GACACCGCGA TCCCGCGTGT CCAGCAGGGG CACATGGCCG CGTCCGCGCT CACCGAGCGC CACCTGATCC GCCGCCGCCA GCGGCGCAGG CGCATCATCA TGTCGGTGGC CGCGGTCCTG GTGCTGGCCG CGGGCACCGG CGGCTTCTTC GTGTATCGCA GCTGGGCCGC GCAGCAGGAG ATCGCCGCCA ACCTCAGCGA GCAGCGGAGT GCGGCGCTGG CCGCGCTGCA GGCAGGCGAG CCCGATCACT GGGAACGCGC CGTCGCCGCC GCCGACAAGG CGCTCGCGGT CGCGCCCCAA GATCCCGACG CGCTGGGCAT CGCGGCGCAG GGCGCGTACG CGGCTATCAT CGATCAGGGC CGCGACGCCG AGCGCTGGCG CCAGGCCGGT CTCGACTACA CGCGCCGCAT CGACGACAGC TCGCTCGGCG GCGCTCACGT GGACAAGGCG CAGGCGCTCA AATCCATCGA TCAGGGCCGC CCGGCGGCCG CCATCAGCCG TCTGCAGCAG GTGCTCTCGC GGACGCCGAA CGACCCCGAC GCGGTGCTCT ACCAGGGCTG GGCGCACGCC GCCGCGCGCG ACAACACACA GGCCGCGCAG TTCTTTGAAA AATCCCTGGA GCTGTCCCCG AAGCGCCCGC TGCCGGCCCT CTACGGTCTG GGCCGTGCGC AGCTTGCCCA GGGCGATCGG GACAGCGCGC GGGCGACCTT TGCCCAGATC CTCGAACAGC GCGAGAGTCA CCTGGGCGCG CTGGTCGGCG CCGCCCAGGC CGCCGATGTC GGCGGCAGCG ACAAACGCGA GGCCAAGCTG CTCGAGATCG TCAACCGCCC CGACGCGGCC GAGGGCGATC CCCGCGAGCT GTCGCGGGCG TGGTCACTGG CCGCCTACAT CTCGCTCGAC AGCGGCCGCA TCGACGAGGC CGGCCGACGC TTCGAGCAGG CCATCAACGC CCACCCGGCC AACGTCAACG CCCTGGTCGG CCGGGCCCGC GTCGCGCTCG CCCAGGAGCG CTACGAGGAC GCCACCGAGC AGCTCACCCG CGTGATCGGT ACCGATCTCA ACGCCGTCGA TCCGGTGCGT AATCTCGACG CCCTGCTCAC CCTCACCGAG CTGGCCATGC GCACCGACAA GCCCGACGAG GCCACCGCCC ATCTCGAGCG CGTCTTCGCC GCCAAGGAGC AGATCGACGA CCGCCGCGGG CTGTCCCGGG CCTACGTGAT GCAAGGCCAG ATCCTCGGCG CCGACGAGAG TCAGCGCGAG GCCGCGATCG CGGCCTACGA GGAGGCCCTG ACGCTGGCCG GCGACGACGC CCTGGAATCG GCGCTCGCGC TCGCCGACCT GTACACCTTG CTCGGCCGCA TGGAGAAAGC GCGCGCCGTT CTCGCCCCGG TCGAGCGCCG CGCCGCCAGC GACGCCGCGG CATCGGTGGC GCTCGGCATT CGCTACATGC GCGCCAACGC CTGGACCGAC GCCGAGACCT GGCTGCGCAA GGCGCTCGAG CTCGAGCCCG GCGACGTCGA CGCGCAGTTC CAGCTCGGTC AGGTCCTGGC CTCGCTGGAG CGCTACGACG AGGCCTTTGA GATGCTCAAG AGCGCGGCCG AGGCCGCTCC CGAGCGCTCC GATATCGGCC TGCGTCTGGC CATCCAGTAC GAGAAGCTCG ACCGCGACGA GGAGGCCGCC GCCGCCTACG AGCAGCTCCT CAGCACCTCC TCGCCCACCG TGGATACGCT GGCGCGCGCC GGCCGCTTCT ACGCCCGCCA GGGCCAGACC GACAAAGCCG GCGAGATCGG CGAGCGCATC CTGGCCATCA AAGACACCGC CGCCGCCGGC CACTACCTGC TCGGCGAGGG CCAGTTCGCC CAGGGCGAAC ACGCCGCCGC CCGCGACAGC TTCCGCCGCG CCGCCGACAT CGAGCAAGAC CCCCAGTACC TGGAGGCCAC CGCGCGCGCG GCCGAGCGCA TGGAGCTCTA CGACGAGGCC TTCGAGGCCT ACTCCGAAGC CTCGCGACGC GCGCCCGAGT ACATTGCCCC GCGGCTGGGT CGCGCGCGCC TGCTCATCGC CCGCCGCGAC TTCCAGCGCG CTACCGAAGA ACTCGAAGAC CTGCGCAAGA TCGCGCCCAA CGAGGCCAGC GTGTTCCACT ACCTGGGCGA GAGCCTGCAG GCGCAGGAGA AGCACAAAGA GGCGATCTCG TACTTCCGTA CCGCGCTCGG CATCGACGGC AGGCGCGCCG AGACCCACTA CCGCCTCGGC AAGTCGTACA TCGAACGCGG CGACGAGCGC GACGCCGCCA GCGAGTTCAC CACCGCCACC CAGATCGCCC GCAACAGCGC GACGCCGCCG TCGTGGCTGG TCGATGCCTA CTACGAGCTG GGCTACGTGC AGCGCGCGCT GAGCCGGCGC GGCGAAGCCG TCCGCGCCTG GGACGCGTAT CTCGAGCTGG TGCCCGAGGC CGAGCAGAAC GAGACCAAGG TCAAAGAGGT CAAGCGGCTG CTCATGGGTC TCAAGGCACA GCTCCGATAG
|
Protein sequence | MPGKMAPPSV PVTLAPPGAK PGAKPAPPAA QGNDDIGLGG SGNEAASSAA PEPEVKAPNL DLGAPGDADI VRPTLPPSAD DVDLGHSGNE DYDDEPNEFL DLSPAAADDD LLDLGAPGSN DAERKLARDT AATMDIGPLG IGDLDVGAAD LPVSAGKRGS EGADLPAPLE DPEIFDLPAP LEAADDDADL PAPSHVARQP SADLPAPSMR RASAELPTPA ARPSLSDLPA PSARGGSDLP APRGAGASAL DLPAPAGGAE LPQAAGNAID LPGDIGLDLP ASSAELPASS AELPASADDF FGDLPQAAQD SLPTPASELP AATGELPAPT DDFFGDLPRP AGTGASPAGN RSGAGAELPA SADDFFGDLP QTAAAPPAAS QAGAELDDPL GGDDFDPLGD IGLGLGSGDA DARSDFNPPD VSKPAAPSPA GGGLDDDPFG LGDSSSPLSL STDDGPAPAK SADDAMSLGG LELEPSREDD PLGMASIPLD HGGASPSPAQ ASAEKSGIDD LGDMGLGGGD DTLGLDLGLP DPGSMAGAPA PPPTAAKAPA PAADAGGDDF GLPSISGGGD DLGLPSIPAS SEAPRAKPKA KAKPKTKPKP ERRRPGRRGE KEPEEKLDLE DTAIPRVQQG HMAASALTER HLIRRRQRRR RIIMSVAAVL VLAAGTGGFF VYRSWAAQQE IAANLSEQRS AALAALQAGE PDHWERAVAA ADKALAVAPQ DPDALGIAAQ GAYAAIIDQG RDAERWRQAG LDYTRRIDDS SLGGAHVDKA QALKSIDQGR PAAAISRLQQ VLSRTPNDPD AVLYQGWAHA AARDNTQAAQ FFEKSLELSP KRPLPALYGL GRAQLAQGDR DSARATFAQI LEQRESHLGA LVGAAQAADV GGSDKREAKL LEIVNRPDAA EGDPRELSRA WSLAAYISLD SGRIDEAGRR FEQAINAHPA NVNALVGRAR VALAQERYED ATEQLTRVIG TDLNAVDPVR NLDALLTLTE LAMRTDKPDE ATAHLERVFA AKEQIDDRRG LSRAYVMQGQ ILGADESQRE AAIAAYEEAL TLAGDDALES ALALADLYTL LGRMEKARAV LAPVERRAAS DAAASVALGI RYMRANAWTD AETWLRKALE LEPGDVDAQF QLGQVLASLE RYDEAFEMLK SAAEAAPERS DIGLRLAIQY EKLDRDEEAA AAYEQLLSTS SPTVDTLARA GRFYARQGQT DKAGEIGERI LAIKDTAAAG HYLLGEGQFA QGEHAAARDS FRRAADIEQD PQYLEATARA AERMELYDEA FEAYSEASRR APEYIAPRLG RARLLIARRD FQRATEELED LRKIAPNEAS VFHYLGESLQ AQEKHKEAIS YFRTALGIDG RRAETHYRLG KSYIERGDER DAASEFTTAT QIARNSATPP SWLVDAYYEL GYVQRALSRR GEAVRAWDAY LELVPEAEQN ETKVKEVKRL LMGLKAQLR
|
| |