Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3961 |
Symbol | |
ID | 8546357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5460496 |
End bp | 5463672 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646388633 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003268353 |
Protein GI | 262197144 |
COG category | [S] Function unknown |
COG ID | [COG1729] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.357683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.258592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTGC TGCACCGACG CCATCGCCGC CGCGCCGCGG CCGCCGACAC GCGCCAGCGC GGCCGCTTCT CGGGCCTGCG GGCTGGGCTC GCCGGCCTGC TGCTGCTCGG CAGCACCAGC GCCGCCTTTG CCCAGGCCGA CATCGAGTAC GACCGCGGCC CGGCGGCCGA GCTGTACATC CGCAAGCGCC CGCCGCCGCC GGCGAGCCCG ACGCTCACGG CCGAGCTCGA GAGCATGCTC ACGGAGAAGG AGGCCGCGGC CGACGAGAAG CGCCGCGAGG CCATCGAGCT GTTGCGCGCC TTCATCGACA CCAAGCCCCA GGGCGAGGCC CGCGCCGAGG CGCTGTTCAA GCTGGCCGAG CTGCTTTGGG AAGACGCGCG CGTGGGCTTT ATCGCCCGCA TGGACCAATA CGAGCGCGCG CTCGAGGCCT GCCGCCAGGA CGACGAGGGC TGCAAGGAGC GCCCGAGCGA GCCGCGCATC GACCTCGACG AGCCCGCCGC GCTCTACCGC CAGCTCCTGG CCGAGTTTCC GCAGTTCCGG CGCGCTGACC TGGTGCTCTA CCTGGTCGGC TTCGCCGCCC GCGAGCAGCA GCAGTACCAG GAGTCGCTGG AGTATTTCGG CCAGGTGGTC GAACGCTACC CGGACTCGCC GCTGTACGGC GACGCCTGGA TGATGATCGG CGAGCACTAC TTCAGCACCG GCCAGTGGCC CGAGGCCCGC GCGGCCTACG CCAACGTGCT GGCGCGCCCG GACTCGCCGA CCTACGACCT GGCGCTGTTC AAGACCGCCT GGGCCGACTG GAAGCTCGGC GACCCCGATC TGGCCGCGCG CCGCTTCAAG CAGGTGCTCG ACCTGGCGGT GGAGGCCGAG ACCTCGGGCA GCGCGGTGCA GCGCCGCCGC CGCGCCCAGC TCCGCGACGA GGCGCTCGAG TACCTGGTGG TGGTGTTCAC CGAGGACCGC TCGATCTCGG CTCAGGAGGT CTACGACTTC CTGGCCTCGA TCGGCGGCAC GCGCTACTCG CGCGACGTGC TGGTGCGCGT GGCCGACGCG TATTTCGGAC AGAGCGAATA CGAGCGCGCG GCCCAGACCT ATCGCTTCCT CATCGATATG AAGCCCACGG GTATCGAGGC CGCCGAGTAC CAGCGCGCGG TGGTCGAGGC CTACGTGGCC GCGCTGCAGC CCGAGCAGGT CGAGGCCGAG ATGCGGCTGC TGGTCGAAAA CTACGGGCCG GCGTCGAAGT GGGCCGAGCA GAACGCCAAG TTCCCGACCC GCAAAGCGCG CTCCGAGCGG CTCACCGAGG CCATGGTGCG CAACACGGCC AAGAACTACC ACGCCGAGGC CCAGGCCGCC GAGAAGCGCG ACAAGAAGCC GGATCTGGCG CTCTACACCC AGGCCGCCGA CCTCTACCAG ACCTATCTCA CGGCCTACAC CGAGCACGAG AACGCCGCCG AGGTGCGCTT TCTGCGCGCC GAGATCCTGT ACTTCAAGCT GGGCAAGCTC GAGGAGGCCG GCGACGAGTA CCTGGCCGTG GCCCAGCAGA CCCCGGTCGG CAAGTACCAC AAGGACGCGC TGCTCAAGGC CATGGACGCC TTCGAGAAGG CGCGCCCCGA GAACGCGGGC AGCGCCGGCC AGCGCGAGCT GTCCGCGGCC GACCGCAAGT TCGCGGCCTC GGTGGACCTC TACGCCACGC TGTTCCCGGC CGATCCCGAG CTGGTCGGCG TGATCTTCCG CAACGGCGAG ATGTTCTACG ACTACGGCGA CTACGACGAG GCCATCAAGC GCTACGGCCT CATCGTCACC AAGTACCCGG ACGACCAGAA CGCGGGCCCC GCCGGTGACC GCATCCTCGA GTCGCTGGCC AAGGCCGAGG ACTACGAGAA CATCGAGGAG TGGGCGCGCA AGCTCAAGAC CGCCAAGGCC TTCCAGAGCA AGGAGCAGCA GTCCCGCCTC GACCGGCTGA TCGTCGAGTC GATCGGCAAG AGCGGCGAGC GCTACGCCGA GGCCGGCGAG TTCGAGAAGG CGGCGAGCTT CTATCTGCGC ATCCCCCAGG AGTTCCCGCA GCACACCATG GCGGCGCAGG CGCAGATGAA CGCCGGCGTG ATGTACGAGA AGGCCAAGCG GCCGCAGCGC GCCGGCCAGG CCTATCTGGC GCTGGCCGCG TCCTATCCCG ACAGTAAAGA GGCGCCCAAG GCGGCCTTTG CGGCCGGCCA GCTCTACGAG TCGGTGGCGT ATTTCGACCG CGCGGCCGAA GCCTACGAGG TCGTCGCCGA AACATTCCCG CGCTCGGAGC AGAGCGCGGA CGCTTTGTTC AACGCCGGCC TGCTGCGCCA GTCGCTCGAT CAGAACGAGC GCGCCATCGA GCACTACCAG ACCTACGCCA AGCGCTACCG CGGCAAGGCC GACGCCGCCG AGGTCGCCTT CCGCATCGGC GTGGTGTACG AAAACGCCGA GCGCTACGAC GACGCCGCCG ACGCCTATCG CCGCTACCTC AAGGGTCACG CGCGCAGCGG CCGGCACGTG GTCGAGGCGC ACACGCGCGT CGGCCGCAGC GAGCTGGCGG CCGGCCGGCT CAAGCGCGCG GGCAACGAAT TCGACGCCGC GCTCAAGGTG TTCCGCCGGC TCAAGGGCAA GCAACGCGAG ACCGAGAAGG CGTGGGCGGC CGAGGCCCGC TACCATCAGG GCGAGCTGAT CTACCGTCGC TTCGAGGCCA TCTCGCTCGA CGTCAAGCCG CGCCGGCTGC GGCGCACGCT CGACAGCAAG ACCGCGCTGC TGGCCAAGGC CCAGGACGTG TACCTCGACG TGGTCGACTT CGGCGACGCG CAGTGGGCGA CCGCGGCCCT GTTCCGCATG GGGCGCATCT ACGAGGGCTT TGCCGAGTCG CTGCGCGACG CGCCGGTGCC CCAGGGGCTG AGCGAGGACG AGGCCGAGAT GTACCGCCAG GAGCTCGAGA TGTACGTCAT CGAGGTCGAG GAGCAGGCCA TCGACCTGTA CGCGACCGGC TATCAGAAGG CGCTCGAGCT GGGCGTGTAC AACACCTACA CCAGCCAGAT CCGCACCGCG CTCGGACGCC TGGACTCGAT CGGCTACCCG CCCGCGCTCG AGGCCCGCGC GCGGGTGCGC CTGGGCGACC GGGTGCAGCC GCCGAGCGCG GTCGAGGAGG TGGTGCGCGA TGAGTAG
|
Protein sequence | MPLLHRRHRR RAAAADTRQR GRFSGLRAGL AGLLLLGSTS AAFAQADIEY DRGPAAELYI RKRPPPPASP TLTAELESML TEKEAAADEK RREAIELLRA FIDTKPQGEA RAEALFKLAE LLWEDARVGF IARMDQYERA LEACRQDDEG CKERPSEPRI DLDEPAALYR QLLAEFPQFR RADLVLYLVG FAAREQQQYQ ESLEYFGQVV ERYPDSPLYG DAWMMIGEHY FSTGQWPEAR AAYANVLARP DSPTYDLALF KTAWADWKLG DPDLAARRFK QVLDLAVEAE TSGSAVQRRR RAQLRDEALE YLVVVFTEDR SISAQEVYDF LASIGGTRYS RDVLVRVADA YFGQSEYERA AQTYRFLIDM KPTGIEAAEY QRAVVEAYVA ALQPEQVEAE MRLLVENYGP ASKWAEQNAK FPTRKARSER LTEAMVRNTA KNYHAEAQAA EKRDKKPDLA LYTQAADLYQ TYLTAYTEHE NAAEVRFLRA EILYFKLGKL EEAGDEYLAV AQQTPVGKYH KDALLKAMDA FEKARPENAG SAGQRELSAA DRKFAASVDL YATLFPADPE LVGVIFRNGE MFYDYGDYDE AIKRYGLIVT KYPDDQNAGP AGDRILESLA KAEDYENIEE WARKLKTAKA FQSKEQQSRL DRLIVESIGK SGERYAEAGE FEKAASFYLR IPQEFPQHTM AAQAQMNAGV MYEKAKRPQR AGQAYLALAA SYPDSKEAPK AAFAAGQLYE SVAYFDRAAE AYEVVAETFP RSEQSADALF NAGLLRQSLD QNERAIEHYQ TYAKRYRGKA DAAEVAFRIG VVYENAERYD DAADAYRRYL KGHARSGRHV VEAHTRVGRS ELAAGRLKRA GNEFDAALKV FRRLKGKQRE TEKAWAAEAR YHQGELIYRR FEAISLDVKP RRLRRTLDSK TALLAKAQDV YLDVVDFGDA QWATAALFRM GRIYEGFAES LRDAPVPQGL SEDEAEMYRQ ELEMYVIEVE EQAIDLYATG YQKALELGVY NTYTSQIRTA LGRLDSIGYP PALEARARVR LGDRVQPPSA VEEVVRDE
|
| |