Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_4967 |
Symbol | |
ID | 8547375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 6846975 |
End bp | 6849212 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646389641 |
Product | protein of unknown function DUF1156 |
Protein accession | YP_003269349 |
Protein GI | 262198140 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.241084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGC CGGCGGACGC CGGAGCGCGC GCCGGCCGTC GGCTGATCGA CCGCGGTCTG CTGGCAAAGG CCTCGGCGGC GGGTCTGCGC GAGCGCTACC AGCGGGCGCG CTCGCCGCAC ACGGTGCACG TGTGGTGGGC GCGGAGGCCG CACGCGGCCA TGCGCGCGCT GGTGTTCTCG GCTCTGGCCG CCGACGACGA TGCCGCCGCC ACGCGGCTGG CAGAATTGCT CGCGGCGGCC CCGGCCCGGG GCGACGATGC CCGGTCGGCA GCGGCGGCGA ACGCGACGGC GGCGGCGAGC GCGGCTGCTG CGGCGGCCGA GCTGCGCGCG CGCTACGGCC GCGCGCCGCG GGTTCTCGAT ATGTTCGGCG GCGGCGGCAC CATCGGCTTC GAGGCCGCGC GCCTGGGCGC CGAGGCGCAC GCGCTCGACT GCAACGAGCT GGCGGTGTTC ATTCAACGCA CGCTGCTGAT GCACGCGCGC GGCGGCGAGC GCCGGGTTCT GGTGGGGCTG CTCGAGGACA CGCTCGCGCT CGTGCTCGAG CGCTTGCACG CGCGCACCCG CTGGCTGTAT CCGGCCGCGG CCGAGGGCGT CGCTATCTAT CTATGGAGCT ACGCGCTTGC GTGTCCGAGC TGCGGTTTCG AGATGTTTCT CGGCAAGCGG CCGTGGCTGT CGCGGCTGCG CGGGCGACGG CTGCGCTTGC AAGTGCGCGC CGGGCCCCAC GGCCACCGCT GGGACCGCTT GCTCGAGTCC GAGGATGGCG CGGAGGCTGA GCCGACCCAG GGGGTGTGGC GCGGTCGCCG CGGCCGCGTG CGCTGTCCGG CGTGCGCCGG CGAGTTCGCG CGTCCGCAGA TGGCGTCGTG CCGCGAGCTG TGCGTGGCCA CGGCGGCGCC CGAGCGCGGC GGCAAGCGCT TTCGCCTGGC GACGAGCGCC GACCTTCCGG CCGGTGAGGC GCTGGCGAGC GCGAGCGCGG CGCTGCTCGC CGAGCTCGAG AGCGCGCTGC CGGCGACGCC CTTGCCGGTG TGGTCGGGCA TCGTCAATCC CGCTCTGTAC GGCATGCGTA CGTACGGCGA CATCGTCAAC CCGCGCCAGC GCGTGGCTCT GCTCGCGCTG CTGGTGGAAC TCGGACGCGC GTACGACGAG CTGCGGGCGA GCCGCGGTGA GGCCGCTGCC CGGGCGGTGG TGGCGCTGGC CAGCGGGCTT ATCGACCAGC TCGTCGACTG GAACTGTCGG CTGTCGATGT GGATCCCGCA GAACGAGCAG GTGGGGCGCG CGTTCTGCGG CCCGGGCGTG GCCATGCTCT GGGATTATGC GGAGATCGAC CCGACCGGCG CCGGGCCGGC CAACCTCCGC GACAAGGCCA GGCGCATCGT GGCCGGGGCG CGTCTGCTCG GCGACGGCCA CGGGCGCTGC CGCGTGTATC ACGGTCGGGC GCAGGCGCTG CCCTTCGCGC GCGGCTGCTT CGACGCCGTG GTTACCGACC CGCCGTACTA CGATAATCTG TTCTACAGCG TGCTGGCCGA CTTCTTCTAT ACCTGGAAGC GCCTGCTTTT CCGCCGTATC GAGCCGACTC TATTCGCCGC GCCGGCGAGC TCGACGCGGG CCGAGCTGGT CGCCTGTTCC CATCGCGCCG GCAGCGCGGC GGCCGCGCAC GCGCTGTACT GCGAGCAGCT CGGCGAGGCG GTCGCCGAGG CCGCGCGCGT GCTGGCGCCC GGGGGCGTGT TCGCGTTGGT CTACAGTCAC GCGGCGCTGG CCGGGTGGGA GGCGCTGGTG CGTGCCTACC GCGGCGCCGC GCTGCGCCTG TGCAGTGTGC AGCCGCTCGC CGTGGAGCGG CGGCAGCGTC CCCGCGCCAT GCACGCGGCC GCGGTCAACA TCTGTGTGGT GCTGATCGCC CGCCGAGCGG AGGATGCCGC AGTGGCCGAT TCGCTGGGCC AGGCGAACTC GCCGGCAGCG CTGCGCGTGC GCGTCGCTGA GCTGATCGCG AGCGCGGCCG CCGACCCGGT GCTGGCGTCG TGGCCCGAGG CCGACCTCGG CCTGGCGGTT TTCGCCCAGG CGGCCGGAAT CATCGCCAAC AGCGCCGGGT TCGTGGACGC GGCTGACGCC GATGCGGGAC GCGTCGATGC CGCAGGCGGC GGGGGCGGCG CGGTTGGGCA GACGCTGCGG CGCGCGCTGC GCGACAGCGC GGAGGCCGTA CACGCCCGCT GGGCCGGGTT CCGGCTGCTC GAGCGCCACT CGATGTAG
|
Protein sequence | MSAPADAGAR AGRRLIDRGL LAKASAAGLR ERYQRARSPH TVHVWWARRP HAAMRALVFS ALAADDDAAA TRLAELLAAA PARGDDARSA AAANATAAAS AAAAAAELRA RYGRAPRVLD MFGGGGTIGF EAARLGAEAH ALDCNELAVF IQRTLLMHAR GGERRVLVGL LEDTLALVLE RLHARTRWLY PAAAEGVAIY LWSYALACPS CGFEMFLGKR PWLSRLRGRR LRLQVRAGPH GHRWDRLLES EDGAEAEPTQ GVWRGRRGRV RCPACAGEFA RPQMASCREL CVATAAPERG GKRFRLATSA DLPAGEALAS ASAALLAELE SALPATPLPV WSGIVNPALY GMRTYGDIVN PRQRVALLAL LVELGRAYDE LRASRGEAAA RAVVALASGL IDQLVDWNCR LSMWIPQNEQ VGRAFCGPGV AMLWDYAEID PTGAGPANLR DKARRIVAGA RLLGDGHGRC RVYHGRAQAL PFARGCFDAV VTDPPYYDNL FYSVLADFFY TWKRLLFRRI EPTLFAAPAS STRAELVACS HRAGSAAAAH ALYCEQLGEA VAEAARVLAP GGVFALVYSH AALAGWEALV RAYRGAALRL CSVQPLAVER RQRPRAMHAA AVNICVVLIA RRAEDAAVAD SLGQANSPAA LRVRVAELIA SAAADPVLAS WPEADLGLAV FAQAAGIIAN SAGFVDAADA DAGRVDAAGG GGGAVGQTLR RALRDSAEAV HARWAGFRLL ERHSM
|
| |