Gene Hoch_4967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4967 
Symbol 
ID8547375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6846975 
End bp6849212 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content75% 
IMG OID646389641 
Productprotein of unknown function DUF1156 
Protein accessionYP_003269349 
Protein GI262198140 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.241084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGC CGGCGGACGC CGGAGCGCGC GCCGGCCGTC GGCTGATCGA CCGCGGTCTG 
CTGGCAAAGG CCTCGGCGGC GGGTCTGCGC GAGCGCTACC AGCGGGCGCG CTCGCCGCAC
ACGGTGCACG TGTGGTGGGC GCGGAGGCCG CACGCGGCCA TGCGCGCGCT GGTGTTCTCG
GCTCTGGCCG CCGACGACGA TGCCGCCGCC ACGCGGCTGG CAGAATTGCT CGCGGCGGCC
CCGGCCCGGG GCGACGATGC CCGGTCGGCA GCGGCGGCGA ACGCGACGGC GGCGGCGAGC
GCGGCTGCTG CGGCGGCCGA GCTGCGCGCG CGCTACGGCC GCGCGCCGCG GGTTCTCGAT
ATGTTCGGCG GCGGCGGCAC CATCGGCTTC GAGGCCGCGC GCCTGGGCGC CGAGGCGCAC
GCGCTCGACT GCAACGAGCT GGCGGTGTTC ATTCAACGCA CGCTGCTGAT GCACGCGCGC
GGCGGCGAGC GCCGGGTTCT GGTGGGGCTG CTCGAGGACA CGCTCGCGCT CGTGCTCGAG
CGCTTGCACG CGCGCACCCG CTGGCTGTAT CCGGCCGCGG CCGAGGGCGT CGCTATCTAT
CTATGGAGCT ACGCGCTTGC GTGTCCGAGC TGCGGTTTCG AGATGTTTCT CGGCAAGCGG
CCGTGGCTGT CGCGGCTGCG CGGGCGACGG CTGCGCTTGC AAGTGCGCGC CGGGCCCCAC
GGCCACCGCT GGGACCGCTT GCTCGAGTCC GAGGATGGCG CGGAGGCTGA GCCGACCCAG
GGGGTGTGGC GCGGTCGCCG CGGCCGCGTG CGCTGTCCGG CGTGCGCCGG CGAGTTCGCG
CGTCCGCAGA TGGCGTCGTG CCGCGAGCTG TGCGTGGCCA CGGCGGCGCC CGAGCGCGGC
GGCAAGCGCT TTCGCCTGGC GACGAGCGCC GACCTTCCGG CCGGTGAGGC GCTGGCGAGC
GCGAGCGCGG CGCTGCTCGC CGAGCTCGAG AGCGCGCTGC CGGCGACGCC CTTGCCGGTG
TGGTCGGGCA TCGTCAATCC CGCTCTGTAC GGCATGCGTA CGTACGGCGA CATCGTCAAC
CCGCGCCAGC GCGTGGCTCT GCTCGCGCTG CTGGTGGAAC TCGGACGCGC GTACGACGAG
CTGCGGGCGA GCCGCGGTGA GGCCGCTGCC CGGGCGGTGG TGGCGCTGGC CAGCGGGCTT
ATCGACCAGC TCGTCGACTG GAACTGTCGG CTGTCGATGT GGATCCCGCA GAACGAGCAG
GTGGGGCGCG CGTTCTGCGG CCCGGGCGTG GCCATGCTCT GGGATTATGC GGAGATCGAC
CCGACCGGCG CCGGGCCGGC CAACCTCCGC GACAAGGCCA GGCGCATCGT GGCCGGGGCG
CGTCTGCTCG GCGACGGCCA CGGGCGCTGC CGCGTGTATC ACGGTCGGGC GCAGGCGCTG
CCCTTCGCGC GCGGCTGCTT CGACGCCGTG GTTACCGACC CGCCGTACTA CGATAATCTG
TTCTACAGCG TGCTGGCCGA CTTCTTCTAT ACCTGGAAGC GCCTGCTTTT CCGCCGTATC
GAGCCGACTC TATTCGCCGC GCCGGCGAGC TCGACGCGGG CCGAGCTGGT CGCCTGTTCC
CATCGCGCCG GCAGCGCGGC GGCCGCGCAC GCGCTGTACT GCGAGCAGCT CGGCGAGGCG
GTCGCCGAGG CCGCGCGCGT GCTGGCGCCC GGGGGCGTGT TCGCGTTGGT CTACAGTCAC
GCGGCGCTGG CCGGGTGGGA GGCGCTGGTG CGTGCCTACC GCGGCGCCGC GCTGCGCCTG
TGCAGTGTGC AGCCGCTCGC CGTGGAGCGG CGGCAGCGTC CCCGCGCCAT GCACGCGGCC
GCGGTCAACA TCTGTGTGGT GCTGATCGCC CGCCGAGCGG AGGATGCCGC AGTGGCCGAT
TCGCTGGGCC AGGCGAACTC GCCGGCAGCG CTGCGCGTGC GCGTCGCTGA GCTGATCGCG
AGCGCGGCCG CCGACCCGGT GCTGGCGTCG TGGCCCGAGG CCGACCTCGG CCTGGCGGTT
TTCGCCCAGG CGGCCGGAAT CATCGCCAAC AGCGCCGGGT TCGTGGACGC GGCTGACGCC
GATGCGGGAC GCGTCGATGC CGCAGGCGGC GGGGGCGGCG CGGTTGGGCA GACGCTGCGG
CGCGCGCTGC GCGACAGCGC GGAGGCCGTA CACGCCCGCT GGGCCGGGTT CCGGCTGCTC
GAGCGCCACT CGATGTAG
 
Protein sequence
MSAPADAGAR AGRRLIDRGL LAKASAAGLR ERYQRARSPH TVHVWWARRP HAAMRALVFS 
ALAADDDAAA TRLAELLAAA PARGDDARSA AAANATAAAS AAAAAAELRA RYGRAPRVLD
MFGGGGTIGF EAARLGAEAH ALDCNELAVF IQRTLLMHAR GGERRVLVGL LEDTLALVLE
RLHARTRWLY PAAAEGVAIY LWSYALACPS CGFEMFLGKR PWLSRLRGRR LRLQVRAGPH
GHRWDRLLES EDGAEAEPTQ GVWRGRRGRV RCPACAGEFA RPQMASCREL CVATAAPERG
GKRFRLATSA DLPAGEALAS ASAALLAELE SALPATPLPV WSGIVNPALY GMRTYGDIVN
PRQRVALLAL LVELGRAYDE LRASRGEAAA RAVVALASGL IDQLVDWNCR LSMWIPQNEQ
VGRAFCGPGV AMLWDYAEID PTGAGPANLR DKARRIVAGA RLLGDGHGRC RVYHGRAQAL
PFARGCFDAV VTDPPYYDNL FYSVLADFFY TWKRLLFRRI EPTLFAAPAS STRAELVACS
HRAGSAAAAH ALYCEQLGEA VAEAARVLAP GGVFALVYSH AALAGWEALV RAYRGAALRL
CSVQPLAVER RQRPRAMHAA AVNICVVLIA RRAEDAAVAD SLGQANSPAA LRVRVAELIA
SAAADPVLAS WPEADLGLAV FAQAAGIIAN SAGFVDAADA DAGRVDAAGG GGGAVGQTLR
RALRDSAEAV HARWAGFRLL ERHSM