Gene Hoch_5008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5008 
Symbol 
ID8547418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6906756 
End bp6908153 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content73% 
IMG OID646389684 
ProductRestriction endonuclease S subunits-like protein 
Protein accessionYP_003269390 
Protein GI262198181 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.441656 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCTG ACGACGGCGC CGACGCGCTC GAATTCGATA GCGGCTACGC GCTCGGCGAG 
CTGCTCGATG ATCTCGGACC GTGGTCCGGG AAACCCGGCG CAGCGCTGCC CGAGGGCTGG
CGCTGGTCGT CGCTGGGCGC TCTGGCCACG GGCAAAGCTC GCTACGGCGT CAATCTGCCG
GCGCGCCCCT ACGACGCCGG GCTGCCGCGC TTCGTGCGCA TCACCGATAT CGGCGACGAC
GGTCGGCTGC GCGACGACGC GCCCGTGAGC CTGAGCGATC CGGGCGCGGC CGATTATCGC
CTGAAACCCG GGGATCTGGC GGTTGCGCGC TCGGGCGCCA CGGTCGGCAA GTCGTATCTG
TACCGCCCCG AGGACGGCGT GTGCGTGCCC GCCGGCTATC TGTTGTGCGT GCCCCTGGCG
CCGTCGCGCT GCGAGCCGGC GTTCGTGGCC CAGTGGGCAC AATCGCGCGG CTATCGCGCG
TGGCTGCGCA GCGCGGTGCG CACGGCCGCG CAGCCCAATG TCAACGCCAG CGAGCTGGCG
ACCCTCCCGG TGCCGGTGCC GCCGCTCGAG GAGCAGCGCG AGGTGGCCCG CGTCCTCGCG
CTCGGCGACG CGCTGCTCGC GCACAGCGGG CGCATCATCG ACAAGCTCGG CCTGGTGTTG
GCGGCGCTGG TGCGCGATCT GCTCTCGCGC GGCATCGGGG AGGACGGCCG CATCCGCGAC
CCGGCGCGCC ATCCCGAGCT GTTTCGCGAG ACGCCGCTGG GTCTGCTGCC GCGCGCGTGG
TCGGTGAGCG AAGCCGGCGA GCTGTTGGCC GCGCTCAAAC CGGCGATGCG CAGCGGGCCC
TTTGGCAGCG AGCTGCGCAA GTCCGACCTG GTCGCCGAAG GCGTGCCTCT GCTGGGCATC
GACAACGTCG ACACCGACGC CTTCGTACCG CGGTATCGGC GCTTCGTGCC GCCGCATCTG
TTCCAGGCGC TGGGGCGCTA CGCGGTGCGG CCGGGCGACG TCATGGTCAC GGTGATGGGC
ACGGTCGGGC GCTCGTGCGT GGTCCCGGAC GATATCGGCG ACGCGCTGTC TTCCAAGCAC
GTGTGGACGC TCAGCTTCGA TCCCGAGCGC TACCTGCCGC TGCTGGCCTC GCTGCAGTTC
AACTACGCGC CCTGGGTGCA CGCGCATCTC ACCCGCGAGG CGCAGGGCGG CACCATCGCG
TCGATTCGCT CGAGCACGCT GCGCAGCACG CTGTTGCCGG TGCCGCCGCT GGCCGAGCAG
CGCGCGATCG CCGAGGTGCT GGCGCGCACC CGCGGGCGCA TGGCGGCCGA GCGGCGACTG
CGCAGCGTGC GCCGGCGGCT GCGCGACGCA CTGCGCGAAG ATCTGATGAC CGGGCGCGTG
CGCGCTCGCC CGGGGTAG
 
Protein sequence
MSADDGADAL EFDSGYALGE LLDDLGPWSG KPGAALPEGW RWSSLGALAT GKARYGVNLP 
ARPYDAGLPR FVRITDIGDD GRLRDDAPVS LSDPGAADYR LKPGDLAVAR SGATVGKSYL
YRPEDGVCVP AGYLLCVPLA PSRCEPAFVA QWAQSRGYRA WLRSAVRTAA QPNVNASELA
TLPVPVPPLE EQREVARVLA LGDALLAHSG RIIDKLGLVL AALVRDLLSR GIGEDGRIRD
PARHPELFRE TPLGLLPRAW SVSEAGELLA ALKPAMRSGP FGSELRKSDL VAEGVPLLGI
DNVDTDAFVP RYRRFVPPHL FQALGRYAVR PGDVMVTVMG TVGRSCVVPD DIGDALSSKH
VWTLSFDPER YLPLLASLQF NYAPWVHAHL TREAQGGTIA SIRSSTLRST LLPVPPLAEQ
RAIAEVLART RGRMAAERRL RSVRRRLRDA LREDLMTGRV RARPG