Gene Hoch_1687 details


Gene Information

Locus tag: Hoch_1687
Symbol:
ID: 8544069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2296404
End bp: 2298029
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 71%
IMG OID: 646386395
Product: hypothetical protein
Protein accession: YP_003266130
Protein GI: 262194921
COG category:
COG ID:
TIGRFAM ID:
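
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are chosen for this example and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2296404
end_bp = 2298029
gene_length_bp = 1626
protein_length_aa = 541

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # True
print(span_bp % 3 == 0)                           # True
print(expected_protein_aa == protein_length_aa)   # True
```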


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000265405
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000045435
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAGCTACG AAGATCACAA GCAACCCCAA TCACAGCGCA ATAACGAGCT GCAGTCGGCT 
CACGCGATTC AACGCGCCCC CGCGCCGGGC AAGCGCACGC GCACCATGAG CATGCCGCAG
GCCGGTGCCG CGCGCGCGCC GGTGCAGCGC AAGGCCAGCC CCGGAGCCGC GAGCGCGGGC
GCCGACAGCG CCTCGACCGC CCGCTGGATG AACACGGCCA TGCGCCCCGA CCTGCACCCC
ATGCCGGTGC AGCGCAAAGC CGCCCCCGGC GGCGCGGTGG TGCAGCGCAA GAGCGAGCCC
TCGGGCCAGG GCGGCGGCTC GACGCACATC CCCACGCGCG AAGAGGTGCA CGCCAACCCG
GCCGCCTTCG GCATCCCCGA GGGCGGCAAT CCCGACGACT TCTTCAACGA GTACGTAGCC
CACACGCTGG CGTACATGCA GCCGGAACAG CTCGACGTGA ACGCGCTCGA CCCCAACGCC
GACGATTACC AGTCGCGGGT CGCCACCATC GAACGCCACC GCGCCACCAT GCGCGGCTGG
GGCTACGACC CCGACAGCCT CACCTTCCTC AACGACAGCC GCTCGGGCGA CAACGAGACC
GGCCTGCAGG CCGTGCGCAT CGACCCGCTC GATCCCGAGG GCGAGCACGG CTCCATCGTC
GGCTTCCGCG GCACCGAGCC GGTCGCCGGA CATCAGTCGA CGCTGACCAA TCCCACCGGC
TTCGCCGACG ATGTCGCCAC CGACCTCGGC CGCGACATCG GCGGCAACCA GTACCGCGAG
AACCAGGAGC GCATCCACGC GCTCATCGCC GGCGGCACCG GCCCCATGAC GCTCACCGGA
CACAGCCTCG GCGGCGCCCT GGCCCAGCAC GCGGCCGCGG GTAGCACCGA CCTCAACGTC
TCCAACGTGG TCGGCTTCCA GGCCCCCGGC ATCGACAGCT CCGCCGCCAA CTCGTTCAAC
GCGGCCAACG CCGACGGCCA CATCGACGTG CGCTTCCACG AGCACGAGAA GGACGTCGTG
CACCGGGCGG GCGAGCAGAA GCTCGACGGC ACGCACTACA CCTGGCACGA CACCAACGAT
CCCTCGTTCC TCGGCGCCCA CCTCACCAAC TTCATGTACA ACGGCGTCAA CGGCGACGGT
GAGCAGGTCA CCAACGTCGG CGCCGGCAGC TCGCCCGCCG TCACCGAGCA CGATCCGATC
ACCAACCGCC AGGGCTGGGA AGGCGGTCGC CGCATCCTCG GCGGCGGGCT CAACGTCGTG
GCCTCGCCCT TCCAGGGCGC CTTCGCGCTC GGCCAGGGTC TCGGCAACGC CGCCGTCAAC
GCCGGTCAGG GCGTGCTCGA CAGCGGCAGC AGCCTGGTCG GCGGCATCTC CGAGGGCGCC
TCGCAGATGG GCAACGGCGA AGTCCTCAGC GGCCTCGGTA CCATGGCCGG CGGCGTCTGG
GACGGCACCA CGGGCCTGCT CGGCACCGGC GCCAACCTGG TCGGCGACGT CGCGGGCGCA
GCCGTCAGCG GCGTCACCAA GGCGGGCTCG GAGCTCGTCG AAGGCGTCGG CACCGTGGCC
CACGGCATCG GCAACCTCGC CACCTGGGGC GCCCAGGGCA TCGGCCGCCT GTTCTCGGGC
GACTGA
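
The record reports a GC content of 71% for this gene. A minimal sketch of how such a figure could be computed from a blocked listing like the one above follows. It assumes GC content means the share of G and C bases over the full sequence length; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling. Only the first line of the listing is used as a short demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGAGCTACG AAGATCACAA GCAACCCCAA TCACAGCGCA ATAACGAGCT GCAGTCGGCT"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this first line only
```

Pasting the whole listing into `clean_sequence` gives the value for the full gene, which can then be compared with the reported figure.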
 
Protein sequence
MSYEDHKQPQ SQRNNELQSA HAIQRAPAPG KRTRTMSMPQ AGAARAPVQR KASPGAASAG 
ADSASTARWM NTAMRPDLHP MPVQRKAAPG GAVVQRKSEP SGQGGGSTHI PTREEVHANP
AAFGIPEGGN PDDFFNEYVA HTLAYMQPEQ LDVNALDPNA DDYQSRVATI ERHRATMRGW
GYDPDSLTFL NDSRSGDNET GLQAVRIDPL DPEGEHGSIV GFRGTEPVAG HQSTLTNPTG
FADDVATDLG RDIGGNQYRE NQERIHALIA GGTGPMTLTG HSLGGALAQH AAAGSTDLNV
SNVVGFQAPG IDSSAANSFN AANADGHIDV RFHEHEKDVV HRAGEQKLDG THYTWHDTND
PSFLGAHLTN FMYNGVNGDG EQVTNVGAGS SPAVTEHDPI TNRQGWEGGR RILGGGLNVV
ASPFQGAFAL GQGLGNAAVN AGQGVLDSGS SLVGGISEGA SQMGNGEVLS GLGTMAGGVW
DGTTGLLGTG ANLVGDVAGA AVSGVTKAGS ELVEGVGTVA HGIGNLATWG AQGIGRLFSG
D
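
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below shows one way to reproduce that mapping. The codon-to-amino-acid assignments for table 11 are the same as in the standard code; the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG. The function name `translate` is chosen for this example, and translation stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence and the final six bases, copied from the record.
print(translate("ATGAGCTACG AAGATCACAA GCAACCCCAA"))  # MSYEDHKQPQ
print(translate("GACTGA"))                            # D
```

The two printed fragments match the first ten residues and the final residue of the protein sequence listed above.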