Gene Hoch_3356 details

Gene Information

Locus tag: Hoch_3356
Symbol:
ID: 8545744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4644035
End bp: 4645846
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 70%
IMG OID: 646388023
Product: hypothetical protein
Protein accession: YP_003267751
Protein GI: 262196542
COG category: [S] Function unknown
COG ID: [COG4402] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03382] Myxococcales GC_trans_RRR domain
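
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields above.
start_bp = 4644035
end_bp = 4645846
gene_length_bp = 1812
protein_length_aa = 603

span_bp = cds_span(start_bp, end_bp)
residues = expected_protein_length(span_bp)

print(span_bp == gene_length_bp)      # coordinates agree with Gene Length
print(residues == protein_length_aa)  # codon count agrees with Protein Length
```

Under these assumptions, the span is 1812 bp, which corresponds to 604 codons: 603 residues plus one stop codon.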


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.313315
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.309242
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCATC AACTCATCGA CTCATTGACG AAATGGAGTA ACTCCGTGTC CCTCATCCGC 
ACTCTCAGCA GCGCCGCGCT CGGCCTCGTC GGCGCGCTCA GCCTGGGCGC CCTGGCCGGT
CAGCCGGCGC TGCCCAGCGC CCAGGCCTGC GGCTGCTTTG CCCAGCCCGA TCCCACCGCG
CCCGTGGTCC AGGGCGGCGA GCGCATCGTC TTCGCCATGG AAGACGGCGT GGTCACCGCC
CACATCCAGA TCCAGTACAC GGGCGCGGCC GAGGAGTTCG CCTGGCTGCT GCCCCTGCCC
TCCGAGCCCA GCTTCACGCT CGGCAACGAG GAGATGTTCG CCCGCCTCAT CGACGCCACG
CAGCCGCGCT ATCGCCTGGG GCCACGCATC GACCCCGAGA CCTGCCCAGC GCCGCCGCCG
CCCGTGGCCG CGCCCGGTGG ACCCGACTCG GGCAACGGCC CGGTGGTCAG GCGCGAGCTC
GTCGGCCCCT ATGAGGGCTT CATCCTGAGC GCGGAGGACA AGCAGCCGCT GCTCGACTGG
CTGTCCGATA ACCGCTTCTT CGTCGCCACC GACGGCGACG ACGCCCTCGA TCCCTACATC
CGCGCGGGCG GCTACTTCCT GGCCATCCGC CTGGCGCCCG GCTACGACGC CGGCGATCTG
CAGCCGGTGG TGGTGTCGTA TCGCTCGGAG CTGCCGCAGA TTCCGATCGT GCTCACCAGC
ATCTCGGCGC TGGCCGATAT GCCGATCATG GTGTGGGTGC TGGGCGAGCA CCGCGCGATT
CCGCGCAACT TCTTCCACAC CCAGATCAAC GACGCGCGCA TCGACTGGCT GAACAACGCC
GCCAACTACG TCGAGGTGGT GACCGACGCG GTCGACGAGG CCGAGGGGCA CCACTCCTTC
GTCACCGAGT ACGCGGGCAC CAGCGAGGTC ATGCGCGACC GCCTGGACTA CACCGGGCGC
TTCGGCGACC CCGAGGAGCT GCGCGCGCTC TCCGACCCCG GCGACTACCT CGAGTATCTG
CTCTGGCACG GCTACCAGGA GATCGCGCCC AACGGCGCGG AGGTGGTCAG CGCGCCCTTT
GTCTCGCTGG TCGAGGAGTT TCTGCCGCTG CCGCCGGAGC TGGTGGCCGC GATCGAGGCC
GATATCGGCG AGACCATCAC GGCCGGCGCG CTGTTCTGGG ACTATCGCTA CTGGCTCGAG
CAGTACCCCG ACATCCTGGG CCCGGCGCAC GCGGAGTTCG ACGCCGACGG TCTCACCGAT
GTGCTGATCG AGCGCATCGT CGAGCCGCTG CGGCAGGCCG ATGCGCTGTT CGATGAGCAT
CCCTATCTCA CCCGCATGTT CACCACGCTG TCGCCGGACG AGATGCTCAA GGACCCGGCG
TTCAGCTTCA ACCCGGATCT CGACGAGGTG TCCAATATCC ACATCGCCAC GGTCGAGATC
CTCGAGTGCG CAGAGTCGTC CCCCGACTTC GACGGCCCCA CCATCCTCAC CACCGAGCAG
GGTCGGCGCC TGTACTTCCC GAACGGTCTC GACGACACCG CCTGGCAGGA CGTGGGCATG
CCGGCGAGCC TGCGCACCGA GGTGCTGCGC GAGGAAGGCG CGCCCATGGT GGTGAGCGAT
AACGCCGCCG CCATCGACAG CGCCATCGAC GAGTACCGCC CGGTGCCGGC CGTGCCCGAC
CCCGACGAGG ACGAGGGCGG CTGCGCGGCG GCGCCGGGCA CCGGCACAGG TCCGCACCCG
GGCACGCTGC TCCTGGTGCT GCTCCTCGGC GGCCTGGCCG CGGTCGCGCG TCGCCGCGGG
CGCGAGCGCT GA
 
Protein sequence
MTHQLIDSLT KWSNSVSLIR TLSSAALGLV GALSLGALAG QPALPSAQAC GCFAQPDPTA 
PVVQGGERIV FAMEDGVVTA HIQIQYTGAA EEFAWLLPLP SEPSFTLGNE EMFARLIDAT
QPRYRLGPRI DPETCPAPPP PVAAPGGPDS GNGPVVRREL VGPYEGFILS AEDKQPLLDW
LSDNRFFVAT DGDDALDPYI RAGGYFLAIR LAPGYDAGDL QPVVVSYRSE LPQIPIVLTS
ISALADMPIM VWVLGEHRAI PRNFFHTQIN DARIDWLNNA ANYVEVVTDA VDEAEGHHSF
VTEYAGTSEV MRDRLDYTGR FGDPEELRAL SDPGDYLEYL LWHGYQEIAP NGAEVVSAPF
VSLVEEFLPL PPELVAAIEA DIGETITAGA LFWDYRYWLE QYPDILGPAH AEFDADGLTD
VLIERIVEPL RQADALFDEH PYLTRMFTTL SPDEMLKDPA FSFNPDLDEV SNIHIATVEI
LECAESSPDF DGPTILTTEQ GRRLYFPNGL DDTAWQDVGM PASLRTEVLR EEGAPMVVSD
NAAAIDSAID EYRPVPAVPD PDEDEGGCAA APGTGTGPHP GTLLLVLLLG GLAAVARRRG
RER
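
Both sequences above are printed in blocks of 10 characters, 6 blocks per full line. A minimal sketch of how such a listing might be joined back into a continuous string and sanity-checked follows. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are embedded here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGACTCATC AACTCATCGA CTCATTGACG AAATGGAGTA ACTCCGTGTC CCTCATCCGC"
last_line = "CGCGAGCGCT GA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# The listing has 30 full lines followed by the short last line.
total_bp = 30 * len(head) + len(tail)

print(len(head), total_bp)   # bases per full line, total bases
print(head[:3], tail[-3:])   # first codon and last codon
print(gc_fraction(head))     # GC fraction of the first line only
```

The total reconstructed this way matches the listed gene length of 1812 bp, and the sequence begins with ATG and ends with TGA, a stop codon under translation table 11. To compute the GC content of the whole gene, pass the full cleaned sequence to `gc_fraction` rather than a single line.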