Gene Hoch_0339 details


Gene Information

Locus tag: Hoch_0339
Symbol:
ID: 8542719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 477222
End bp: 478292
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 65%
IMG OID: 646385136
Product: hypothetical protein
Protein accession: YP_003264873
Protein GI: 262193664
COG category:
COG ID:
TIGRFAM ID:
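The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 477222
end_bp = 478292
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is excluded from the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1071 bp span, 357 codons, 356 amino acids plus a stop codon.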


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.110925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCACT CGCAGCAGGC GCGCGCGTCG CGCAAGAAAC AGATTCTCTT CGTCGGTGGC 
GACGTGGCGC AGACCCGCCA GATCCACGAC GTGGCCAAGC ATCTGGGCGA CTACGAGCAG
TATTTCTCGC CGCACTGGGG CGACCGCTTC ATCTCCCTGG TGCGCGAGCT CGGCCTCATC
GAGTACACCA TCGCGGGCAA CAAGCGCGGT CAGAACACGC TCGATTACCT GCACGAGCAG
GGCCTGCGCG TGGACAAGTA CGGACGCCGC GGCTGCTACG ACCTGGTGGT GTCGTGCAGC
GATATCCTGG TTCCGCGCAA TATCCGCTAC ACCAAGCTGG TGGTGGTGCA GGAGGGTATC
TTCGACCCCG AGCACCGCTC GTATCGCCTG ATCCGTCTGC TGCCGTTTCT GCCGCGGTGG
ATGGCGGGTA CGGCCATGAC CGGCATGAGC GGCCTGTACG ACGCGATCTG CGTGGCCAGC
CCGGGTTTTC GCGCGCACAT GATCGCGCGC GGCGCCGACC CCAACCGCGT GCACATCACC
GGCCTGATCC ACTACGACAA CTGCCGGCTC TACGAAGATA ACGAATTCCC CCATCGCGGC
TACGTGCTCG CCTGCACCTC GGACGGGCGC GAGACCTGGA AGGCCGACGA CCGCGAGGCC
TTTATCGCCC GGGCGCTGGA GCTGGCCCAG GGCCGCCAGG TGATCTTCAA GCTGCATCCC
AACGAGGACT ACGAGCGCTC AGAGGCCGAG ATTCGCGCGC AATCGGCCGA TGCGCTGATC
TATTACCGCG AGCCCGGCAT CAAAGCCGAG GAGATGGTCG CCAACTGCGA GGTGCTTTTG
ACCGAGTGGT CGACATTGGT GTTCGTCGGC CTGGCGCTCG GCAAGGAGTG CTACTCGTAT
CACGATATGG AGCTGCTCAA GCAGCTCATG CCGATTCAAA ACGGCGGCAG CTCGGCCGAG
AAGGTCGCCG AGATCTGCCG CCGCATCATC GAATCGCCCG AGCCGCCGAC GCCCGTGGTC
ATGGACCCCA AGCGCTCGCT GGCCACGCGC ATCGCCGAGG CGTTTCACTA G
 
Protein sequence
MNHSQQARAS RKKQILFVGG DVAQTRQIHD VAKHLGDYEQ YFSPHWGDRF ISLVRELGLI 
EYTIAGNKRG QNTLDYLHEQ GLRVDKYGRR GCYDLVVSCS DILVPRNIRY TKLVVVQEGI
FDPEHRSYRL IRLLPFLPRW MAGTAMTGMS GLYDAICVAS PGFRAHMIAR GADPNRVHIT
GLIHYDNCRL YEDNEFPHRG YVLACTSDGR ETWKADDREA FIARALELAQ GRQVIFKLHP
NEDYERSEAE IRAQSADALI YYREPGIKAE EMVANCEVLL TEWSTLVFVG LALGKECYSY
HDMELLKQLM PIQNGGSSAE KVAEICRRII ESPEPPTPVV MDPKRSLATR IAEAFH
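
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the tool used to produce this record: it uses the standard codon assignments, which translation table 11 shares for all non-start positions, and it does not handle alternative start codons (not needed here, since the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence above (both start on a codon boundary).
first_row = "ATGAATCACT CGCAGCAGGC GCGCGCGTCG CGCAAGAAAC AGATTCTCTT CGTCGGTGGC"
last_row = "ATGGACCCCA AGCGCTCGCT GGCCACGCGC ATCGCCGAGG CGTTTCACTA G"

print(translate(first_row))  # MNHSQQARASRKKQILFVGG
print(translate(last_row))   # MDPKRSLATRIAEAFH
```

The two outputs match the first 20 and last 16 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.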