Gene Hoch_1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1431 
Symbol 
ID8543813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1928574 
End bp1929605 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content72% 
IMG OID646386143 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003265878 
Protein GI262194669 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGCA CTCACCGAGG ACGCCAAGCG ACCAACCCAG GGCGCCGCTC CGCGCGAGCG 
CGCGAGCGCC GGCTGCCGCG CGGGCTGCTG GTGGCGCTGC GCGAGGCCGG CGTGGATGTC
GAGCGGGTGG CGACCCGCGC CGGCCTCGAC CCGAGAGCGC TGAATGAGTT CGTCCGATCG
GACGAGAGCG GTGCGTTTCT GCGCGAGGCG CTCGCTCAGG TGCCGCCGTG GTTCGGTCTG
AGCGCGGGTG CCGAGGTGCG TCCCGAGCTG TGGGGCGTGG TCGGCCTGGC GGCGATGAGC
AGCGCGAGCT TCGGTGCGGC CCTGGCGCGG GTCGCGCGCT ACAAGCGCAT TATGAGCAGC
GACGAGTTGC TGATCGACGA CCGCGGTGAC GAGGTCGCGG TGTGTTTTCG CCTGGGCAAC
GCCGCGGCGC CGTACGCGCG CCAGCAGCTC GACTCGGAGC TCGCGTTTCT GGTGTCGCTG
GGTCGTCGGC TGAGCGGAGC GCCGCTGCAG CCGCTGCGCA TCGCCATCGA GCTGTCGCGG
CCCAGCTATC ACGAGCGCTA CCGCGCGCTG TTCGCGTGCC CGCTGGCGTT CGAGCAGCCG
GCCACCGAGC TGGTGTTTCG CGCGCGCGAC CTGGCGCGCC CGCTGCTGAG CGCCGACGCC
GAGCTGGCCG AGGAGTTCTC GGCCCGCGCC GCGCGGCTCA TGCCGGCCGA GTGCACGCTC
GCGGTGGCCG AACAGGTTCG CCTGGCCCTG CGCGGTGCGC TGCGCGGCGA GGTTCCGAGC
CTGGCCGAGA TCGCGCGCCG CATGCACCTG AGCGAGCGCA CGCTGCAGCG CCAGCTACGC
GGCAACGGCA CCTCGTTCAC GCGCCTGGTG GACGAGGTCC GTCAGGAGCT GGCCCGCCGC
TATCTGGGCG GCGACGAGCT GCACGCCGCC GAGGTCTCGT ATCTGCTCGG GTTTGCGCAT
CCCAACTCGT TTTTCCGCGC TTTCAAGCGC TGGACCGGGC TCACGCCCGA GGAGTATCGC
GAATCGCATT GA
 
Protein sequence
MERTHRGRQA TNPGRRSARA RERRLPRGLL VALREAGVDV ERVATRAGLD PRALNEFVRS 
DESGAFLREA LAQVPPWFGL SAGAEVRPEL WGVVGLAAMS SASFGAALAR VARYKRIMSS
DELLIDDRGD EVAVCFRLGN AAAPYARQQL DSELAFLVSL GRRLSGAPLQ PLRIAIELSR
PSYHERYRAL FACPLAFEQP ATELVFRARD LARPLLSADA ELAEEFSARA ARLMPAECTL
AVAEQVRLAL RGALRGEVPS LAEIARRMHL SERTLQRQLR GNGTSFTRLV DEVRQELARR
YLGGDELHAA EVSYLLGFAH PNSFFRAFKR WTGLTPEEYR ESH