Gene Hoch_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2044 
Symbol 
ID8544426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2822225 
End bp2823784 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content75% 
IMG OID646386747 
Productputative sigma54 specific transcriptional regulator 
Protein accessionYP_003266482 
Protein GI262195273 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.526812 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCG ACGCCGCGGC GGCGAGCTGG GACACGCGGG TTTTCGGTCT GACCATCCTC 
TACCACCGGG ATCTCGACCG CGTGGGCGAG CGCGCGCTGT GCACCGGCAT CGAGACCGGC
CGCGCCTTTG CGCTGTCGCG CAGCGAGCCG CTGTTCGCGG CCCCGGGCGG ACGCCAGCGC
AGGCCGCTCG ACGATACGCA GGTGTCGCGA TCGCCGCTGC GCTTGGTAGG CGCGGGCGAG
GACGATGAGC CGGGCGCGAT CCTCCTCGAG TGCGCCGGCA GCCGCACCCG CGTGCGCGTC
GACGGCGAGC CGGCGAGTGG CTCGCTGCGG CTGTCGCGGG CCATGCTCGA GCGCGGCGTG
GTGCTGGCGC TGGGCGGCCG CGTGCTGCTG CTCGCGCACC TGCTGGAGCC GCAGAGCGAC
GCCGGGGAGG CGGATTTCGG ACTCATCGGC CACAGCCCGG CCATGCTGCG CGTGCGCCGC
GATATCCGCC GCGTGGCCGA CCTCGAGGTG CCGGTGCTGG TGCGCGGCGA GAGCGGCACC
GGCAAGGAGC TGGTGGCCCG CGCGATCCAC GACGCGGGCG GGCGCGCGAG CGGTCCGTAT
GTGGCCGTCA ACCTGGCCGC GGTGCCGCAG TCGCTGGCCG CGGCCGAGCT GTTTGGCGCG
GTGCGCGGCG CGTTCACGGG TGCCAATCGA GACCGCCGCG GCCACTTTGC GCGCGCGCGC
GGCGGCACGC TGTTCCTCGA CGAGATCGGC GAGACCGTGC CCGAGCTGCA GGTGCTGCTC
TTGCGCGTGC TCGAGACCGG CGAGATTCAG CCGGTGGGCG CCGATGCCGT CCAGCCGGTC
GACGTCCGCG TGGTGTCGGC GACCGACGCC GATCTCGAGG GCGCGATCGC GGCCGAGCGC
TTCCGCGCGC CGCTCCTGTA CCGCCTGGGC GGCTACGTGA TCCGCCTGCC ACCGCTGCGC
GAGCGACGCG AGGACATCGG CCGGCTGCTG GTGTACTTCC TGCGCCGCGA GCTGGTCGCC
CTGGGCCTGG GCGAGGCGCT CGCGCCCGCC GATACGCCGT GGCTGCCGGC GTCTGTGGTC
GAGCGCCTGC TCACCTACGC CTGGCCGGGC AACGTGCGCC AGCTCGCCAA CCTGGTGCGC
CACCTGGTCA TCGCCAACCG CGACCAGACT CGGGCCGCGC ACTTCGAGAT GATCGACGCG
CTGTCCGAGG TCGATGCATC CGCAGCGCCC AAAGCGCCCG CGCCAGCAGC GCCGGCAGCG
CCTGCGCCGA TGAGCGAGGC CCCCGTGCGC CCGGCCGATA TCGGTGACGA TGAGCTGGTG
GCCGCGCTGC GCGCGCACGC GTTTAGCCCC GAGCGGGCGG CGAGCGCGCT CGGCATCCCG
CGCTCGTCGA TCTATCGGCT GATGGACCGC TGCGCGCGGG TGCGCAAGGC CTCGGAGCTG
AGCGCCGAGG AGATCGAAAG CGCGCGCGCG CGCGCGGACG GTGATGTGCG CGCGGCCGCC
GCGCTGCTCG AGGTCTCGGC GCGCGCGCTG CGCCGGCGCA TGAGCGCGCT CGGACTGTGA
 
Protein sequence
MDIDAAAASW DTRVFGLTIL YHRDLDRVGE RALCTGIETG RAFALSRSEP LFAAPGGRQR 
RPLDDTQVSR SPLRLVGAGE DDEPGAILLE CAGSRTRVRV DGEPASGSLR LSRAMLERGV
VLALGGRVLL LAHLLEPQSD AGEADFGLIG HSPAMLRVRR DIRRVADLEV PVLVRGESGT
GKELVARAIH DAGGRASGPY VAVNLAAVPQ SLAAAELFGA VRGAFTGANR DRRGHFARAR
GGTLFLDEIG ETVPELQVLL LRVLETGEIQ PVGADAVQPV DVRVVSATDA DLEGAIAAER
FRAPLLYRLG GYVIRLPPLR ERREDIGRLL VYFLRRELVA LGLGEALAPA DTPWLPASVV
ERLLTYAWPG NVRQLANLVR HLVIANRDQT RAAHFEMIDA LSEVDASAAP KAPAPAAPAA
PAPMSEAPVR PADIGDDELV AALRAHAFSP ERAASALGIP RSSIYRLMDR CARVRKASEL
SAEEIESARA RADGDVRAAA ALLEVSARAL RRRMSALGL