Gene Hoch_3669 details

Gene Information

Locus tag: Hoch_3669
Symbol:
ID: 8546059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5047011
End bp: 5048366
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 71%
IMG OID: 646388337
Product: sigma54 specific transcriptional regulator, Fis family
Protein accession: YP_003268063
Protein GI: 262196854
COG category: [T] Signal transduction mechanisms
COG ID: [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
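
The start, end, and length fields above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5047011, 5048366)
print(length)                  # 1356
print(protein_length(length))  # 451
```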


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.163956
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000814904
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGCCCTCTC CCAACGCCAC CTTCCACACC CGCCAGATCC GCGACGGGGA AGCGCTCGTC 
CTGCGTCGGC GCAAGCTGCG TCTGCTCGTC GAAAAAGGCC CCGACCGCAA GCTGCGCCGC
GACTTCGACG CCGACCGCGT GGTCCTCGGC AGCCACGAGA GCTGCGACCT GGCGCTCAGT
GATCGCACGA TCTCGCGCCA GCACTGCGAG ATCGCGCTCT CCCGCGAGGG CTACATCATC
CGCGACCTCG ATTCGACCAA CGGCACCTTC CTCGATCGCG GCAACGTGCG CATCCGCGAG
ATCGTGGTCG ACGGCGAGAC CCGCATCCGC CTCGGCGACA CCAGCGTGCG CATCCAGCCG
CTCGACGAGA CCGTCGATAT CCCGCTCGGC CACGACATCC GCTGCGGCCC CCTGCTCGGC
CGCAGCCCGG CCATGCGCCG GGTCTTCGAC ATCGTCCGCC GGGTCGCGCC CAGCGACGCC
ACCGTGCTCA TCACCGGCGA GTCGGGCACC GGCAAGGAGA TCGCCGCGCG CGCGCTGCAC
GAGCACTCCG GGCGCGCCGA CGGCCCCTTC GTGGTGGTCG ACTGCGGCGC CCTGCCCGGC
AACCTCATCG AGAGCGAGCT CTACGGTCAC GAGCGCGGCG CCTTCACCGG CGCCATCAGC
GCCCGCGCCG GCGCCTTCGA GGCGGCCAAT GGCGGCACCC TGTTTCTCGA CGAGCTGGGC
GAGATGCCCC TGGATCTGCA GACCCGCCTG CTCGGCGTGC TCGAGCGCCG CGTGGTCCAG
CGCCTGGGCA GCACCAGCTC GCGCGCCATC GACGTGCGCG TCCTCGCCGC CACCAACCGC
GACCTGCGCC GCGAGGTCAA CCGCAAGAAC TTCCGCGAAG ACCTCTACTT CCGGCTGGCG
GTGGTGACCA TCGAGATGCC GCCGCTGCGC GACCGCCCCG AGGACATCGC GCTCTACGTC
GAGGACTTCC TGCGCGACAG CGCCGGCGGC GTGCGCATCG ACGCGCCGAC CATGGAGCGC
CTGGCCCAGC AGCCGTGGCC GGGCAACGTG CGCGAGCTGC GCAACGTCAT CGAACGCGCC
ATGGCGCTCG GCGAGGTCGA TGTCCCGGGC GGCGGCACCC GCGCCAGCGA GCGCGAGCAC
TCCCTGCCCC AGGTCGGCGG CGAGATCGAC GTCGAGGTGC CGTTCAAGGT CGGCAAGGCC
GCGCTCATCG AGGAGTTCGA GCGCAACTAC GTCGAGCGCC TGATGGCCAA GCACGAGGGC
AATATCACCC AGGCCGCGCG CGCGGCCGAG ATCGACCGCG TGTACCTGCT GCGCGTGCTC
GACAAGTACG GGATGCGGCC CTCGCGCCGC AAGTGA
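
A GC content figure like the one listed under Gene Information can be recomputed from a sequence printed in space-separated blocks of ten. This is a minimal sketch under the assumption that GC content means the percentage of G and C bases over all bases; `gc_percent` is a hypothetical helper, and the database may round or compute the value differently.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G/C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used only as a short demo.
print(gc_percent("GTGCCCTCTC CCAACGCCAC"))
```

Passing the full gene sequence (all lines joined) gives the value for the whole gene.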
 
Protein sequence
MPSPNATFHT RQIRDGEALV LRRRKLRLLV EKGPDRKLRR DFDADRVVLG SHESCDLALS 
DRTISRQHCE IALSREGYII RDLDSTNGTF LDRGNVRIRE IVVDGETRIR LGDTSVRIQP
LDETVDIPLG HDIRCGPLLG RSPAMRRVFD IVRRVAPSDA TVLITGESGT GKEIAARALH
EHSGRADGPF VVVDCGALPG NLIESELYGH ERGAFTGAIS ARAGAFEAAN GGTLFLDELG
EMPLDLQTRL LGVLERRVVQ RLGSTSSRAI DVRVLAATNR DLRREVNRKN FREDLYFRLA
VVTIEMPPLR DRPEDIALYV EDFLRDSAGG VRIDAPTMER LAQQPWPGNV RELRNVIERA
MALGEVDVPG GGTRASEREH SLPQVGGEID VEVPFKVGKA ALIEEFERNY VERLMAKHEG
NITQAARAAE IDRVYLLRVL DKYGMRPSRR K
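
The protein sequence begins with M even though the gene sequence begins with GTG, which normally encodes valine. Under translation table 11, GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this with a hand-built codon table. It assumes the elongation codon assignments of table 11 match the standard genetic code and uses only a subset of the table's start codons (ATG, GTG, TTG); `translate_cds` is a hypothetical helper, not a database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognised start codon as M and
    stopping at the first stop codon. Ambiguous bases raise KeyError."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGCCCTCTC CCAACGCCAC CTTCCACACC"))  # MPSPNATFHT
```

The printed residues match the first block of the protein sequence above.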