Gene Hoch_3095 details

Gene Information

Locus tag: Hoch_3095
Symbol:
ID: 8545483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4265772
End bp: 4267499
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 73%
IMG OID: 646387765
Product: sigma54 specific transcriptional regulator, Fis family
Protein accession: YP_003267493
Protein GI: 262196284
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR02019] bacteriochlorophyll 4-vinyl reductase
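
The start and end coordinates, gene length, and protein length listed above can be cross-checked against each other. The sketch below is only an illustration and rests on assumptions not stated in the record: that coordinates are 1-based and inclusive, that the gene is unspliced (the record says it is), and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4265772, 4267499)
print(length)                     # 1728
print(protein_length_aa(length))  # 575
```

Both values agree with the Gene Length and Protein Length fields of this record.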


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.458911
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCCA TCGATCTTCG CATCGCCGAC CTGCTCGAGC TCGACCCGGG CGGCGGCGTG 
TACCGCTTTG GCGGCCAGCG GGTGCTGCTG CTCGACGCCG TGGCCCTGGG GCTGCTGCGC
AAACAGCTCG TCGAGGCCTT TGGCCAACAC GCCGCGCGCG GGCTGCTCAC GCGGCTGGGC
TATTCGCACG GCTGGCGCGC GGCCGAGGCG CTGCGCGACA CCATCGCCTG GGACGACGAG
CGCGAGTGGC GCATCGCCGG CGGGCGCATC CACCGCCTGC AGGGGCTGGT GCGCTTCGAG
CCGGTGCCCG GAGACCGCGC CAACACGCTG GCGCAGGCGG TGTGGCACGA CTCCTACGAA
GCTGAGCAAC ACCTGCTGCA TGTGGGACGC TCGTCCGAAC CCGTGTGCTG GTCGCTGTGC
GGCTACGCCA GCGGCTACCT GAGCCGCGTG GTCGGACAGT CGGTCTACGC GGTCGAGGAG
AGCTGCGCCG GCTGCGGCGA CGCGGTGTGC CGCATGGTCG CCCGCACCGA GGCGCAGTGG
GGCGCGGACA TCGAGCCGCA CCTGGCCTAC TACGAGCGCG ATTGTCTCGA CGCCTCGCTG
CACAGCCTGC GCGACGCGGT GCGCAAGCTC GAGCGCCGGC TGCGCTCCCA GCGGCGCGCG
CTCGGCGCCG ACGCCGAGGT GCTCGAGAGC GGCGGCGTGG TCGCCCGCAG CCGCGCCATG
CGCCGGGTGC TCGAGCTCTG CCGCCGGGTC GCCGCGGTCG ACGCCACCGC CCTGGTCCAC
GGCGAGAGCG GCGTCGGCAA GGAGCGCGTG GCCCGCTACA TCCACGATCA CTCGCAGCGC
GCGGCCGGGC CCTTTATCGC CATCAACTGC GGCGCCATCC CCGAGCCGCT GCTCGAGAGC
GAGCTCTTCG GCCACGCCAA GGGCGCGTTC TCGGGCGCCA GCTCGGACCG CGTGGGCCTG
TTCGAGGCCG CCACCGGCGG CACCCTGCTG CTCGACGAGA TCGGCGACGT GCCCGCCGCC
ATGCAGGTGC GCCTGCTGCG GGTGCTGCAG GAACGCGAGG TCCGGCGCGT GGGCGAGAGC
CGGCCGCGGC CCATCGACGT GCGCGTGCTC GCAGCCACCC ACCGCGATCT GCGCGCCGAA
GTCGCCGCCG GACGCTTTCG CGAAGATCTC TTGTTCCGCC TGTGTGTACT CGAGATCGAG
ATTCCGCCGC TGCGCGAGCG CCCCGATGAC ATCCTGCCGC TGGCGCGCAT GAAGCTGCTC
GACACCGCCA CCCGCTACCG GCGCGAGGTC CGCGACTTCA CCCCCGAGGT CGCCAAATGG
CTCATCGCTC ATCCCTGGCC CGGCAACGTG CGCGACCTGC ACAACGTCAT CGAGCGCGCG
GTGGTGTTCG CTGAATCCGC GTGTATCGAG CTCGCCGACC TGCAGCTCGG CGCCGGCGCG
GCCAGCGCAG ATTCGCCCGC AGACCCCGGC CCGGACTCTC CGATCGCCGC TGGCGGCCCC
CAGGCGAGCG CGCGCACGGA CGCCGAGGCT GACGCGGCAA TCGCGACGCC CGCGGGCGCG
ACCCTGGCCG AGGTCGAGCG CGCCCACATC CTGGCCACGC TCGCGGCCTG CGGCGGCAAC
CGCTCGGAAG CCGCGCGCCG CCTGGGCATC GGCGCCGCGA CCTTGTTTCG CAAGCTCAAG
CGCTACGGCG TGCCGGGCCC GCGCCAGGAC CACGCCAAAC CCGCCTGA
 
Protein sequence
MRAIDLRIAD LLELDPGGGV YRFGGQRVLL LDAVALGLLR KQLVEAFGQH AARGLLTRLG 
YSHGWRAAEA LRDTIAWDDE REWRIAGGRI HRLQGLVRFE PVPGDRANTL AQAVWHDSYE
AEQHLLHVGR SSEPVCWSLC GYASGYLSRV VGQSVYAVEE SCAGCGDAVC RMVARTEAQW
GADIEPHLAY YERDCLDASL HSLRDAVRKL ERRLRSQRRA LGADAEVLES GGVVARSRAM
RRVLELCRRV AAVDATALVH GESGVGKERV ARYIHDHSQR AAGPFIAINC GAIPEPLLES
ELFGHAKGAF SGASSDRVGL FEAATGGTLL LDEIGDVPAA MQVRLLRVLQ EREVRRVGES
RPRPIDVRVL AATHRDLRAE VAAGRFREDL LFRLCVLEIE IPPLRERPDD ILPLARMKLL
DTATRYRREV RDFTPEVAKW LIAHPWPGNV RDLHNVIERA VVFAESACIE LADLQLGAGA
ASADSPADPG PDSPIAAGGP QASARTDAEA DAAIATPAGA TLAEVERAHI LATLAACGGN
RSEAARRLGI GAATLFRKLK RYGVPGPRQD HAKPA
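
The gene and protein sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind that could be compared with the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGCGCCA TCGATCTTCG CATCGCCGAC CTGCTCGAGC TCGACCCGGG CGGCGGCGTG"
seq = clean_sequence(first_line)
print(len(seq))              # 60
print(seq.startswith("ATG")) # True
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick way to confirm that no block was lost when copying.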