Gene Hoch_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3122 
Symbol 
ID8545510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4296079 
End bp4297701 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content71% 
IMG OID646387789 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003267517 
Protein GI262196308 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.340394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.912648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACG AGCGCACCGA ACTAGAGATT GTCCGTCAAC AACTAGCAGA GGCTCACGCC 
ACGCTCGATG CCATCCGCTG CGGCGAGGTC GACGCGGTGA TGGTCGATAC CGGTGACGCC
CACGAGGTGT TCACGCTTCA GCCCGCCGAT CTGCCGTATC GCGAGTTTCT CGAGCGCATG
GCCGAGGGCG CGCTGTCGCT CGACGCCGAT GGTCTGGTGC TGTACTGCAA CCAGTTTTTG
TGCGGGCTGC TGGGCCTGGG GCGTGAGCAG CTCACCGGCC GGCCCTTCTC CTCGTTTGTC
CTCGAGGGCT CGCGCTCCGA GTTCGAGGCC GCGCTGGCGG CCAGCGACAG CGGCAGCGTG
GCCGTGACCC TGCGCGGCGC CGGCGAGCGC CGTGTGCCCG TGCTGCTCAG CTACACGCCG
GTGGGCGGCG GCGAGCGCCG GCGCACCAAC CTGGTGGTCT CCGATCAGCG CATCCGCCGG
CGGCTGCAGA CCGTGAGCGC GGCCCGCGAC GCGGCCGAGG CCGCGAGCAC GGCCAAGGAC
CGCTTTCTGG CCGTGCTCGG GCACGAGCTG CGCAATCCTC TGGCCGCGCT CACCAGCAGC
GTCGAGCTGC TCGCGCACGG CGCGATCGAC GACCAGCGGC GCGACTGGAT CCACGACAGC
ATGGCGCGTC AGCTCGCCCA GCTCCGCTCG CTGGTCGACG ACCTGCTCGA CGTCACCCGC
ATCGCCCAGG GCAAGATGGT GCTGCGCAAG GCGCCGGTGG ACATCGCCAG CGTGGTCGCC
GACGCCCTCG AGTCGGTGTC CTCGCTGGTC AACGCGCGCA AGCACACGCT GGTGTGCGAG
CCCATCGTCG AGCGTCTCGA GGTGTTTGGC GACCGCACCC GGCTCGAGCA GGTGATCGTC
AACCTGGTGG CCAACGCCGC CAACTACACC GAACCCGGCG GCCGCATCGA GCTCAGCGCC
AGGCGCGAGG GTGAGCACAT CCGCGTCGCG GTGGTCGACA CCGGCGTCGG CATCGACGCC
GCCGACATCG AGCACATCTT CGAACCCTTT GCCCAGGTCG GCGAGGCCGG CAGCGGCGGC
CTAGGCATCG GCCTCACCCT GGTGCGTCAG CTCGTCGAGC TGCACGGCGG CACGGTCGAG
GCCGAGAGCG GGGGTCACGG CCAGGGCACG ACCTTCCGGG TGAGCCTGCC GCAGGGTGGC
GAGCAGCCGG CGCCGGAGCC CAGACCCAAC CCGGGCCAGC TCCCGCGCGG GCTGCGCGTG
GTGGTGGTCG ACGACAACGA GGACTCGGCC CAGCTCATGG CGCTGCTGCT CGCGGGCTAC
GGGCTCGAGG TCGAGAGCGT GCACCGCGGC ACCGAGGTGC TGCCCGCGGT CGAGCGCCAC
CGCGCCAAGC TGGTGCTGCT CGACCTCGGC TTGCCCGATA TCTCCGGCTA CGAGGTCGCC
CAGCAGCTCC GCCAGGCCGG CCACGACGAG CTGGTCATCG TCGCGCTCAC CGGCTTCTCG
CACGCCAGCG CCCGCCAGCG CGCCGAGCAG GCCGGCTGCG ACGCGCACGC GGTCAAGCCG
CTCAAGGCCG CCCAGCTCGC GACCATGGTG GCGCGCTTCC ACGAGCGCCT CAAGGTCGAC
TGA
 
Protein sequence
MSDERTELEI VRQQLAEAHA TLDAIRCGEV DAVMVDTGDA HEVFTLQPAD LPYREFLERM 
AEGALSLDAD GLVLYCNQFL CGLLGLGREQ LTGRPFSSFV LEGSRSEFEA ALAASDSGSV
AVTLRGAGER RVPVLLSYTP VGGGERRRTN LVVSDQRIRR RLQTVSAARD AAEAASTAKD
RFLAVLGHEL RNPLAALTSS VELLAHGAID DQRRDWIHDS MARQLAQLRS LVDDLLDVTR
IAQGKMVLRK APVDIASVVA DALESVSSLV NARKHTLVCE PIVERLEVFG DRTRLEQVIV
NLVANAANYT EPGGRIELSA RREGEHIRVA VVDTGVGIDA ADIEHIFEPF AQVGEAGSGG
LGIGLTLVRQ LVELHGGTVE AESGGHGQGT TFRVSLPQGG EQPAPEPRPN PGQLPRGLRV
VVVDDNEDSA QLMALLLAGY GLEVESVHRG TEVLPAVERH RAKLVLLDLG LPDISGYEVA
QQLRQAGHDE LVIVALTGFS HASARQRAEQ AGCDAHAVKP LKAAQLATMV ARFHERLKVD