Gene Hoch_5080 details


Gene Information

Locus tag: Hoch_5080
Symbol:
ID: 8547491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7003466
End bp: 7005499
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 68%
IMG OID: 646389756
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003269461
Protein GI: 262198252
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
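
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7003466
END_BP = 7005499
GENE_LENGTH_BP = 2034
PROTEIN_LENGTH_AA = 677


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2034
print(coding_codons(GENE_LENGTH_BP))   # 677
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.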


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCATCG AGAGCTTGAT CGAATCCAAG CTACTCGCTG TAGTGACCGT TCAGGTGGCG 
TCGGGGCAAC TTGGACGCGT GCAGCAGGCG AGCGTGGCAG CGCGCGCGCT GCTCGGCTAT
GAGCGCGAGG CGTCCGAACT CACGGGCGAC GACGCGCTGG AATGGCGAGA GCTGGTGGCG
GACGAATCCC CGGGGCAGTC TGCCGGCGTC GCGCTGCCCA CCGGCCAAGC CGTCGAACTC
GAACTGTGCC GCCGCGACGG CGAGCGCATC CCGGTGTGTG TCGAGCGCAT CGACGCCGCA
GACGAGCTGT GCCTGCTCCT GCTCCATCCC CGCCACGGCC ACCAGCCACC GACCCGGCCC
CCCGCTCGCG CCGGTGGGTT GCAGGATGCT CCGGCACAGT CATCCGGGGC ATTCGATGTT
TCGTCCAACA AACCAGCTTT TGCGTCGGAC GAATCCCAGC CAGCCGCCGG CTCCGACGCG
TTGCCGAGCC GCGCCGAGCT CAAGCAGATG CGCCGCGAGC TCAAGCAGCA CACGCAGATG
TTCAACAGCC TGCCGGTGTC GATGTTCGCC AAGCGCTACG AGGGCGCTCG TCCGGGCGTG
TACATCATGG TCAATCCCCT GGCGACCGAG ATCAGCGGCA TCCAGAGCAT CCCGGGCAAG
TCCGACTTCG ACTTCTTCCC CGAAGCCGTG GCCCAGACGC TGCAAGACAA CGATCGCCAG
GTGATGGCGG CGCGCAAGCC GGTCACCGTC GAGGAGTCGT TTCCGGACCG CCGGACGGGA
AAACCCGTGT ATTTGCTGTC TACCAAAGTG CCGCTGCTGC GCGACGACGG CACGCCCTAC
GGCATCGCCG GCGTGTCGGT GGACATCACC GAGCAGTGGC TGTGGAAACA GCGCATGGCC
GCGCTGATGT CGGCGATTCC CGATACCATG ATCCGCGTCG GCCCCGACGG CGTGGTGCTC
GACATCGAGA ACGATCGCGG CATTCTGCCC GCCGGTCGCC TGCGCATCGG CGAATCCATC
TGGGAGCGGC CCGGGCCCAG CCAGGGCCGC CGGCTCATCG AAGCGGCCCT GCGGCAGGCG
CTCGCGAGCG GCGAGCGCAT CGCGGTCGAG ACCAGCGTGG AGACCGACAA CGGAATGCGC
CACTTCGAGA ACCACCTGGT CAAGAGCGGC GAGGACGAGG TGGTCTGGCT GGTGCGCGAG
CTCACCGAGC TCAAGCGCAG CCGCGCCGAA TTGCAGGCGG CCAACGACAA GCTACAGGTC
GTCAACCAGG AGCTGGCGCA GTTCGCCTTC GTGGCCTCAC ACGATCTGCA AGAGCCGCTG
CGCACGGTGC GGACCTTCAC CGAGTTGCTG GGGCGGCGCT ACGCCGACGC CTTCGACGAG
CGCGGCAGGA CGTGGCTCGC CAGCATCAGC AACGGCATGG CCCGCATGCG CACCCTGGTC
GACGACCTGC TGAGCTACTC GCGCGTCGGC CGCATCGAGG ACGACACCCC CTACGATACC
GAGCGCGTGC TCTCGCAGCT CCTGCGCGAC ATGCACACCA GCATCGAGAG CGCCAACGCC
GAGCTCGTGA TCGGCGAGCT GCCGTGGGTG GTCGTGAGCC GGCTCGAGTT GCAGCAGGTG
TTCCAGAACC TGGTCAGCAA CGCGATCAAG TTCCGCCGCA ACGAGGTGCG CCCACGCGTC
GAGATCCACG CCGAGCCCGA TCCTCACGGC TGGGCCTTCA CGGTCTCGGA CAACGGCATC
GGCATCGACG CCGAGCAGCA CGAGCGCATC TTCAATCTGT TTCAACGCCT GCACGCGCGC
AGCGAATACG AGGGCACCGG CATCGGCCTG GCGCTGGTCA AACGCGCCAT CGAACGCCGC
GGCGGGACCA TCACGGTGTC GTCGACGCCG GGCCAGGGCA CGAGCTTTCG CTTCACCCTG
CCCGGCCTAC CCACCGGAGC CGAGCCCGAC ACCGTCGACA GCGACAGCGG TGGCGACCAC
GACCACGACC ACGACCACGA CCACGACCAC GCCGACGACA ACACCGTGCG CTGA
 
Protein sequence
MVIESLIESK LLAVVTVQVA SGQLGRVQQA SVAARALLGY EREASELTGD DALEWRELVA 
DESPGQSAGV ALPTGQAVEL ELCRRDGERI PVCVERIDAA DELCLLLLHP RHGHQPPTRP
PARAGGLQDA PAQSSGAFDV SSNKPAFASD ESQPAAGSDA LPSRAELKQM RRELKQHTQM
FNSLPVSMFA KRYEGARPGV YIMVNPLATE ISGIQSIPGK SDFDFFPEAV AQTLQDNDRQ
VMAARKPVTV EESFPDRRTG KPVYLLSTKV PLLRDDGTPY GIAGVSVDIT EQWLWKQRMA
ALMSAIPDTM IRVGPDGVVL DIENDRGILP AGRLRIGESI WERPGPSQGR RLIEAALRQA
LASGERIAVE TSVETDNGMR HFENHLVKSG EDEVVWLVRE LTELKRSRAE LQAANDKLQV
VNQELAQFAF VASHDLQEPL RTVRTFTELL GRRYADAFDE RGRTWLASIS NGMARMRTLV
DDLLSYSRVG RIEDDTPYDT ERVLSQLLRD MHTSIESANA ELVIGELPWV VVSRLELQQV
FQNLVSNAIK FRRNEVRPRV EIHAEPDPHG WAFTVSDNGI GIDAEQHERI FNLFQRLHAR
SEYEGTGIGL ALVKRAIERR GGTITVSSTP GQGTSFRFTL PGLPTGAEPD TVDSDSGGDH
DHDHDHDHDH ADDNTVR
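
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The following is a minimal sketch, not a reference implementation: the helper name `join_blocks` is hypothetical, only the first and last lines of the gene sequence are reproduced, and the start-codon set is a deliberately partial list of initiators commonly seen with translation table 11.

```python
def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence."""
    return "".join(text.split())


# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "GTGGTCATCG AGAGCTTGAT CGAATCCAAG CTACTCGCTG TAGTGACCGT TCAGGTGGCG"
LAST_LINE = "GACCACGACC ACGACCACGA CCACGACCAC GCCGACGACA ACACCGTGCG CTGA"

# Partial set of start codons for translation table 11 (assumption: not exhaustive).
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons for translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

first_codon = join_blocks(FIRST_LINE)[:3]
last_codon = join_blocks(LAST_LINE)[-3:]

print(first_codon, first_codon in COMMON_START_CODONS)
print(last_codon, last_codon in STOP_CODONS)
```

The gene begins with GTG, an alternative start codon that is read as methionine at initiation, which is consistent with the protein sequence starting with M.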