Gene Hoch_2683 details

Gene Information

Locus tag: Hoch_2683
Symbol:
ID: 8545070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3699658
End bp: 3701898
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 69%
IMG OID: 646387378
Product: ATP-binding region ATPase domain protein
Protein accession: YP_003267107
Protein GI: 262195898
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
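The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3699658
END_BP = 3701898


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2241 746
```

Under those assumptions the results agree with the Gene Length (2241 bp) and Protein Length (746 aa) fields.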


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.301078
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGGCG GGGTGGGCGG TGCGCCCGTG CGTGGATTGT TAGCGCCGGG CACCATGGTC 
GGCGGATTTC GCATCCAAAA CCACCTGGCG TCCGGGCACT ACGGCAGCAT GTACGTGGCC
GAGGAGCCCG AGGGCGAGCG CAAGGTCGCC ATCAAGGTGC TGCACCCCGA GCTCGTCTCC
TTTGGCGAGC CCTTCATCCG CTTCATGCGC GAGGCCCGCA TCCTCGACCT GGTGCGCCAC
CCCAAGGTCG TCGAGGTCTA CGACTCGGGC ATCCTGGCCG ATGGTCGTGC CTTTCTGGTC
ATGGAGTTCC TCGAGGGCGA GGATCTGGGC AGCTACCTGC GCAAGCACGG CCGCCTGTCG
CCGGCCCGCT CGGTCGAGAT CCTCGAGGCG GTGTGCGACG CGCTCAGCCA CGCGCACGCG
CGCTCGGTGG TGCACCGCGG GCTCAAGCCC TCGAATGTAT TTCTGTGCAG CGGCGAGGCG
CAGCGCATCG TGTTGCTCGA CTTCGTGCTG GCCAAGCTGA GTGAGCACAA CAGCATCGAG
CTGACCTCGT CGCGCCTGGC CGTGGGCTCG CCCTCGTCGC TGGCGCCCGA GCAGATCCGC
GGCAAGCCGG TGGACGCGCG CACCGACGTC TATGGTCTGG GCGTGCTGCT GTATCAGATC
TTGACCGCGC GTCCGCCCTT CGAGAGCGAG TCCTGGGTGG CCGTGCAGCA CCTGCACCTG
CACGCGCCGC GACCCCGGCC CTCGCGCTAC GCGCCGCTGT CGCCGCTGCT CGACGAGGTG
GTGATGCGGG CCATGGATCC CGATCCCTCG GTGCGTCACG CCAGCGCCGC CGCGTTCCTC
GACGCCCTGC GCCGCGCGGT CGAGGACCTG GGCGAGCGCG CGCCGGCGGC GCCTGAGCCC
AGCGCCGCCA GCGCCGCCAG CGACGCCATC GAGAACTCCT CGGTGACCAT GCCGGCCGTG
GCCGTCTATG TAGATGTGCG CACGGCCCAC GACGGCGACG AGGACGACGA CGTGGTGCTC
GACGCCACCG ACGCCGTGCT CGAGCGCGCC GGCCAGGTGT TCACCGAGCG CGGGTTTCAG
ATCGCGCTGG AGACCAGCAG CGCCACCCTG TACGTGCGTT CGGGCAGCTC GGATGGCGAG
ATGGCGCCGC GCGAGCGGCT CGAGGCGGTG GGCGCGGCCC TGGCCTTGCG GGCCATGATC
GCGGAGGAGG TGCCGGCCGG GCCGTGGTTG CGGGTCAACA TCACCCTGCA CAGCGGCCGC
ATGCGCTTTC GCGACGGCCG TCCGCTCGAG GGTGAGCTGC TGCGGGTCGA GAGCTGGGCG
CCCGACGCGC CGGTCGGCGA CGTCCTCGGC ACCGAGCGCG CGTTTTCCGG TCTCAAAGTC
GAATCCGAGC CGCTCGCCGA GGGCTCGTCG CTGCTCAGGC TGCGCGGGCT CAAAGCCGAC
GCCATGTCCT TATTCGACGA CGACGAGCTC GCCGACGACG ACGCCCAGGC CGAGTCCGAG
GCCGGCGAGG CTGCCGCGGG CGGGCTCAAC GACCCGCGGG TCATGCACCT CGAGATGATG
GCCCAGATCG GCCGTCACAC CGCCGGCATC GTGCACGACC TGCGCTCGCC GCTCACCGTG
ATTCGCGGCA GCCTGGAGCT GGTGCTCGAC AACACCGAGT CGCGCGGCGA ACTCACCGAC
TCCGAGCGCA AGATCCTCAG CAACGCCTAC CAGTGCGCCG AGCAGATGAC CGACATGATC
TCGCTGATCC TCAAGGCGTC GGCGATCAAG TCGTACAGCG CGGGTACGCG CAAGATCCTG
TCGGTCGGCG ACCTGGTGGA CAACGCGCTC AAGCTGGTCT CCAAGGAGCT GCGGCGCAAA
GCCACCATCC GCGTCAATCA CGACGGCAGT AGCTGGGTGT TTGGCTCGCC GCTGCGGCTC
ACCCAGGTGC TCATCAACCT CATCGTCAAC GCCTCGCAGG CGATCCCCAA GCGCGGGCGC
ATCGACATCG AGACCTGCAC GACCAACGAC GGGCGCGTGC TCATCACCGT GCGCGACAAC
GGCGTGGGCA TGACGCCCGA CGTGCTGGCG CGGGTGTTCG AGCCCTACTT CTCGACCAAA
GACGCGGGCG AGGGCACCGG CCTGGGCCTG TCGCTGGCTC ACGCCATCAT CAAAGAGCAC
GGCGGCGAGA TCCAGCTCTC GTCCACCCCC GGGCAGGGAT CGAGCTTTAT TATCGACTTG
CCCGCCGCCG ACGTCCCCTA A
 
Protein sequence
MSGGVGGAPV RGLLAPGTMV GGFRIQNHLA SGHYGSMYVA EEPEGERKVA IKVLHPELVS 
FGEPFIRFMR EARILDLVRH PKVVEVYDSG ILADGRAFLV MEFLEGEDLG SYLRKHGRLS
PARSVEILEA VCDALSHAHA RSVVHRGLKP SNVFLCSGEA QRIVLLDFVL AKLSEHNSIE
LTSSRLAVGS PSSLAPEQIR GKPVDARTDV YGLGVLLYQI LTARPPFESE SWVAVQHLHL
HAPRPRPSRY APLSPLLDEV VMRAMDPDPS VRHASAAAFL DALRRAVEDL GERAPAAPEP
SAASAASDAI ENSSVTMPAV AVYVDVRTAH DGDEDDDVVL DATDAVLERA GQVFTERGFQ
IALETSSATL YVRSGSSDGE MAPRERLEAV GAALALRAMI AEEVPAGPWL RVNITLHSGR
MRFRDGRPLE GELLRVESWA PDAPVGDVLG TERAFSGLKV ESEPLAEGSS LLRLRGLKAD
AMSLFDDDEL ADDDAQAESE AGEAAAGGLN DPRVMHLEMM AQIGRHTAGI VHDLRSPLTV
IRGSLELVLD NTESRGELTD SERKILSNAY QCAEQMTDMI SLILKASAIK SYSAGTRKIL
SVGDLVDNAL KLVSKELRRK ATIRVNHDGS SWVFGSPLRL TQVLINLIVN ASQAIPKRGR
IDIETCTTND GRVLITVRDN GVGMTPDVLA RVFEPYFSTK DAGEGTGLGL SLAHAIIKEH
GGEIQLSSTP GQGSSFIIDL PAADVP
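The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The following minimal sketch translates the first 60 bases of the gene sequence to illustrate this. It is a hand-rolled illustration, not the method used to produce this record: the helper `translate` and the reduced `START_CODONS` set (only ATG, GTG and TTG) are assumptions made for this example, and the codon table is the standard one that table 11 uses for non-initiating codons.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS fragment; a start codon in first position becomes M."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence listed above.
first_block = "GTGAGTGGCG GGGTGGGCGG TGCGCCCGTG CGTGGATTGT TAGCGCCGGG CACCATGGTC"
print(translate(first_block))  # MSGGVGGAPVRGLLAPGTMV
```

The output matches the first 20 residues of the protein sequence. For real work, an established library such as Biopython would be a more robust choice than this hand-built table.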