Gene Hoch_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2079 
Symbol 
ID8544461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2876873 
End bp2878402 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content70% 
IMG OID646386782 
Productputative sensor with HAMP domain protein 
Protein accessionYP_003266517 
Protein GI262195308 
COG category[T] Signal transduction mechanisms 
COG ID[COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTGA AGAGGACCTG GAGCTGGCGC ACGCGCCTGC TCGTGCTGCT GCTTCTGTTC 
GCGGTGGTGC CGGTTACCGG CATCACCCTG TGGAACCTGG CCCAGCTCGA GGCGGCCTTC
GAGGAAGACG CGGTCGAGTC GCTGCGCGCC ATCGGCTACG CGCGCGCCGA CGCCATCGAC
CAGCTCATGG ACGACCGCCG CCGCGATGTC GAGCTGCTGG CGTCGCAGCT CGTGCCGCAT
CTCGAACAGA TGGGCATGGC CAATCGCGAG GCCGAGGCGC TGGAGCCGCC GGAGACCGCG
CCGGCACCGC TGCCGGCGCC ACTGCCCGAG CTTGAAGACG CCCAGGGTGG CGAGCCTTTT
GCCCCGACCG CGACCGAGAC CCCGGAGGCG GGCGCGTCGC CCGAGGCGGA TGCGGGCGCG
GCGGAGAGCG GCGAGGGCGC GGCGGAGAGC GGCGAGGGCG CGGCGGAGAG CGGTGAGGGC
GCGGGCGCGT CGCGGCCCCA GGCGTCGGCG CGTGCGGTCG AGATCGAGGA GGCGCGCCGG
CAGCAGGCGG TCGCAGCCGA GGCCGAGGCC AAGGCCGAGC TGTACCAGGC GCTGGCGCTG
ATTCTGCTCG ACCAGAAGGT CTTCGAGGAG CTGTTGGTCA TCTCCGAGGA CGGTCTGGTG
GTGGCGTCGA CCTACAACCG CCACGAGGGC AAGACCGCCG ACGGCCTGGA GTACTTTCAG
AACGGGCTCA AGGCGACCTT CCTGCAGCCG ATCTTCCTGT CGCCGATCAC CGACCGCTTG
ACCATGGTGA TCTCGACGCC GATCCGGGCG CCCGACACGC GGGTGCTGGG CGTGTTGGCC
GCGCGCCTCA ACCTCACTCG CTTCTTCCGC TTGATCAACG ATCTCACCGG CCTGGGCGAG
ACCGGCGAGA TCGTGGTCGG CAAGTTCAGC GAGGGCAAGG TGGTGTTCAT GGCGCCGACG
CGGCTCGACG CCAACGCCGC CCTGCAGCGC ACGGTGGAGA TCGGTAGCAA GCAAGGGCGG
CCGCTGCAGG AGGCCGCGCG CGGGCTCAAG GGCTCGGGCG AGCACGAGCT CGACTACCGC
GGCGTCGAGG TGATCGCGGC CTGGCAGCCG GTGCCCTCGC TGAGCTGGGG CCTGGTGGTC
AAGATCGACT ACGAGGAGGC GATGGACCCG GTGCACGCGG TGGCCTCGCA GACCCTGCTG
GTGGCGCTGG GCCTGCTGCT GATGGCCGTG ATCGCGGCCT ACGCGGTGTC GCGCGAGCTG
GTGCGGCCGC TGCGCACGCT CAAGGACGCG GTCGACCACA TCAGCCGCGG TCACCTCGAC
GTGCAGCTCG AGATCCGCTC GAGCGACGAG ATCGGCGAAC TCGCCGACAG CTTCGAGCGC
ATGGTGGCCG CGATCAAGTA CTTCCGCGAG CACGCGCGGC GCGAGGAAGA GGATGAGTCC
GAATTCGATT CCACCGATCA CGAGAGCGAC AGCGGCAGCG ACAAATCGGC TCTGACGCGC
TCCTCGGACG ACGAACCTGC GGCATCATAA
 
Protein sequence
MKVKRTWSWR TRLLVLLLLF AVVPVTGITL WNLAQLEAAF EEDAVESLRA IGYARADAID 
QLMDDRRRDV ELLASQLVPH LEQMGMANRE AEALEPPETA PAPLPAPLPE LEDAQGGEPF
APTATETPEA GASPEADAGA AESGEGAAES GEGAAESGEG AGASRPQASA RAVEIEEARR
QQAVAAEAEA KAELYQALAL ILLDQKVFEE LLVISEDGLV VASTYNRHEG KTADGLEYFQ
NGLKATFLQP IFLSPITDRL TMVISTPIRA PDTRVLGVLA ARLNLTRFFR LINDLTGLGE
TGEIVVGKFS EGKVVFMAPT RLDANAALQR TVEIGSKQGR PLQEAARGLK GSGEHELDYR
GVEVIAAWQP VPSLSWGLVV KIDYEEAMDP VHAVASQTLL VALGLLLMAV IAAYAVSREL
VRPLRTLKDA VDHISRGHLD VQLEIRSSDE IGELADSFER MVAAIKYFRE HARREEEDES
EFDSTDHESD SGSDKSALTR SSDDEPAAS