Gene Hoch_1844 details


Gene Information

Locus tag: Hoch_1844
Symbol:
ID: 8544226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2539931
End bp: 2542108
Gene Length: 2178 bp
Protein Length: 725 aa
Translation table: 11
GC content: 71%
IMG OID: 646386550
Product: adenylate/guanylate cyclase with integral membrane sensor
Protein accession: YP_003266285
Protein GI: 262195076
COG category: [T] Signal transduction mechanisms
COG ID: [COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain)
TIGRFAM ID:
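
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that the start and end positions are both inclusive and that the CDS ends with a single stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2539931
END_BP = 2542108
GENE_LENGTH_BP = 2178
PROTEIN_LENGTH_AA = 725


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: whole codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_bp = span_length(START_BP, END_BP)
computed_aa = protein_length(computed_bp)
print(computed_bp == GENE_LENGTH_BP, computed_aa == PROTEIN_LENGTH_AA)  # True True
```

Under those assumptions the reported gene length (2178 bp) and protein length (725 aa) both agree with the coordinates.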


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.266709
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCATCT GGCGATTCTA CAAGCGGCGC ATGGCGGTCA AGGTCTTCAT GGCCCTGGCC 
GTGGTCGTCG GCGGTGTCCT CAGCGTGCAG GCCTGGTTCG ACAGCCGCGG CGACGCCGAG
GTGATGCGCG AGCAGAGCAA GGAATCGGCC GTGGACATCG CGCGCCTGTT CATCGGCGCG
GTCGAGCACT CGATGCTGGC CGGCGGTGGG CTCGAGGTCA AGACCCTGGT CACCGGGCTC
GAGGACCGGC TCAGCCAGCA GCAGGAGCTG GACCGGTTGC CCGCGGTGCA GGTGCACATC
TACGATCAGC GCGGGCTCGA GGTGTTCGCG CCCAAGGCGC CGGTGCCCGC GCGCGAGGAC
CTGCCCGCGG ACATCCGCGC GGTGCTCGAC GGCGGCGCCC GCCACGAGGG CGACGACGGC
CGCATCTACC GGCCGGTGCC CAACGAGGCC CGCTGCAACG CGTGTCACGA CAGCGGCTCG
ACCCTGCGCG GCGTCATCGC GCTCGAGCTC GACCCCGCGC AGTGCAGCGA CGCCCGCGAG
GAGGTGCTGC CGCACATCAT CGCCGAGGGC TTCACCCACG TGATGACGGC CGAGCGCACC
CACTTCCTCG ACGACTACTT CCAGGAGCTG CGCCGGGAAG TGCCGCAGGT GCAGGGCGTG
GCCGTGTTCG ATGGCGACGG CCTGCTGATG TTTGGCGAGG AGTTCGCGGG CCTGAGCGAG
GAGGCGCTGC GGCCGCTGCT GCAGACCGAC GCCCAGGCCG CGTATCTGCC GCGCCCCGAG
GGCGGCACCT TGGCCATGCT GCCGCTGCCC ATGGAGGACC GCTGCACCGC CTGCCACGAC
GACGAGCTGG GCAGCATTCG CGGCGTGCTC GGGGTGTCGC TGGCGGCCTC GCCCAACATC
GCGCGCTGCG TGTCGAGCGA GTACGAGGGC ATCGTCGACA CCTCGCTGCG CTACATCATG
GTGTCGCGTC TGGGTCGGCG CATCGCCGAT TTCCTCGACG CCGTGGCCGC CACCGAGCCG
CTGCGCGAGC TGGTGCTCTA CGACGAAGTC GGACGCCAGT ACTGGAACAC CACGCACCCG
ACCCCGCTGC CGCACGTGGC CGAGGTGCTC GAGCGCGGCT CGTCGATCGT CGAGCTGGTC
GAGGCCGAGG ACGGCGAGCG CGTGCGCGCC ATCGAGCCGC TGTTCAACGA GCGCGCGTGC
ACCCGCTGTC ACGGCGCCAG CTCGCCCATG CGCGGCGCGG TCGCCGTCAG CCTGTCGACC
GAGTTCGCGG TGCGCCACCA GCGCGAGGCC CTGCAGCGGC GCCTGCTGTT CACCGGGCTC
ACCCTGCTGG CGCTGCTCCT GGTCATGGCG CCGCTGCTCA ACTATCTGGT CGCGCGGCCG
GTGCGGCGCA TGGGCGACGT CGCCGAGCGC GTGGGCCGCG GCGATCTCAG CGCCATCGTC
GACCACGCGG ACGAACGCGG CGACGAGATC GCGCGCCTGG GCTTTCGCAT CAACGAGATG
GTGGGCGGCC TCAAGGCCAA GCTGCGCCTG GAAAAATTCG TGTCGCGCGG GGCCGCGGCC
GCGGCCGACG CCGCCGGCGA GCGCGAGCTG CTCCGCGCCG GCCAGCGCCG CGAGGCGACC
GTGCTGTTCA GCGACATCCG CGGCTTCACT GCCTACTCCG AGCAGGTCGA TCCCGAGAGC
GTGGTCGACA TGCTCAACCG CCTGCTCCAG GCCCAGGCCG ACGTCGTCGG ACACTTCGAC
GGCGACATCG ACAAGTTCGT GGGCGACGAG CTGATGGCCC TGTTCCACGG CCCCAACGCC
GAGGCCCGCG CCGTGCTCTG CGCCACCCGC ATGCTCGAGG CCGTGCGCCG CGGCCTGCGC
GAAAACGAGC CCCTGGGCGT GGGCATCGGC ATCTCCTCGG GCATCGTGGT CTACGGCGCC
ATCGGCCACG AGTCGCGCAT GGACTTCACC GTCATCGGCG ACGTGGTCAA CACCGGCGCG
CGTCTGTGCT CGGCCGCGGC CGGCGATCAG ATCCTGGTCA CCGCGTCGGT GCGCGATGCC
GTCGGCGACA GCCGCGATCT CGAGTTTCAA GCGACCGAGC CCTTATCGGT CAAAGGCAAG
CGTGAACCCA TCACCGTCTT CGAAGCCCGG CGCTGTTCGC CCGGCGACGA GGACGCGTCA
GGAGTAAATC GGACGTGA
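
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any statistic such as GC content is computed. The following is a minimal sketch of that step; `join_blocks` and `gc_percent` are hypothetical helper names, and the sample is only the first two blocks of the sequence, so its result is not the 71% reported for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above.
sample = "ATGGGCATCT GGCGATTCTA"
print(gc_percent(join_blocks(sample)))  # 50.0
```

Passing the full gene sequence text to `gc_percent(join_blocks(...))` gives the value to compare with the GC content field.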
 
Protein sequence
MGIWRFYKRR MAVKVFMALA VVVGGVLSVQ AWFDSRGDAE VMREQSKESA VDIARLFIGA 
VEHSMLAGGG LEVKTLVTGL EDRLSQQQEL DRLPAVQVHI YDQRGLEVFA PKAPVPARED
LPADIRAVLD GGARHEGDDG RIYRPVPNEA RCNACHDSGS TLRGVIALEL DPAQCSDARE
EVLPHIIAEG FTHVMTAERT HFLDDYFQEL RREVPQVQGV AVFDGDGLLM FGEEFAGLSE
EALRPLLQTD AQAAYLPRPE GGTLAMLPLP MEDRCTACHD DELGSIRGVL GVSLAASPNI
ARCVSSEYEG IVDTSLRYIM VSRLGRRIAD FLDAVAATEP LRELVLYDEV GRQYWNTTHP
TPLPHVAEVL ERGSSIVELV EAEDGERVRA IEPLFNERAC TRCHGASSPM RGAVAVSLST
EFAVRHQREA LQRRLLFTGL TLLALLLVMA PLLNYLVARP VRRMGDVAER VGRGDLSAIV
DHADERGDEI ARLGFRINEM VGGLKAKLRL EKFVSRGAAA AADAAGEREL LRAGQRREAT
VLFSDIRGFT AYSEQVDPES VVDMLNRLLQ AQADVVGHFD GDIDKFVGDE LMALFHGPNA
EARAVLCATR MLEAVRRGLR ENEPLGVGIG ISSGIVVYGA IGHESRMDFT VIGDVVNTGA
RLCSAAAGDQ ILVTASVRDA VGDSRDLEFQ ATEPLSVKGK REPITVFEAR RCSPGDEDAS
GVNRT
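
As a consistency check between the two sequences, the first and last lines of the gene sequence can be translated and compared with the start and end of the protein sequence. This sketch builds the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons); `translate` is an illustrative helper, not a library function.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_blocks: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna_blocks.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence in this record.
first_line = "ATGGGCATCT GGCGATTCTA CAAGCGGCGC ATGGCGGTCA AGGTCTTCAT GGCCCTGGCC"
last_line = "GGAGTAAATC GGACGTGA"

print(translate(first_line))  # MGIWRFYKRRMAVKVFMALA
print(translate(last_line))   # GVNRT*
```

Both results match the protein sequence above: it begins with MGIWRFYKRR MAVKVFMALA and ends with GVNRT, followed by the TGA stop codon. The last line can be translated on its own because the 36 full lines before it hold 2160 bases, a multiple of 3.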