Gene Hoch_4557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4557 
Symbol 
ID8546962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6222213 
End bp6224033 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content68% 
IMG OID646389230 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_003268941 
Protein GI262197732 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCAA GGAAGAACGT CTCGGATCCC CCACCCCCCA CAGCGTTGAT CTTCCTCGCT 
GGACCCAACG CGGGGCGCCG CTACAAGCTC CACCGAGAGG GCGATTACAT CATCGGCCGG
CGCTCGGACT GCCAGATCTT CATCCCCGAT ATGCGGGTGT CGCGGCAGCA CGCGCGCATT
CACGAGGAGC AAGGCGCCTG GGTGCTCGAG GACCTGGGCT CGAACAACGG CACCTTCCTC
AACGGCGAGC GGGTGCAGTC GGCCAAGCTC AAGAACCAGG ACGAGATCAG CATCGCCACC
AACAGCATCC GCGTGGAGGT GCCGAAGGGG ACGAAGCCGC TGGCGCAGCA GGATTCGCAC
GTGACCATCG TCGACGTGAA GAATCCGGCC ATCTATGTGA GCACCGAGGA TGCCGAGGCG
GCGATGACCA ACTCGGCCTC GTTCCCGTGG GATCCGCGCA ACAAAGAGCG CCTGCTCACG
CGCAAACTGC ACGCGGTGCA GACCATCCTC GAGACCGCGG CCAACATCGC CGATCCCGAC
CTGCTGCTCG AGTCGGTGGT GGCCCAGCTC CTCGAGGTCT TCCCGCAGGC CGACTCGGTG
GGCGTGCTGG TCGAGGACGA GGACTCGCAC GAGCTGCTGG TCAAGTGCCA CAAGACCCGC
AAAAAGCAGG GCTTCAGCGC CGACTTGAAG GTGCCGGGCA CGATCATCGA CCACGTGGTC
CACGACCGCC GCGGCATCCT GCTCAGCGAG AGCGGCCACG ACGCCCGCGA AGACGGCGAT
CTCGCCGGCC GCACGAGCGT GCCGCCCAAC GGCTCGCGCA TGGGCGCGCC TCTGCAGGCG
CGCAACGTGC ATTACGGCGT GATCTACGTC GAGTGCACGA CCGGCACCTT CCAGCAGGAA
GATCTCGACC TGCTCACCAG CATCGCGGCC CAGACCGGCC TGGCCATCTA CACCGCGCGC
ATGCACAACC AGATGCAGCA TCGGCAGCGG CTGGAGCGCG ACCTGCGGGT GGCGCGGCAG
ATTCAGCGCT CGCTGATGCG CAGCCCGCCG CGCGTGCTCG GGCTCGACTT CGCCATCCAC
TACGAGCCGG CGTATCAGAT CGGCGGCGAC TTCTTCGACT TCATCTGGAA GGACGACAAT
CACCTCACGC TGATCGTCGG CGACGTGGCC GGCAAAGCCA TCAGCGCGGC GCTGTACATG
GCGCGGCTCA CCAGCGAGCT GCGCGGGCGC GCCGGCATCG CGCGGTCGCC GCAGCGGCTG
ATCAAGCGGG TCAACGAGGA GATGGTCAAG CTCGGCGACG ACGGCATGTT CGCGACCCTG
GTGTGCGCGG TGTTCGAGCT GTCGACGCGC AGCTTGCTGT TCACCAACGC CGGCCACTGC
GTGCCGCTTT TGCGCCGCGG TGAGCAGGTC TTCCCGCTGG AGTCGGAGCG GGCGCACATT
CCGCCGATCG GCATCCTGCC CGACCTCGAG GTCGGCGAGG CGCGCGTGCA ACTGCACACC
GGCGACCTGC TGGTGATCGT CTCGGACGGC ATCGTCGAGG CCCGCGACCC CAACGGCAAC
GAGTACGGCG AGCGCCGCCT GATCCGCCGC ATCCGCACGG CCCGCGGCGG CGCCGAGGAC
CTGGTCAAGT CGATCCTGCA GGACGTCGAC AGCCACGTCG GCAGCGCCAC CCAGGCCGAC
GACATGACCA TCCTGGTGAT GCACGTGGCC GAGCGGCGCA CGCGGCGGCA GACGACCACG
GTGCCGGGCG GCGTGCCGCA TGTCGGCGGC GAGATCGCCG GCGGCGACAG CGCGGACGAC
GAAGAGGCCG AGGAGAAGTA G
 
Protein sequence
MSARKNVSDP PPPTALIFLA GPNAGRRYKL HREGDYIIGR RSDCQIFIPD MRVSRQHARI 
HEEQGAWVLE DLGSNNGTFL NGERVQSAKL KNQDEISIAT NSIRVEVPKG TKPLAQQDSH
VTIVDVKNPA IYVSTEDAEA AMTNSASFPW DPRNKERLLT RKLHAVQTIL ETAANIADPD
LLLESVVAQL LEVFPQADSV GVLVEDEDSH ELLVKCHKTR KKQGFSADLK VPGTIIDHVV
HDRRGILLSE SGHDAREDGD LAGRTSVPPN GSRMGAPLQA RNVHYGVIYV ECTTGTFQQE
DLDLLTSIAA QTGLAIYTAR MHNQMQHRQR LERDLRVARQ IQRSLMRSPP RVLGLDFAIH
YEPAYQIGGD FFDFIWKDDN HLTLIVGDVA GKAISAALYM ARLTSELRGR AGIARSPQRL
IKRVNEEMVK LGDDGMFATL VCAVFELSTR SLLFTNAGHC VPLLRRGEQV FPLESERAHI
PPIGILPDLE VGEARVQLHT GDLLVIVSDG IVEARDPNGN EYGERRLIRR IRTARGGAED
LVKSILQDVD SHVGSATQAD DMTILVMHVA ERRTRRQTTT VPGGVPHVGG EIAGGDSADD
EEAEEK