Gene Hoch_5042 details


Gene Information

Locus tag: Hoch_5042
Symbol:
ID: 8547453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6957393
End bp: 6958901
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 71%
IMG OID: 646389718
Product: protein serine/threonine phosphatase
Protein accession: YP_003269423
Protein GI: 262198214
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID:
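
The length fields above are consistent with the coordinates. A minimal Python sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(6957393, 6958901)
print(length_bp, protein_length(length_bp))  # prints "1509 502"
```

The two printed values match the Gene Length and Protein Length fields of this record.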


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.384072
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCCGC TCCCCAAGCT CATCGTCACC GTCGCGCTCT CGGTCGCCCT TGGCGTCGGA 
TTGATCATCT GGCAGGCCCA CTACCAGGCC AACCAGATCG CCGAGCAGGG CACCAAGGTG
CTGCTCGACG TGCTCACGCG CGAGCAGCAC GCGCGCTGGC AACGGGAGAC CGTGCAGCTG
GCCACCGCGC TCGGCGACGG CCGCGCCGAG CTCGCGCCCG GGCCCCGGCT GCGGCGTATC
CTCGACCTGC CGTCCGAATT CAAGCACGCG CGCATCCTCG ACGCCGAGGG CGACGACCTC
GCGGTCGCCC ACGCGTCCAT GGAGAGCTCG TTCACCGACA ACGAGCTCGA CAAGGACGCG
CTGCTGGCCG CCTGGCGCGA GCGCGCCGAC CCCGAGCTGC CGCTGGCCGA GATCGAAGAC
CCCGAAGATC AGCGCGTGTA CATGATCACC TCGCTGAGCG ACGCTGGCGA GGGCGCCTAT
TTCGTCACCG GGTATTCGGG CGCACCGCTC GCCGCGATCG CCGCCGACCT CGAGCGCCAT
CACCACACCG AGGCGCGCAC ATCGCTCGAG CTGCTGGTGT TGATCGGCAT CGGCGCCGAG
TTGCTCGCCG TCATCGGTCT GTTCCTCATC CTGCGCCGCC GCGCCGAGCC CGACGACGCG
ATTCCCCAGT TCGCGCCCGC GGCCACCGCG CGCTCGGGCC CGGCCCTGCA CGACATGGCC
GGCGAGCTGG CCGTGCTGCT GCAGGAGACC GCGGCCAAGT CGCGTATGGA TAAGGATCTG
GAGATCGTCC AGACCGTCCA GAATACGCTG CTGCCGGCCG ACGAGTTCGT CGAGCGCGGG
CGCCTGTCCT TCGCCGGCAA GCTGCACTCG GCCGGCACCT GCGGCGGCGA CTGGTGGACC
TACCACGACC TCGCCGACGG CACCGTGCTG CTGGTGCTCG GCGACGTCAC CGGCCACGGC
CCCAGCGCCG CCATGCTGAC CGCGGCGGCC AAGGCGGCCT GCGACCTCGC CTGCGACATG
CACGACCACC TCCCGAGCCC GGCCGCGGTG CTCAACCTGA TGAACCAGGC GGTGTTCCAC
GCTGGCGGCC GGCGCCTGCT CATGACCTGC TTTGCCATCA TCATCGACCC GCGCACGGGC
GCGGCGCAGT TCGCCAACGC CGGCCACAAC TTCCCCCTGC TGGCGCACCA GGAGCCCGGC
CAGGACGAGG TGCAACTGAC CTCGCTGATC GCGCGCGGCA ACCGCCTCGG CGACTCGCGC
GAGTCGCACT TCGAAATGGT CAGCGCCACC CTGGAGCCTG GTGATCGCCT GCTGCTGTAC
ACCGACGGCA TCATCGAGTG CGAGAACTCG CAGGGCTCGG CCTACGGCGC GCGCCGCATG
CGCGAGCTGA TCGCCGGCGC CGCGTCCGAG CCCGTCGCTC TGCGCGACGA GCTCATCCGC
AGCGCGCTCG AGTTCGCCGA AGGCAAACTC GACGACGACC TCACCCTGGT CGCCGTGCGC
TTCGCCTAG
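
The GC content field can be recomputed from the gene sequence. The sketch below is a hedged illustration, assuming the sequence is pasted in the space-separated 10-base groups shown above. Only the first row is used here for brevity, so the printed value describes that row rather than the whole gene. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "ATGCTCCCGC TCCCCAAGCT CATCGTCACC GTCGCGCTCT CGGTCGCCCT TGGCGTCGGA"
print(f"{gc_content(first_row):.1f}%")
```

Passing the full 1509 bp sequence instead of `first_row` would allow a comparison against the GC content reported in the Gene Information section.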
 
Protein sequence
MLPLPKLIVT VALSVALGVG LIIWQAHYQA NQIAEQGTKV LLDVLTREQH ARWQRETVQL 
ATALGDGRAE LAPGPRLRRI LDLPSEFKHA RILDAEGDDL AVAHASMESS FTDNELDKDA
LLAAWRERAD PELPLAEIED PEDQRVYMIT SLSDAGEGAY FVTGYSGAPL AAIAADLERH
HHTEARTSLE LLVLIGIGAE LLAVIGLFLI LRRRAEPDDA IPQFAPAATA RSGPALHDMA
GELAVLLQET AAKSRMDKDL EIVQTVQNTL LPADEFVERG RLSFAGKLHS AGTCGGDWWT
YHDLADGTVL LVLGDVTGHG PSAAMLTAAA KAACDLACDM HDHLPSPAAV LNLMNQAVFH
AGGRRLLMTC FAIIIDPRTG AAQFANAGHN FPLLAHQEPG QDEVQLTSLI ARGNRLGDSR
ESHFEMVSAT LEPGDRLLLY TDGIIECENS QGSAYGARRM RELIAGAASE PVALRDELIR
SALEFAEGKL DDDLTLVAVR FA