Gene Hoch_5561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5561 
Symbol 
ID8547975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7632819 
End bp7635212 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content71% 
IMG OID646390234 
Productsulfatase 
Protein accessionYP_003269936 
Protein GI262198727 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTAATG CGACGCGCAA GGGCGCCCTG CTCGGCTCCC TGCTGGTTCT CTGTTCCCCC 
TGGTGGCTGG CGTGCAGCGA CAAAGCCGGC CCGACGACCT CGTCCCCGGC TCCCGAGGCC
GCGGGTGCCG CCAGCGGCGG CGCGAACCCG ACCCAGGCCG GCGGCGAGGC CGAGGCCGAG
GTTACCGGAC ATCCGTCCGA AGCCGAGCCG GCGATCGCGC TGGCCGACAG CCCGGATGTC
GATCTGGTGG ACAATCGCTT CCTCTGGCAC CTGGCACCGC CGGCGCAGGC CTCGGGTCTG
TTCGTACCGG TCGCCTCCGA GGGGCTGCGC AAGTACACGC AGCAGTACCG CAACCCCTGG
GGGGATGTGG TCACCCGCGA CGGTCACAGC GGCCGGGTCC TGCAGGCGTC CTCGGCCGAG
CTGCGCATCC CCTGGCGCCC GGCCGCGGAG GCCCGTGGTC CGGTGCGCTT GCGCGCGCGA
CTGCACGGGC TCAGCGCCGG GCAGCGGCTG TCGGTGTCGA TCAACGGCGA GCGGGTGGTC
AACGCCTCGC TCGACGATGG CTGGCAGGAC GCGGCCTTTG AGATCAAACG CGGCGCTCTG
CGCGCGGGCG AGAATCAGGT GGTGCTGTTC TTCGGCAAGC GCGGTGGCCC CGAGCGCGCC
TACGCGCTGG TGCACTCGCT GGCCTTCGAA GAGGCGGCCG CGTCGGACAG CGCCGGCGAC
GACAGCGGTG GCGGCGACGG CGGTGACAGC GCGAGCTGGC CGCCGCTGTC GCCGGCCGCT
CAGGTCAGGG TTGCCGACAC CGCGCGCCCG GCCTTGACCG GTTTCACGCG CATGAGCGCC
TACCTCGAGG TGCCCGAGAC CGGCTGGCTC GAGCTGCACA CCGGTGCGCC CGGCGCGGGC
ACGCGCTTTC GCGTCACCGC CCAGCCGGTC GAGGGCGCGG TCGTGGAGAT TCTCGACCAC
ACGTCCGAGG ATGCCGGCTG GCGGCGCCAT CGCCGCGATC TCGCCGAGCT CGCCGGCAAG
CTGGTGATCC TCGAGTTCGC GCTCTCGGGC GAGGGCGCCG AGCAGGCGGC CTGGGGCGAA
CCTCGGCTGG CGCTGGCGGA AGCGCCCACG CGCGCGGCGC CGCCGCCGTA CGACAACGCC
ATCCTGCTGG TGGTCGACGC GCTGCGCTCC GACCGCCTGC GCCTGTACGG GGACACCCGG
GTGAAGACGC CCCATATCAG CGCCGACGGC AACGCCCGCG GCGTGGTCTT CCTGCACAAC
CAGGCGGCCT CGCCCTCGTC GCCGCCCTCG CACGGCAGCA TCCAGACCGG CATGATCCCG
CGCGTCCACG GCGTCACCGG CGACAACGGC AAGCTCGAGC CGGGCACGCC CATGATCAGC
ACCCAGGCCG AGGCCGCCGG GATCGCGGCC GGCTACTACG GCAACAACCC CTTCGGCATG
GGCCGGCTCG AGGCCCCGGG CGAGTGGACG GCCTTCCACC AGCCCAACAA GGAGGGCAAG
GGCATCGACT GCACGGTGCT CATGGACGAG ATGCTCGGCT TTGCCAAGCA GCAAGCCGAT
GCCGGCAAGC GCTTCTTCAT CTCCTCGCTG CCCTACGAGA CGCACACGCC GTATCGCTAC
CACGAGGGCA TCACCGACAA CTATCACGAC GGCGACTGGG GTCCGCCGGT GGGCAAGAAC
GTCGATGGCG TGCTGCTCAG CAAGCTGTCG AGCGCGTCGG TGACCTTGAA CGATGCCCAG
TGGGCTCAGC TCCGCGCGCT CTACGACGGC GAGGCCGAGT ACATGGACGG CTGTTATCAG
CAGCTTCTAG ACGGCCTCGA GGCACGCGGT CTGCGCGAGC GCACGCTGCT GGTGCTCACC
TCGGACCACG GCGAGGGCAT GTACGAACAC GGACGCATGG GCCACGCTTT CGGACACTAC
GCCGAGCTGG CTAATGTGCC GCTGGTGTTC CTCGGTGACG GCCTCACGCC GCAGGGTGCG
GTGCTCGACG CGGTGTCCAG TCACCTCGAT ATCGCGCCGA CGATCCTGGC GCTGCTCGGG
GTGACGCCGA GCGAGCGCAT CCAGGGGCGC GACCTGAGGG CGCAGATGCT GCGCGAGGGG
CCGTGGACGC CGCGGGTGGT GTCGCTCGAG TACGGCCGCA GCTACGCGTT GCGCGCGCGG
CGCTGGAAGT ACATCGTGGA TTATCAGCAG AACGAGAGCC TGTTCGACCT GGTCGAGGAT
CCCAGCGAGC AGCGCGATCT CCTGGGCACG CAGGACATCC CCCTGCGCTA TCTGCGCGAC
CTCGCCGGGG TGTTCCTGGC CCATCGCAAG GCCTGGCGCG CCGACTCCTT CGGCGAGCTC
AACAACCACG CGCCGGGCTT TCTGCGCCAC GTCGCCGCGC CCGTCTTCGA TTGA
 
Protein sequence
MCNATRKGAL LGSLLVLCSP WWLACSDKAG PTTSSPAPEA AGAASGGANP TQAGGEAEAE 
VTGHPSEAEP AIALADSPDV DLVDNRFLWH LAPPAQASGL FVPVASEGLR KYTQQYRNPW
GDVVTRDGHS GRVLQASSAE LRIPWRPAAE ARGPVRLRAR LHGLSAGQRL SVSINGERVV
NASLDDGWQD AAFEIKRGAL RAGENQVVLF FGKRGGPERA YALVHSLAFE EAAASDSAGD
DSGGGDGGDS ASWPPLSPAA QVRVADTARP ALTGFTRMSA YLEVPETGWL ELHTGAPGAG
TRFRVTAQPV EGAVVEILDH TSEDAGWRRH RRDLAELAGK LVILEFALSG EGAEQAAWGE
PRLALAEAPT RAAPPPYDNA ILLVVDALRS DRLRLYGDTR VKTPHISADG NARGVVFLHN
QAASPSSPPS HGSIQTGMIP RVHGVTGDNG KLEPGTPMIS TQAEAAGIAA GYYGNNPFGM
GRLEAPGEWT AFHQPNKEGK GIDCTVLMDE MLGFAKQQAD AGKRFFISSL PYETHTPYRY
HEGITDNYHD GDWGPPVGKN VDGVLLSKLS SASVTLNDAQ WAQLRALYDG EAEYMDGCYQ
QLLDGLEARG LRERTLLVLT SDHGEGMYEH GRMGHAFGHY AELANVPLVF LGDGLTPQGA
VLDAVSSHLD IAPTILALLG VTPSERIQGR DLRAQMLREG PWTPRVVSLE YGRSYALRAR
RWKYIVDYQQ NESLFDLVED PSEQRDLLGT QDIPLRYLRD LAGVFLAHRK AWRADSFGEL
NNHAPGFLRH VAAPVFD