Gene Hoch_4030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4030 
Symbol 
ID8546431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5532558 
End bp5534855 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content76% 
IMG OID646388707 
Productsulfatase 
Protein accessionYP_003268422 
Protein GI262197213 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.868128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.235672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGA ACTCCGGAGC CCTGCCGCGC TCGCGGCCGC CCCGAGCCGC ACTCGCCAGC 
CCTCCCCAGC GGCCCTGGCG CGCGCTCATC GCCGCCCCCA TCCCGGCCTG CCTGCTCGGC
GCGCTCGCCA TCGCCGCCGT GGAGATCGTC GCCCTGGGCC GCCCCAGCCT CGGCCTCATC
GCCGTGTGCC TCAGCCTGCA TCTCGCGCTG GGCCTGGCCA TCGGCGCCGT GCTCAGCGCG
CAGGAGGCGC TGCTGCGGCG CTGGCCGGGC CTGCGCCGGG GCGCCTCGTG GCTGTACGCG
CTCGGCGCCC TGCCCGCCCT GGCCTTCGTC GCCGCGCACC TGTTCGACGG TGCCCAGGCC
GCCAAGCTGC CGGGCGCCGC CCTCGGCCAC CTGTGGCTGC CGGTCCTCGG CGTGCTCGCC
GCCGCCGCCG CCATCGCCCT CGCCCGCCGC CTGCGGCGCC GACTCGGCGC CGCGCCCGGC
GCCCTCCTGC TGGCCCTGAG CGCGGCCCTG GTCGAATACC TCGGCCGCAC CCTCTACCCC
AGCGAGTACG CCGACGTGCA CGCCTTCCTG GTCGCGCTCG CGTGCCTGCT GGCCGCGCTC
AGCGTGCGCG TGGCCGCCGC CGGCAGCGCG CCGAGCGCCG GCGCTTCGGC CGCGCCGCCG
AGCCAGCCAG CCGCCGCCTC GCGCCTCGTG ATCCCGGGCC TGGCCGGCGC CGTCGTGGCC
GCCCTCATCG CCAGCCTGGT CTTCGGCCTG GCCGACAAAG ACGAGCGCCT GTGGGTCGCC
ACCCACGGCA CCGACCTGCG CCACCTGGTG CGCGTCGCCC GCCTGGCCTT CGACCGCGAC
CGCGACGGCG CCGCCGTGGT CCTCGGCGGC GGCGACTGCG ACGACGGCGA CCCCGCCATC
TCGCCGACCG CGCCCGAGCG CCCCGGCAAC GGCATCGACG AGGACTGCGA CGGCCGCGAC
CTCGAAGCGC TCGCGCTGCC CGCCAGCGAC CGCGACTGGG ACGCCGAGCT CGCCGCCTGG
GCGCGCGCGC CCGCGCTCGA ACACACCCTG GCGCGCACCC GCGACATGAA CATCCTGGTC
GTCGCCGTCG ACACCCTGCG CGCCGACGCG CTCGCGCTCC CGGGCGACGA CGACGCCGCC
GACGGCGCCC CCGCGGACGG CGCCCCGGCG GACAGCGAGC AGGCCGTCGC CAGCGACACC
CCGCACCTCG ACGCCCTGCT CGCCGACGCC GCCGTCTTCC GCCACGCCTA CGCCACCGGC
GCGGGCACCG ACATCTCGGT CTCGAGCACC ATCACCGGCC ACATCTCGCC CTTCGAGGGC
GCCGAGATCA CGCTCGCCGA GGCCCTGCGC GCGACCGGCC GCGCCACCGC CGCCGTGTTC
CCCACCGAGG TGCTGCGCTA CGCCGGCCAG GTGCTGCTCG GCCGCGGCAT CGACCGTCTG
CGCCGCTACG TCAACGACCG CGGCCAGCGC GACGTCGGCT CGTACACCAC CTCGGCCGAG
ACCACGCGCA TGGCGCTCGC CACCATCGAC GCCATGGGCC CGCACAAGCG GCCCTTCTTC
CTGTGGACGC ACTACTTCGA CGTGCACGAG CACGCCCAGG TGCTGGCCAG CGACCCCACC
CTGGCCCGCT ACGGCGAGCG CTTCGAGCTC GGCGAGCTGG CCGGCAAGTA CCGCGCCCTG
GTCGCGCTCA CCGACGCCGA GATCGGACGA CTGTTCGAAG AGCTGCGCGC GCGCGATCTC
TGGGACCAGA CCATCATCGT GCTGCTGAGC GACCACGGCG AGAGCATGGG CGAGGACCCG
CGCCTGCCGC TGCGCCACGG CCGCTTCGTG TACAACGCCC TCACCCACGT GCCCCTGGTC
ATCCGCGTCC CCGGCGCGAC GCCGCGCGTG GTCGATGCCC CGGTCTCGGT CGTCGACCTC
ATGCCCACGC TGCTCACCCT GGTCGGCGCC GAGCTCCCGC CCGGACTCGA CGGCCGCTCG
CTGCTGCCGC TGTTTCTGGG CTCGGCGCCC GGCCTGCCGC CGCGGCCCGT GGCCATGAGC
GAGAGCGAGC AGAGCGCCGT CATCGTGTGG CCCTACAAGC TGCTCCTGCG CCCGGCCGAC
AACCTGGTCG AGCTCTACGA CCTGGCCAGC GATCCCGCCG AACAGCGCGA CCTGGCCGAG
CAACAGCCCC AGCGCGTCTC GGCGCTGCGC GCCATCATGG CCCAGCTTCC GCAGGTCGAG
ATCGACCGAA CCCGTCGCGG ACGGCGCGCG CGAGATGCGC GCTCGCGAGC ACCACGAGCG
CGTACGCCAG CACCGTGA
 
Protein sequence
MTSNSGALPR SRPPRAALAS PPQRPWRALI AAPIPACLLG ALAIAAVEIV ALGRPSLGLI 
AVCLSLHLAL GLAIGAVLSA QEALLRRWPG LRRGASWLYA LGALPALAFV AAHLFDGAQA
AKLPGAALGH LWLPVLGVLA AAAAIALARR LRRRLGAAPG ALLLALSAAL VEYLGRTLYP
SEYADVHAFL VALACLLAAL SVRVAAAGSA PSAGASAAPP SQPAAASRLV IPGLAGAVVA
ALIASLVFGL ADKDERLWVA THGTDLRHLV RVARLAFDRD RDGAAVVLGG GDCDDGDPAI
SPTAPERPGN GIDEDCDGRD LEALALPASD RDWDAELAAW ARAPALEHTL ARTRDMNILV
VAVDTLRADA LALPGDDDAA DGAPADGAPA DSEQAVASDT PHLDALLADA AVFRHAYATG
AGTDISVSST ITGHISPFEG AEITLAEALR ATGRATAAVF PTEVLRYAGQ VLLGRGIDRL
RRYVNDRGQR DVGSYTTSAE TTRMALATID AMGPHKRPFF LWTHYFDVHE HAQVLASDPT
LARYGERFEL GELAGKYRAL VALTDAEIGR LFEELRARDL WDQTIIVLLS DHGESMGEDP
RLPLRHGRFV YNALTHVPLV IRVPGATPRV VDAPVSVVDL MPTLLTLVGA ELPPGLDGRS
LLPLFLGSAP GLPPRPVAMS ESEQSAVIVW PYKLLLRPAD NLVELYDLAS DPAEQRDLAE
QQPQRVSALR AIMAQLPQVE IDRTRRGRRA RDARSRAPRA RTPAP