Gene Hoch_3007 details


Gene Information

Locus tag: Hoch_3007
Symbol:
ID: 8545395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4163230
End bp: 4164525
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 65%
IMG OID: 646387679
Product: Radical SAM domain protein
Protein accession: YP_003267407
Protein GI: 262196198
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
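
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4163230
end_bp = 4164525
gene_length_bp = 1296
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = span // 3
expected_protein = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein == protein_length_aa)
```

Under these assumptions the span is 1296 bp, which is 432 codons, or 431 amino acids plus one stop codon, in agreement with the listed values.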


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.486348
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTATAC AAACGCGAGA GTGGGCAAAC CGCCTGGATG TAGTCCTCAA GCTGGCCGAA 
CGCTGCAACC TGGCCTGTCC CTACTGCTAC TACTTCTTCC AGGAAAACGA TCTCGCCGAG
AAGAGCGCCA GCGCCCTCAT CCCCGAGTCC ACGGTGCGCG AGGTCGCGGC CTTTCTCCGC
GCAGGCGCGG AAGCGCTCGA CATCGGCCAC CTGTACATCG GTCTACACGG CGGCGAACCG
CTGCTGCTGC CCAAGAAGCG CTTCGATGCC CTGTGCACCA TCCTGCGCGA GGAGCTGGGG
GACGTCGCTC AGGTGCACCT CGGCCTGCAG ACCAACGGCA CCCTGGTCGA TGACGAGTGG
ATCGACCTCT TCGCCAAGCA CGAGGTGATG GTCGGCGTCA GCATCGACGG GCCGCCGCAT
GTCCACGACG CTGGCCGCCC CGACCACCGC GGACGCGGCA GCTACGAGGC CGCCGTGCGC
GGGCTGCGAC GACTGCAGGA CGCGCACGCC GCCGGTCGCG TGCCACCCAC CGGCGTGCTC
TGCGTGGCCA ATCCCGAGCA CGACGGCGGG GAAATCGCGC GCCATTTCGT GGAGGAGCTC
GGGGTCTACA ACTTCGATCT CCTGCTCCCG CGCGAGGGCT ACGACAGCGA GGTGTGGAAG
CCGCAGAGCA AGTGGATCCG CTATTTCCGC GAAGTCATCC GCTACTGGCA GACGAGCAAG
ACCGAACGCC CCGTGGTCAT CTACCTGTTG TCCGAAATCC TCACCGCGCT CTCGTCTCCG
GCCTCGGCAC AGCGCGCCGA TTATCGCCTA TCCAACCGGC ACAGCATCAT CACCATCTCG
AGCGAAGGCG TGTTGAGCGT CGACGACAAC ATCATGGCGC TCGACAAGGT GCTGTGTAAC
CCCGACCTCA CGATTCGCAA CACCACGCTC GCGGAGTTCG TTGCCAGCCC GGTATGGCAA
CACCTGGTGC GCAGCGTGGA CACCGCGCCC GAAGCGTGCG CCCGCTGCGA ATGGTATCGA
AGCTGCCGCT CTGGCGCGCT GTTCAACCGC TACAAAAAAG GCGAAGGCTT CGGCCGCCGG
TCGGTATTCT GTGAGACGCT AGACGCCATC CACACCTCGC TCGCACACAT GGTCGCGCAG
CGCGAGGAGC GTATCGAACG CATGGCCGAA ATCCTGCAAA CGCCGCCCAT GCACTGGGCG
CGTGACTTTC TCACGAGCAA CGCGGAGCCG AGTTCCGACC CCGACGCCTG GCTCGGCCAG
AGCGGTCCCG TGCGGCTGCC CATCTACAAG TCCTGA
 
Protein sequence
MSIQTREWAN RLDVVLKLAE RCNLACPYCY YFFQENDLAE KSASALIPES TVREVAAFLR 
AGAEALDIGH LYIGLHGGEP LLLPKKRFDA LCTILREELG DVAQVHLGLQ TNGTLVDDEW
IDLFAKHEVM VGVSIDGPPH VHDAGRPDHR GRGSYEAAVR GLRRLQDAHA AGRVPPTGVL
CVANPEHDGG EIARHFVEEL GVYNFDLLLP REGYDSEVWK PQSKWIRYFR EVIRYWQTSK
TERPVVIYLL SEILTALSSP ASAQRADYRL SNRHSIITIS SEGVLSVDDN IMALDKVLCN
PDLTIRNTTL AEFVASPVWQ HLVRSVDTAP EACARCEWYR SCRSGALFNR YKKGEGFGRR
SVFCETLDAI HTSLAHMVAQ REERIERMAE ILQTPPMHWA RDFLTSNAEP SSDPDAWLGQ
SGPVRLPIYK S
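
The relationship between the two sequences above can be sketched in code. This is a hedged illustration, not the pipeline that produced the record: it assumes the gene sequence is given on the coding strand, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered TCAG).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last stretches of the gene sequence listed above.
print(translate("ATGTCTATAC AAACGCGAGA GTGGGCAAAC"))
print(translate("AGCGGTCCCG TGCGGCTGCC CATCTACAAG TCCTGA"))
```

Passing the full gene sequence to `translate` should give a string that can be compared with the listed protein sequence once its spaces are removed, and `gc_fraction` on the same input can be compared with the GC content reported in the Gene Information section.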