Gene Hoch_2313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2313 
Symbol 
ID8544699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3216625 
End bp3217845 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content72% 
IMG OID646387018 
ProductEndonuclease/exonuclease/phosphatase 
Protein accessionYP_003266749 
Protein GI262195540 
COG category[R] General function prediction only 
COG ID[COG3568] Metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGA ATAGAGTTCC TATCCCCCGC CCGCTCCGCC TGCTCAGTGT CGTCATCGCC 
CTCGCCGCCG GGCTGAGCGC GTGCGGATTT TCGACCGATC GTACGCCCAA GCGCGAGGTC
GGCGATATCG AGCTGGCCAC CTGGAACCTG TCGTGGCTGG CGAGCGAGGA CGGCGCCGGC
ACCAACCCGC GCACCGAGGC CGACTACGCG CGCCTGCGCA CCTACGCCGC GCGCCTCGAC
GCCGACGTCA TCGCGCTGCA GGAGGTCGCC GACGAGTTCG CCGCGCGCCG GGTGTTCGAC
CCCACGGTCT ACGACTTCGC CATCGCCTCC AAGCGCGGCG CCCAGCGCAG CGGCTTTGCC
TATAAGCGCG ACCTGCGCGT GCGCCGCCAC GCCGACCTCG CCGCCCTCGA CGTCGGCGGT
CTGCGCCCCG GCGTGGACAT CGAACTCGAC CTCGGCCGCG GCGCCGACGG CACCCCGCGG
CGGCTGCGCC TGCTGGCCCT GCACCTCAAG AGCGGCTGCT TCGACGACAG CCTGCGCAAG
CGCTCCAACG CCTGCCGCAA GCTGTCGCGC CAGCTCCCCC AGCTCGAGGG CTGGATCGAC
GCGCGCGCCC GCGAGGGCGT GCCCTTCGCC GTGCTCGGCG ATTTCAATCG GCGCATGAAC
GCGCGCGACG CCTTATGGCG CGAGATCGAC GACGCCGAGC CCGCCGCCGC CGACCTCACC
CTGGTCACCG AGGGCCAGCG CTCGCGCTGC TGGAAGGGCA AGTACCCGCG CTTCATCGAC
CACATCGCCC TCGACCTGCA CGCCTCCGCC TGGCTGGTGC CAAACTCGTT CGAACAGCTC
GTCTACAGCG ACAGCGACAC CGCGCACGCC CGCGCGCTCT CCGATCACTG CCCGATCTCG
GTGCGCCTGC GGCCCAAAGG CGGCCCGGCT CAGCCCGCAG ACCCCGCGAG CGCGCCCGCG
GCAGACCAGC ACGCGGACAC AGCGTCCGAC CAGGGCGCCA GCGACGACGA GGCCGCGGCC
GCGAACGCGA ACGCCGCACA GTCGGCCGAG GGCCTGCGCA TCAAGGGCAA TGTGTCGCGC
AAGCGCAAGC TCTACCACCT GCCCTCGTGT CCCAGCTACG CCAAGGTGCG CATCGACCCC
GACAAGGGCG AGCGCTGGTT CGCCAGCGAG CGCGACGCCC AGGCAGCCGG CTTCCGCAAG
GCCGGCAACT GCCCGCCCTG A
 
Protein sequence
MTKNRVPIPR PLRLLSVVIA LAAGLSACGF STDRTPKREV GDIELATWNL SWLASEDGAG 
TNPRTEADYA RLRTYAARLD ADVIALQEVA DEFAARRVFD PTVYDFAIAS KRGAQRSGFA
YKRDLRVRRH ADLAALDVGG LRPGVDIELD LGRGADGTPR RLRLLALHLK SGCFDDSLRK
RSNACRKLSR QLPQLEGWID ARAREGVPFA VLGDFNRRMN ARDALWREID DAEPAAADLT
LVTEGQRSRC WKGKYPRFID HIALDLHASA WLVPNSFEQL VYSDSDTAHA RALSDHCPIS
VRLRPKGGPA QPADPASAPA ADQHADTASD QGASDDEAAA ANANAAQSAE GLRIKGNVSR
KRKLYHLPSC PSYAKVRIDP DKGERWFASE RDAQAAGFRK AGNCPP