Gene Hoch_3446 details


Gene Information

Locus tag: Hoch_3446
Symbol:
ID: 8545834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4758225
End bp: 4759361
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 70%
IMG OID: 646388113
Product: hypothetical protein
Protein accession: YP_003267841
Protein GI: 262196632
COG category:
COG ID:
TIGRFAM ID: [TIGR00374] conserved hypothetical protein
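
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below reproduces it, assuming the coordinates are 1-based and inclusive (which matches the stated 1137 bp) and that the CDS ends in a single stop codon. The function and constant names are illustrative and not part of the record.

```python
# Coordinates copied from the record; the names are illustrative only.
START_BP = 4758225
END_BP = 4759361


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


print(gene_length(START_BP, END_BP))  # 1137
print(protein_length(1137))           # 378
```

Both results agree with the Gene Length and Protein Length fields of the record.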


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.930769
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.012517
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGG CACTCAATAT CGGTCTGTCG CTGACCATGC TGGCCATCTG CCTGTTCCTG 
GTGTGGCCGG CGCCGGCCGA GCGCGCCCAG CTCGGCGAAG CGCTCGGACG GCTGGACGAA
ATCTACCTGC TCGGCTTCAT CGCGCTGCTG GCGGTGGTGC ACTTCTTCCG CGCCTGGCGC
TGGAACAACC TGCTGGCCCC GCTGGGCGCC CGGCTCGGCG CCGGCCGCCT GCTGGCGGTG
TCCTCGGTCG GCTTCATGGC GATCCTGGCG CTGCCCGCGC GCCTGGGCGA GTTCGTGCGC
CCGGCCCTGG TGCGCGAGCA CGGCACGCTG TCGGCCACGG CCGCGCTGGG CACGGTGGCG
GTCGAGCGCA TCATCGACGG GCTGCTGGTG TCGCTGCTGG TGTTCGCCGC GTTCTTCTCG
CTGCGCGGAC CGGAGGCGCC GCCGTGGATG ATGCCGACCG CGTACGCGGC CCTGGGTATC
TTCTCGGCCG CGCTGGTGTT CCTCGGCTTC GCCATGCGCT GGCCGGAGAA GACCGTGAAC
ACCGCGGTCG CGCTCACGGG CGCGCGCCTG CTGGCGCCGC GCTTCGCCGA GGTGCTGCGC
GAAAAACTCC TGAACATGAT CAGCGGCTTC CTGGTCATGA ACGACCGCCG CAACCTGCTG
TGGTTTCTGC TCTGGAGCCT GGTCTACTGG ATCGCCAACG GCCTCAGCCT GTGGGTGCTC
TCGCTCGGCT TCGATCTCGG CCTGGGCGTG GTCGGCGCCT TCGCCACCAT GGGCCTGGTC
GCGGTCGGCA TCACCCTGCC CAACTCCCCG GGCCTGGTCG GTCAGTATCA ATGGTTCACC
CAGCTCGGCC TGTCGCTGTA TCTCGGCCAG GCCGGCCACG GCGCCACCGG GCTGGCCTTT
GCCATTGTTT TGCACGGGGT CCAGGTCGTC TGGTACATGC TGATGGGAGG CATCGCGCTG
GCCACGCCCT TCGTCTCCCT GCACGAGGTG TGGCGGGCGC GGCGCATCGA CGACGCCCCA
CAGGCCGCCA ACGACGCCCC CGACGACGAC CCCGACGAGG ACCGAGCTAA CATCGCAGAC
GACGCAGCGG GCGCCCGCCT GTCGGCCAAC GCCGCCAACC CGAGCGCGCC GCCCTGA
 
Protein sequence
MKLALNIGLS LTMLAICLFL VWPAPAERAQ LGEALGRLDE IYLLGFIALL AVVHFFRAWR 
WNNLLAPLGA RLGAGRLLAV SSVGFMAILA LPARLGEFVR PALVREHGTL SATAALGTVA
VERIIDGLLV SLLVFAAFFS LRGPEAPPWM MPTAYAALGI FSAALVFLGF AMRWPEKTVN
TAVALTGARL LAPRFAEVLR EKLLNMISGF LVMNDRRNLL WFLLWSLVYW IANGLSLWVL
SLGFDLGLGV VGAFATMGLV AVGITLPNSP GLVGQYQWFT QLGLSLYLGQ AGHGATGLAF
AIVLHGVQVV WYMLMGGIAL ATPFVSLHEV WRARRIDDAP QAANDAPDDD PDEDRANIAD
DAAGARLSAN AANPSAPP
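
The gene and protein sequences can be cross-checked against each other with a few lines of code. This is a minimal sketch with hypothetical helper names (`clean`, `translate`, `gc_content`). It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1; the two tables differ in which codons may act as start codons, and that is not modelled here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed in this record.
prefix = clean("ATGAAGCTGG CACTCAATAT CGGTCTGTCG")
print(translate(prefix))  # MKLALNIGLS
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed in the record, and `gc_content` on the cleaned gene sequence can be compared with the GC content field.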