Gene Hoch_3035 details


Gene Information

Locus tag: Hoch_3035
Symbol:
ID: 8545423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4193132
End bp: 4194337
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 70%
IMG OID: 646387706
Product: Cytochrome-c peroxidase
Protein accession: YP_003267434
Protein GI: 262196225
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
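
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4193132
end_bp = 4194337

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1206
print(protein_length)  # 401
```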


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0428958
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0720293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGAC TCGCCGCCTC CCTGCTCGCC GCCGCCCTGG TCCCCGCCTG TGCGGTCTCC 
GACGACGTGC CCTTTACCGA CGCGCCCGCG CTCGACGACC CCTTCCTCGA CGCGGTCGAG
AGCGCGCTGT CTCTGCCCAC GACGCCCTAC GCGTACGCCG CGGTGCCGCT GCCGGCGCAC
TTCCAGACGC CCGGCGTGCG CGCGCGCGAC AACACGCCGA GCGACAATCC CGTGACCGAC
GCCGGCGCCA CCCTGGGCCG GGTGCTGTTC TACGACACTG CGCTGTCGGC CAACGACACC
GTGTCGTGCG CCTCGTGCCA TGTGCAGGAG TACGGCTTCT CCGACCCCGC GCGCTTCAGC
GTCGGCTTCG AGGGCGGCAC CACCGGGCGC AACTCCATGG GTCTGAGCAA CTCGCGCTTC
TACGGCAGCG GTCACTTTTT CTGGGACGAG CGGGCGGACA CGCTCGAGGA CCAGGTGCTC
ATGCCCATCC AGGACGCCAC CGAGATGGGC ATGACGCTGG CCGAGCTGGA GATCGCGCTG
GCGGCGAAGA GCTACTATCC GCCGCTGTTC GAGCAGGCCT TTGGCGACCC CGCGATCACC
TCCGAGCGCG TGTCGCTGGC CCTGGCCCAG TTCGTGCGCA GCATCGTGAG CTACCGCTCA
CCCTACGATC AGGGGCTGGC CCTGGCCGGC GGCGACCCGC GCGCGCCCTT TGCCAACTTC
ACGCCGCAGC AGAACCAGGG CAAGGGTCTG TTTTTCGGAC CGCGCGGCGG CTGCGCCATC
TGCCACGTGG ACAACGGACC GCCGGCCCCG GGTCGGGTCC CCGATAACGC CGCCGTGTTC
TTCATCGACC TGGCGGTCAA CAACGGCCTG GACGCGACCC CGAACGCCGA CGACCCCGGG
CTCGGCGGCC ACACCGGGCG CCCCGTGGAC ATCGGCAAGT TCAAGTCGCC GTCGCTGCGC
AACGTCGAGT TCACCGGCCC GTACATGCAC GACGGCCGCA TCGAGACCCT GCGCGGCGTG
GTCCAGTTCT ACAACGCGGG CGTGCAGCCG CATCCCAACC TCGACCCGCG GCTGCGTGCG
CCCGATGGCC GGCCCCGCCG CCTCGGCCTC ACGCCCATGG AGATCGACGC GCTCGTGGCC
TTTCTCCGCA CGCTCAGCGA CGAGGCCATG ATGAGCGACC CCAAGTACAG CGACCCCTTT
CGCTAG
 
Protein sequence
MNRLAASLLA AALVPACAVS DDVPFTDAPA LDDPFLDAVE SALSLPTTPY AYAAVPLPAH 
FQTPGVRARD NTPSDNPVTD AGATLGRVLF YDTALSANDT VSCASCHVQE YGFSDPARFS
VGFEGGTTGR NSMGLSNSRF YGSGHFFWDE RADTLEDQVL MPIQDATEMG MTLAELEIAL
AAKSYYPPLF EQAFGDPAIT SERVSLALAQ FVRSIVSYRS PYDQGLALAG GDPRAPFANF
TPQQNQGKGL FFGPRGGCAI CHVDNGPPAP GRVPDNAAVF FIDLAVNNGL DATPNADDPG
LGGHTGRPVD IGKFKSPSLR NVEFTGPYMH DGRIETLRGV VQFYNAGVQP HPNLDPRLRA
PDGRPRRLGL TPMEIDALVA FLRTLSDEAM MSDPKYSDPF R
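
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table by hand rather than relying on any library; translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons), so the standard assignments are used here. The function name `translate` and the variable names are hypothetical, and only the first and last rows of the gene sequence are translated as a spot check.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares its amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence; both start on a codon boundary.
first_row = "ATGAACCGAC TCGCCGCCTC CCTGCTCGCC GCCGCCCTGG TCCCCGCCTG TGCGGTCTCC"
last_row = "CGCTAG"

print(translate(first_row))  # MNRLAASLLAAALVPACAVS
print(translate(last_row))   # R
```

The two outputs match the first twenty residues and the final residue of the protein sequence listed above.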