Gene Hoch_3799 details


Gene Information

Locus tag: Hoch_3799
Symbol:
ID: 8546192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5217718
End bp: 5219001
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 70%
IMG OID: 646388469
Product: Cytochrome-c peroxidase
Protein accession: YP_003268192
Protein GI: 262196983
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
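
The coordinates and lengths in this record agree with each other, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5217718
END_BP = 5219001


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues in a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1284, matching "Gene Length"
print(protein_length(length_bp))  # 427, matching "Protein Length"
```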


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0226321
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.14874
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGTCC GTGTACCCGG CTTCGCGATC GCGGGCGCGC TTCTGAGCGC GCTTCTGAGC 
GCGCTATTGA TCGGCGCCTG CGGGGCGGAG TCGAACCCGC AGGCGCCCGG CGCCGGCGGC
GTCGACGCGG CCGCCGACAT GGACGGGGAC ACCCTGGCGC GCGAGGGCTG GCGCATCCCC
GAGAGCTTTC CGCGCCCGCA GGTGCCCGAG GACAACCCCA TGAGCGCGGC CAAGGTCGTG
CTCGGCCAGC ACCTGTTCTA CGACCAGCGC CTGTCGGGCA ACGGCACCCA GTCGTGCGCC
TCGTGCCATC GCCAGGAGCT GGCCTTTGCC GACGGCGAGC GCACGCCCAC CGGCTCCACC
GGCGAGGTCT TGCATCGCAA CGCGCCCGGC CTGGGCAACG CCGCCTACTA CGCCACGCTC
ACCTGGACCA GCCCGGCGCT GCTCGAGCTC GAGTCGCAGA TCCTCATCCC GCTGTTCGGC
GAGACCCCGG TGGAGATGGG CGCCACCGGA CACGAGGACG AAATCCTCGC GCGCCTGCGC
GCCGAGCCCG CGTACGCGCC GCTGTTCGCG GCCGCATACC CCGAAGACGG CGATGCCTAC
ACCTGGGGCA ACATCGTGCG CGCGCTGGCC GCCTTCGTGC GCTCGATGCT CACCGGCGAC
GCGCCCATCG ACGAGTACGT GCACAAGGGC AATTCGGACG GCGTGTCCGA CTCGGTCAAG
CGCGGCCTCG ACCTGTTCCT CAGCGAGCGC CTCGAATGCC ACCACTGCCA CGGCGGCTTC
AACCTCACCA CCGCCACCAA GTACGAGGGC ACCGCGTTCA TCGAGCTGTC CTTTGCCAAC
GTCGGGCTCT ACAACCTCGA CGAGCAGGGC CGCTATCCCG AGGGCAACGA GGGCCTGTGG
ACCTTCACCG GCGACCCCGG CGACATGGGC AAATTCCGCG CGCCCAGCCT GCGCAACGTG
GCCCTCACCG CGCCCTACAT GCACGACGGC AGCATCGCCA CCCTCGACGA GGTCATCGAC
CACTACGAGC GCGGCGGCCG CCTGATCGAA GATGGCCCGC TGGCCGGCGA CGGCCTCGAC
AACCCCAACC GCAGCGGCTT TCTGCACGGC TTCGAGCTCA CGCAGCAAGA GCGCGCCGAC
CTCATCGCGT TTCTCGAGAG CCTCACCGAT GAGGCGTTCC TCGCCGATCC GCGCTTTTCC
GACCCGTGGC CAGCGCCCGC GGCTGCGGCC GAAGCCGAGC CGTACGCGGC CGCGCTTAGC
AGCATCGAGG AGACCACGCC ATGA
 
Protein sequence
MSVRVPGFAI AGALLSALLS ALLIGACGAE SNPQAPGAGG VDAAADMDGD TLAREGWRIP 
ESFPRPQVPE DNPMSAAKVV LGQHLFYDQR LSGNGTQSCA SCHRQELAFA DGERTPTGST
GEVLHRNAPG LGNAAYYATL TWTSPALLEL ESQILIPLFG ETPVEMGATG HEDEILARLR
AEPAYAPLFA AAYPEDGDAY TWGNIVRALA AFVRSMLTGD APIDEYVHKG NSDGVSDSVK
RGLDLFLSER LECHHCHGGF NLTTATKYEG TAFIELSFAN VGLYNLDEQG RYPEGNEGLW
TFTGDPGDMG KFRAPSLRNV ALTAPYMHDG SIATLDEVID HYERGGRLIE DGPLAGDGLD
NPNRSGFLHG FELTQQERAD LIAFLESLTD EAFLADPRFS DPWPAPAAAA EAEPYAAALS
SIEETTP
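
The relationship between the two sequences above can be reproduced with a short sketch that translates a CDS and computes its GC fraction. It relies on two assumptions. First, the sense-codon assignments of translation table 11 are taken to be the same as the standard genetic code. Second, an initiating GTG (or TTG) is read as methionine; `START_CODONS` is only a subset of the starts table 11 permits. The helper names `translate_cds` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments, enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the initiating codon as M and stopping at '*'."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAGCGTCC GTGTACCCGG CTTCGCGATC"))  # MSVRVPGFAI
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the reported GC content.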