Gene Hoch_3951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3951 
Symbol 
ID8546347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5448893 
End bp5451148 
Gene Length2256 bp 
Protein Length751 aa 
Translation table11 
GC content72% 
IMG OID646388623 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding protein 
Protein accessionYP_003268343 
Protein GI262197134 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.840508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0743558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGC GTGACATTCT GGCCGGCGCG CACCGCGGCG GCGACAGTGA CAGCGAACAG 
AGCTGGCTGC GGCCCAGCCG GCGCGCCTTC CTCAAGGGCA CGGCGGCCGC GGGCGCCGGC
CTGGTCATCG GCTTTCAGGT CGGCTGCGGC GGCAAGGCCC AGGATCCCGG CACCTCGCCC
GAGCAGCCCG GCGCGGGCGA GGACGAGTTC GCGCCCAACG CGTTCTTGCG CATCGCGCCC
GACGACTCGG TCACCGTGGT GTCCAAACAC ATCGAGTTCG GCCAGGGCAC CTACACCGGC
CTGGCCACGA TCCTGGCCGA GGAGCTGGGC GCCGACTGGA ACCAGGTGCG CGTCGAGTCG
GCTCCGGCCG ACGCCTCGCG CTACGCCAAC CTGGCTTTCG GCATGCAGGG CACCGGCGGC
AGCAACGCCA TGGCCAACTC CTGGCAGCAG CTCCGCGAGG CCGGCGCCAC CGGGCGCGCG
CTGCTGATCG CGGCCGCCTC CGACACCTGG GGCGTGAGCG CCGCCGACAT CACGGTCGAG
CGCGGCGTGG TCGCGCACGC CGGCAGCGGC CGCATGGCGC GCTTCGGCGA GCTGGTGGAC
AAGGCCGCCA CCATGCCGCT GCCGGCCAAG GTGGTCCTCA AGGACCCCGA GAACTTCACG
CTCATCGGCA CCGACGTGCC GCGCGTGGAC GTCGCCGGCA AGACCAACGG CGCCGCGCAG
TTCACGCTCG ACGTGTACCT GCCCGGCATG CTCACGGCCC TGGTGGCGCG GCCGCCGCGC
TTCGGCGCCA AGCCGGCCCG GGTCGACGCC AGCGCGGCCG AGGCCATGCC CGGTGTCGTC
CAGGTGGTCG AAATCGCCAG CGGCATCGCC GTGGTGGCCA AGAACTTCTG GGCCGCCAAG
AAGGGCCGCG ACGCGCTCGC GATCGAGTGG AACGAGGACG CGGCCGAGAC CCGCAGCTCG
GACGAGATGC AGGAGGCCCT GCGCCAGATG CTCGAGCAGG ACGGCATCGT CGCCAAGCAG
GAAGGCGACA TGGCCGCGGC GCTGGCTTCG GCCGCGCGCG TGGTCGAGGC CGAGTTCGAG
TTCCCGTACC TGGCGCACGC GCCCATGGAG ACCATGGACT GCGTGGCCAA GTTCGAGGAC
GGCCGCTGCG AGATGTGGTT CGGCTCGCAG ATCCAGACCA CGGATCAGAT GGGCGCGGCC
CAGGTGCTCG GCATCCAGCC GCAGAACGTC ATCATCCACA CCCTGCTGGC CGGCGGCAGC
TTCGGCCGCC GCGGCACCTT CGACGGCGCC ATCGCGGTCG AGTGCGCGAG CCTGCTCAAG
GCCACCGGCT CCACCGCGCC GATCAAGCTG GTGTGGACGC GCGAGGACGA CATCCGCGGC
GGCTTCTACC GGCCCATCTT CCGCCACCGC ATGCGCGGCG CCATCGACGC CCAGGGCAAG
GTCGCCGGCT GGGAGCATCG CCTCGCCGGC CCGTCGATCA TGCTGGCCAC GCCCGCGGGC
TCGCAGATGG TGCAAAACGG TGTCGACCCG ACCTCGGTCG AGGGCGCGGC GCCGCCCGAC
TACCAGCTCG ACAATCTCTA CGTCGACGTG CGCAACGCCG AGTTCGGCCC CAACCCGCAC
TTCTGGCGCT CGGTCGGCAG CACGCACACG GCCTTTGCCG TCGAGGTCTT CATCGACATG
CTGGCCGAGG CCATGGGCCA GGATCCCGTG GACCTGCGCC GCACCCTGCT CGGCGACAAG
CAGCGCCACC TGGCGGTTCT CGACCTGGTG GTCGAAAAAT CCGGCTGGGG CTCGGCGATG
CCCCGCGGCA AAGCGCGCGG CATCGCCATC CACGAGTCGT TTGGCAGCGT GGTGGCCGAG
GTCGCCGAGG TGTCGCTGGC CGAGGACGGC ATGCCCAAGG TCGAGCGCGT GGTCTGCGCC
GTGGACTGCG GCGTGGCCAT CAACCCCGAC AACGTCCGCG CGCAGGTCGA GGGCGGCCTG
GGCTACGGCC TCGGCGCCGC CCTGTACAAC GAGATCACGC TCGAGGGCGG CCGGGTCGTG
CAGAGCAACT TCGACCAGTA CCGGCCGCTG CGCATCCAGG ACATGCCCAC GGTCGAGGTC
CACATCGTGC CCTCGGGCAA CGCGCCCTCG GGCATCGGCG AGCCCGGCCT GCCGCCGATC
GCGCCGGCCG TGGCCAACGC GTACTTCCGG CTCACCGGCA AGCGCATCAC CAGCCTGCCG
TTCGCGCGGG CCATCACCAA GCAACGCCGA GGCTGA
 
Protein sequence
MKLRDILAGA HRGGDSDSEQ SWLRPSRRAF LKGTAAAGAG LVIGFQVGCG GKAQDPGTSP 
EQPGAGEDEF APNAFLRIAP DDSVTVVSKH IEFGQGTYTG LATILAEELG ADWNQVRVES
APADASRYAN LAFGMQGTGG SNAMANSWQQ LREAGATGRA LLIAAASDTW GVSAADITVE
RGVVAHAGSG RMARFGELVD KAATMPLPAK VVLKDPENFT LIGTDVPRVD VAGKTNGAAQ
FTLDVYLPGM LTALVARPPR FGAKPARVDA SAAEAMPGVV QVVEIASGIA VVAKNFWAAK
KGRDALAIEW NEDAAETRSS DEMQEALRQM LEQDGIVAKQ EGDMAAALAS AARVVEAEFE
FPYLAHAPME TMDCVAKFED GRCEMWFGSQ IQTTDQMGAA QVLGIQPQNV IIHTLLAGGS
FGRRGTFDGA IAVECASLLK ATGSTAPIKL VWTREDDIRG GFYRPIFRHR MRGAIDAQGK
VAGWEHRLAG PSIMLATPAG SQMVQNGVDP TSVEGAAPPD YQLDNLYVDV RNAEFGPNPH
FWRSVGSTHT AFAVEVFIDM LAEAMGQDPV DLRRTLLGDK QRHLAVLDLV VEKSGWGSAM
PRGKARGIAI HESFGSVVAE VAEVSLAEDG MPKVERVVCA VDCGVAINPD NVRAQVEGGL
GYGLGAALYN EITLEGGRVV QSNFDQYRPL RIQDMPTVEV HIVPSGNAPS GIGEPGLPPI
APAVANAYFR LTGKRITSLP FARAITKQRR G