Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3951 |
Symbol | |
ID | 8546347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5448893 |
End bp | 5451148 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646388623 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding protein |
Protein accession | YP_003268343 |
Protein GI | 262197134 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.840508 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0743558 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTGC GTGACATTCT GGCCGGCGCG CACCGCGGCG GCGACAGTGA CAGCGAACAG AGCTGGCTGC GGCCCAGCCG GCGCGCCTTC CTCAAGGGCA CGGCGGCCGC GGGCGCCGGC CTGGTCATCG GCTTTCAGGT CGGCTGCGGC GGCAAGGCCC AGGATCCCGG CACCTCGCCC GAGCAGCCCG GCGCGGGCGA GGACGAGTTC GCGCCCAACG CGTTCTTGCG CATCGCGCCC GACGACTCGG TCACCGTGGT GTCCAAACAC ATCGAGTTCG GCCAGGGCAC CTACACCGGC CTGGCCACGA TCCTGGCCGA GGAGCTGGGC GCCGACTGGA ACCAGGTGCG CGTCGAGTCG GCTCCGGCCG ACGCCTCGCG CTACGCCAAC CTGGCTTTCG GCATGCAGGG CACCGGCGGC AGCAACGCCA TGGCCAACTC CTGGCAGCAG CTCCGCGAGG CCGGCGCCAC CGGGCGCGCG CTGCTGATCG CGGCCGCCTC CGACACCTGG GGCGTGAGCG CCGCCGACAT CACGGTCGAG CGCGGCGTGG TCGCGCACGC CGGCAGCGGC CGCATGGCGC GCTTCGGCGA GCTGGTGGAC AAGGCCGCCA CCATGCCGCT GCCGGCCAAG GTGGTCCTCA AGGACCCCGA GAACTTCACG CTCATCGGCA CCGACGTGCC GCGCGTGGAC GTCGCCGGCA AGACCAACGG CGCCGCGCAG TTCACGCTCG ACGTGTACCT GCCCGGCATG CTCACGGCCC TGGTGGCGCG GCCGCCGCGC TTCGGCGCCA AGCCGGCCCG GGTCGACGCC AGCGCGGCCG AGGCCATGCC CGGTGTCGTC CAGGTGGTCG AAATCGCCAG CGGCATCGCC GTGGTGGCCA AGAACTTCTG GGCCGCCAAG AAGGGCCGCG ACGCGCTCGC GATCGAGTGG AACGAGGACG CGGCCGAGAC CCGCAGCTCG GACGAGATGC AGGAGGCCCT GCGCCAGATG CTCGAGCAGG ACGGCATCGT CGCCAAGCAG GAAGGCGACA TGGCCGCGGC GCTGGCTTCG GCCGCGCGCG TGGTCGAGGC CGAGTTCGAG TTCCCGTACC TGGCGCACGC GCCCATGGAG ACCATGGACT GCGTGGCCAA GTTCGAGGAC GGCCGCTGCG AGATGTGGTT CGGCTCGCAG ATCCAGACCA CGGATCAGAT GGGCGCGGCC CAGGTGCTCG GCATCCAGCC GCAGAACGTC ATCATCCACA CCCTGCTGGC CGGCGGCAGC TTCGGCCGCC GCGGCACCTT CGACGGCGCC ATCGCGGTCG AGTGCGCGAG CCTGCTCAAG GCCACCGGCT CCACCGCGCC GATCAAGCTG GTGTGGACGC GCGAGGACGA CATCCGCGGC GGCTTCTACC GGCCCATCTT CCGCCACCGC ATGCGCGGCG CCATCGACGC CCAGGGCAAG GTCGCCGGCT GGGAGCATCG CCTCGCCGGC CCGTCGATCA TGCTGGCCAC GCCCGCGGGC TCGCAGATGG TGCAAAACGG TGTCGACCCG ACCTCGGTCG AGGGCGCGGC GCCGCCCGAC TACCAGCTCG ACAATCTCTA CGTCGACGTG CGCAACGCCG AGTTCGGCCC CAACCCGCAC TTCTGGCGCT CGGTCGGCAG CACGCACACG GCCTTTGCCG TCGAGGTCTT CATCGACATG CTGGCCGAGG CCATGGGCCA GGATCCCGTG GACCTGCGCC GCACCCTGCT CGGCGACAAG CAGCGCCACC TGGCGGTTCT CGACCTGGTG GTCGAAAAAT CCGGCTGGGG CTCGGCGATG CCCCGCGGCA AAGCGCGCGG CATCGCCATC CACGAGTCGT TTGGCAGCGT GGTGGCCGAG GTCGCCGAGG TGTCGCTGGC CGAGGACGGC ATGCCCAAGG TCGAGCGCGT GGTCTGCGCC GTGGACTGCG GCGTGGCCAT CAACCCCGAC AACGTCCGCG CGCAGGTCGA GGGCGGCCTG GGCTACGGCC TCGGCGCCGC CCTGTACAAC GAGATCACGC TCGAGGGCGG CCGGGTCGTG CAGAGCAACT TCGACCAGTA CCGGCCGCTG CGCATCCAGG ACATGCCCAC GGTCGAGGTC CACATCGTGC CCTCGGGCAA CGCGCCCTCG GGCATCGGCG AGCCCGGCCT GCCGCCGATC GCGCCGGCCG TGGCCAACGC GTACTTCCGG CTCACCGGCA AGCGCATCAC CAGCCTGCCG TTCGCGCGGG CCATCACCAA GCAACGCCGA GGCTGA
|
Protein sequence | MKLRDILAGA HRGGDSDSEQ SWLRPSRRAF LKGTAAAGAG LVIGFQVGCG GKAQDPGTSP EQPGAGEDEF APNAFLRIAP DDSVTVVSKH IEFGQGTYTG LATILAEELG ADWNQVRVES APADASRYAN LAFGMQGTGG SNAMANSWQQ LREAGATGRA LLIAAASDTW GVSAADITVE RGVVAHAGSG RMARFGELVD KAATMPLPAK VVLKDPENFT LIGTDVPRVD VAGKTNGAAQ FTLDVYLPGM LTALVARPPR FGAKPARVDA SAAEAMPGVV QVVEIASGIA VVAKNFWAAK KGRDALAIEW NEDAAETRSS DEMQEALRQM LEQDGIVAKQ EGDMAAALAS AARVVEAEFE FPYLAHAPME TMDCVAKFED GRCEMWFGSQ IQTTDQMGAA QVLGIQPQNV IIHTLLAGGS FGRRGTFDGA IAVECASLLK ATGSTAPIKL VWTREDDIRG GFYRPIFRHR MRGAIDAQGK VAGWEHRLAG PSIMLATPAG SQMVQNGVDP TSVEGAAPPD YQLDNLYVDV RNAEFGPNPH FWRSVGSTHT AFAVEVFIDM LAEAMGQDPV DLRRTLLGDK QRHLAVLDLV VEKSGWGSAM PRGKARGIAI HESFGSVVAE VAEVSLAEDG MPKVERVVCA VDCGVAINPD NVRAQVEGGL GYGLGAALYN EITLEGGRVV QSNFDQYRPL RIQDMPTVEV HIVPSGNAPS GIGEPGLPPI APAVANAYFR LTGKRITSLP FARAITKQRR G
|
| |