Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_5281 |
Symbol | |
ID | 4644671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 5655881 |
End bp | 5658580 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639808756 |
Product | aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding |
Protein accession | YP_956058 |
Protein GI | 120406229 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.434999 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATCA CCGTCAACGG CACCGACCGC ACTGACGACC CGCGCCCCGG GCAGTGCCTG CGGACGTTCC TGCGCGACCT CGGGCACGTC GAGGTCAAGA AGGGCTGTGA CGCCGGCGAC TGCGGTGCGT GCTCGGTGCT GGTGGATGGT GCCGCCGTGC ACTCCTGTGT CTACCCGGCT TTCCGGGCCG ACGGGCGGAC GGTGACGACC GTCGCCGGCC TCGGCACGCC GAAGGATCTG CACCCGATGC AGCGGCGTTT CGTCGAGGCC GCCGGCTTCC AGTGCGGCTT CTGCACCGCC GGCATGGTCA CCACCGCCTC GGCGCTGACC GCCGATCAGC TCGAGCACCT GCCCCAGCAC CTCAAGGGCA ACCTGTGCCG CTGCACCGGC TACCGCGCCA TCACCGATGC CATCGCGGGG GTGGTGAACA CCGAGAAGTC GGGCGGCAGC GACGTGGGTC GGTCGATCGG CGCACCGGCC GGGGTCCGGG TGGTGACCGG CACAGAGGAA TACACCATGG ATGACACGCC GGTCGGCCTG CTGCACATGG CCGTGCTGGC CAGCCCCGTA CCGCACGCGC GGATCCGGTC CATCGACACC GCTGCGGCCG AATCGATTCC CGGTGTACGC CTGGTCCTGA CCCACCGGGA CAGTCCGGCT GTACGGTTCT CCACCGCCCG CCACGAGTCG CGCGACGACG ACCCCGATGA CACCGTCATC CTCGACGACA CCGTGCGATT CGTCGGGCAG CGCGTCGCGG CCGTGGTCGC CGACAGCGTC GCTGTCGCGG AGGCGGCCTG CCGAGCGATC GTCGTCGACT ACGAACCACG CCCCGCGGTG TTCGATCCGG ACACCGCGCG CCGGCCCGGC GCCCCGCTGC TGCACGCCGA CAAGGGACCC GAGTCCCGGA TCGCCGACCC GTCGCGCAAC CTGGTCGCCG AGTTGCACGG CGAGGTCGGC GATGTCGCCG AGGGAGTCAG GGCGGCCGAA GCCGGCGGCG GCGCCGTGGT ACGGGGGTCT TGGCGGACCC AGCGGGTCCA GCACGCGCAC CTGGAGACCC ACGGGTGCAC CGGATGGCGA GACGACGCCG GCCGCCTCGT GATCCGCACC AGCTCACAGG TCCCGTTTCT GGTGCGCGAC GAGCTGTGCC ACATCTTCGG CCTGGACACC GACGAGGTCC GGGTGTTCAC GCGGCGAGTG GGCGGCGGGT TCGGCGGTAA GCAGGAGATG CTCGCCGAGG ATCTCGTGGC GCTCGCCGTG TTGCGGCTCG GTGCGCCGGT GCGGTACGAG TTCAGCCGCA CCGACGAGTT CACCGTCGCC CCGTGCCGGC ACCCGTTCCG TGTCGAGGTG ACCGTGGCCG CCGGCCGTGA CGGCAGGCTC ACCGCTCTTG CGGTCGACGC GCTGGTCGAC GCCGGGGCCT ACGGCAACCA CAGCCCCGGC GTGATGTTCC ACGGCTGCGG CGAATCCGTC GCGGTGTACC GCAGCCCGAA CAAGCGCGTC GACGCAGAAG CCGTCTACAC CAACAACCTG CCGTCCGGAG CCTTCCGCGG CTACGGGCTG GGCCAGATCT CGTTCGCCGT CGAGTCCGCG ATCGACGAAC TGGCCGCCCG GCTCGGCATC GATCCTTTCG AGTTCCGCCG CCGCAATGTC GTCGTCCCCG GCGACACCTT CGTAGATTCC CACGTCCTCG AGGATGATCT GACGTTCGGC AGCTACGGTC TGGACCAGTG TCTGGACCTC GCCGAGGCTG CGCTGCGGGC AGGCAACGGA GCGACGGCGC CCGCCGGCTG GGCGGTCGGC GAAGGTATGG CGGTGGCCAT GATCGCCACG ATCCCGCCGC GCGGCCACTT CACCGAGGTG TCGGTGTCGG TCGACGCCGA CGGCGTCTAC ACCCTCGACG TCGGAACCGC CGAATTCGGC AACGGCACCA CGACGGTGCA CGTTCAACTC GCGGCCGCCG AGCTGAACAC CGTGCCCGAA CGAATCGTCG TCCGGCAATC CGACACCGCC ACAACGGGTT ATGACACCGG CGCGTTCGGG TCCGCAGGAA CGGTGGTGGC CGGTCTTGCC ATCCTGGCGG CCAGTCGAGA GTTGCGGACC GCGCTGGTCA CCGCGGCGGC GGAGTTGACC GGCGCCGCAC CGTCGTCGTG TGTGTTGGGC CGCAACGGAG TCCAGTGCGA TGCCCGGCTG GTGGACTTCG GTTCCCTGCC GACCCCGATG AATCGCACCG CCCGACATGA CGGTACACCC CGATCGGTCG CGTTCAACGT GCACGCGTTC CGGGTGGCGG TCGACACCGA GACCGGCGAG GTGCGGATTC TGCAGTCGAT CCAGGCCGCC GATGCCGGGG TGGTGATCAA CCCGCAGCAG TGCCGGGGCC AGGTGGAAGG TGGCGTGGCG CAGGCGATCG GGTCGGCGCT GTTCGAAGAG ATCCTGATCG GCCCTGACGG TGCGGTGATG ACGAAGGCGT TGCGCGACTA CCACATTCCA CAGGTCGCGG ACGTTCCGGC GACCGAGGTC TACTTCGCCG ACACTCACGA CGACTTGGGA CCGCTGGGGG CGAAGTCGAT GAGCGAATCT CCGTACAACC CGGTCGCGCC CGCTCTGGCG AATGCAATCG CCAGAGCGTG CGGGGCCCGG GTCTGTCAGT TGCCGATGAC CCCGGCACGG GTTTGGCGGG CGGTCAGAGC CTCTCGATGA
|
Protein sequence | MRITVNGTDR TDDPRPGQCL RTFLRDLGHV EVKKGCDAGD CGACSVLVDG AAVHSCVYPA FRADGRTVTT VAGLGTPKDL HPMQRRFVEA AGFQCGFCTA GMVTTASALT ADQLEHLPQH LKGNLCRCTG YRAITDAIAG VVNTEKSGGS DVGRSIGAPA GVRVVTGTEE YTMDDTPVGL LHMAVLASPV PHARIRSIDT AAAESIPGVR LVLTHRDSPA VRFSTARHES RDDDPDDTVI LDDTVRFVGQ RVAAVVADSV AVAEAACRAI VVDYEPRPAV FDPDTARRPG APLLHADKGP ESRIADPSRN LVAELHGEVG DVAEGVRAAE AGGGAVVRGS WRTQRVQHAH LETHGCTGWR DDAGRLVIRT SSQVPFLVRD ELCHIFGLDT DEVRVFTRRV GGGFGGKQEM LAEDLVALAV LRLGAPVRYE FSRTDEFTVA PCRHPFRVEV TVAAGRDGRL TALAVDALVD AGAYGNHSPG VMFHGCGESV AVYRSPNKRV DAEAVYTNNL PSGAFRGYGL GQISFAVESA IDELAARLGI DPFEFRRRNV VVPGDTFVDS HVLEDDLTFG SYGLDQCLDL AEAALRAGNG ATAPAGWAVG EGMAVAMIAT IPPRGHFTEV SVSVDADGVY TLDVGTAEFG NGTTTVHVQL AAAELNTVPE RIVVRQSDTA TTGYDTGAFG SAGTVVAGLA ILAASRELRT ALVTAAAELT GAAPSSCVLG RNGVQCDARL VDFGSLPTPM NRTARHDGTP RSVAFNVHAF RVAVDTETGE VRILQSIQAA DAGVVINPQQ CRGQVEGGVA QAIGSALFEE ILIGPDGAVM TKALRDYHIP QVADVPATEV YFADTHDDLG PLGAKSMSES PYNPVAPALA NAIARACGAR VCQLPMTPAR VWRAVRASR
|
| |