Gene Mvan_5281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5281 
Symbol 
ID4644671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5655881 
End bp5658580 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content70% 
IMG OID639808756 
Productaldehyde oxidase and xanthine dehydrogenase, molybdopterin binding 
Protein accessionYP_956058 
Protein GI120406229 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
[COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.434999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATCA CCGTCAACGG CACCGACCGC ACTGACGACC CGCGCCCCGG GCAGTGCCTG 
CGGACGTTCC TGCGCGACCT CGGGCACGTC GAGGTCAAGA AGGGCTGTGA CGCCGGCGAC
TGCGGTGCGT GCTCGGTGCT GGTGGATGGT GCCGCCGTGC ACTCCTGTGT CTACCCGGCT
TTCCGGGCCG ACGGGCGGAC GGTGACGACC GTCGCCGGCC TCGGCACGCC GAAGGATCTG
CACCCGATGC AGCGGCGTTT CGTCGAGGCC GCCGGCTTCC AGTGCGGCTT CTGCACCGCC
GGCATGGTCA CCACCGCCTC GGCGCTGACC GCCGATCAGC TCGAGCACCT GCCCCAGCAC
CTCAAGGGCA ACCTGTGCCG CTGCACCGGC TACCGCGCCA TCACCGATGC CATCGCGGGG
GTGGTGAACA CCGAGAAGTC GGGCGGCAGC GACGTGGGTC GGTCGATCGG CGCACCGGCC
GGGGTCCGGG TGGTGACCGG CACAGAGGAA TACACCATGG ATGACACGCC GGTCGGCCTG
CTGCACATGG CCGTGCTGGC CAGCCCCGTA CCGCACGCGC GGATCCGGTC CATCGACACC
GCTGCGGCCG AATCGATTCC CGGTGTACGC CTGGTCCTGA CCCACCGGGA CAGTCCGGCT
GTACGGTTCT CCACCGCCCG CCACGAGTCG CGCGACGACG ACCCCGATGA CACCGTCATC
CTCGACGACA CCGTGCGATT CGTCGGGCAG CGCGTCGCGG CCGTGGTCGC CGACAGCGTC
GCTGTCGCGG AGGCGGCCTG CCGAGCGATC GTCGTCGACT ACGAACCACG CCCCGCGGTG
TTCGATCCGG ACACCGCGCG CCGGCCCGGC GCCCCGCTGC TGCACGCCGA CAAGGGACCC
GAGTCCCGGA TCGCCGACCC GTCGCGCAAC CTGGTCGCCG AGTTGCACGG CGAGGTCGGC
GATGTCGCCG AGGGAGTCAG GGCGGCCGAA GCCGGCGGCG GCGCCGTGGT ACGGGGGTCT
TGGCGGACCC AGCGGGTCCA GCACGCGCAC CTGGAGACCC ACGGGTGCAC CGGATGGCGA
GACGACGCCG GCCGCCTCGT GATCCGCACC AGCTCACAGG TCCCGTTTCT GGTGCGCGAC
GAGCTGTGCC ACATCTTCGG CCTGGACACC GACGAGGTCC GGGTGTTCAC GCGGCGAGTG
GGCGGCGGGT TCGGCGGTAA GCAGGAGATG CTCGCCGAGG ATCTCGTGGC GCTCGCCGTG
TTGCGGCTCG GTGCGCCGGT GCGGTACGAG TTCAGCCGCA CCGACGAGTT CACCGTCGCC
CCGTGCCGGC ACCCGTTCCG TGTCGAGGTG ACCGTGGCCG CCGGCCGTGA CGGCAGGCTC
ACCGCTCTTG CGGTCGACGC GCTGGTCGAC GCCGGGGCCT ACGGCAACCA CAGCCCCGGC
GTGATGTTCC ACGGCTGCGG CGAATCCGTC GCGGTGTACC GCAGCCCGAA CAAGCGCGTC
GACGCAGAAG CCGTCTACAC CAACAACCTG CCGTCCGGAG CCTTCCGCGG CTACGGGCTG
GGCCAGATCT CGTTCGCCGT CGAGTCCGCG ATCGACGAAC TGGCCGCCCG GCTCGGCATC
GATCCTTTCG AGTTCCGCCG CCGCAATGTC GTCGTCCCCG GCGACACCTT CGTAGATTCC
CACGTCCTCG AGGATGATCT GACGTTCGGC AGCTACGGTC TGGACCAGTG TCTGGACCTC
GCCGAGGCTG CGCTGCGGGC AGGCAACGGA GCGACGGCGC CCGCCGGCTG GGCGGTCGGC
GAAGGTATGG CGGTGGCCAT GATCGCCACG ATCCCGCCGC GCGGCCACTT CACCGAGGTG
TCGGTGTCGG TCGACGCCGA CGGCGTCTAC ACCCTCGACG TCGGAACCGC CGAATTCGGC
AACGGCACCA CGACGGTGCA CGTTCAACTC GCGGCCGCCG AGCTGAACAC CGTGCCCGAA
CGAATCGTCG TCCGGCAATC CGACACCGCC ACAACGGGTT ATGACACCGG CGCGTTCGGG
TCCGCAGGAA CGGTGGTGGC CGGTCTTGCC ATCCTGGCGG CCAGTCGAGA GTTGCGGACC
GCGCTGGTCA CCGCGGCGGC GGAGTTGACC GGCGCCGCAC CGTCGTCGTG TGTGTTGGGC
CGCAACGGAG TCCAGTGCGA TGCCCGGCTG GTGGACTTCG GTTCCCTGCC GACCCCGATG
AATCGCACCG CCCGACATGA CGGTACACCC CGATCGGTCG CGTTCAACGT GCACGCGTTC
CGGGTGGCGG TCGACACCGA GACCGGCGAG GTGCGGATTC TGCAGTCGAT CCAGGCCGCC
GATGCCGGGG TGGTGATCAA CCCGCAGCAG TGCCGGGGCC AGGTGGAAGG TGGCGTGGCG
CAGGCGATCG GGTCGGCGCT GTTCGAAGAG ATCCTGATCG GCCCTGACGG TGCGGTGATG
ACGAAGGCGT TGCGCGACTA CCACATTCCA CAGGTCGCGG ACGTTCCGGC GACCGAGGTC
TACTTCGCCG ACACTCACGA CGACTTGGGA CCGCTGGGGG CGAAGTCGAT GAGCGAATCT
CCGTACAACC CGGTCGCGCC CGCTCTGGCG AATGCAATCG CCAGAGCGTG CGGGGCCCGG
GTCTGTCAGT TGCCGATGAC CCCGGCACGG GTTTGGCGGG CGGTCAGAGC CTCTCGATGA
 
Protein sequence
MRITVNGTDR TDDPRPGQCL RTFLRDLGHV EVKKGCDAGD CGACSVLVDG AAVHSCVYPA 
FRADGRTVTT VAGLGTPKDL HPMQRRFVEA AGFQCGFCTA GMVTTASALT ADQLEHLPQH
LKGNLCRCTG YRAITDAIAG VVNTEKSGGS DVGRSIGAPA GVRVVTGTEE YTMDDTPVGL
LHMAVLASPV PHARIRSIDT AAAESIPGVR LVLTHRDSPA VRFSTARHES RDDDPDDTVI
LDDTVRFVGQ RVAAVVADSV AVAEAACRAI VVDYEPRPAV FDPDTARRPG APLLHADKGP
ESRIADPSRN LVAELHGEVG DVAEGVRAAE AGGGAVVRGS WRTQRVQHAH LETHGCTGWR
DDAGRLVIRT SSQVPFLVRD ELCHIFGLDT DEVRVFTRRV GGGFGGKQEM LAEDLVALAV
LRLGAPVRYE FSRTDEFTVA PCRHPFRVEV TVAAGRDGRL TALAVDALVD AGAYGNHSPG
VMFHGCGESV AVYRSPNKRV DAEAVYTNNL PSGAFRGYGL GQISFAVESA IDELAARLGI
DPFEFRRRNV VVPGDTFVDS HVLEDDLTFG SYGLDQCLDL AEAALRAGNG ATAPAGWAVG
EGMAVAMIAT IPPRGHFTEV SVSVDADGVY TLDVGTAEFG NGTTTVHVQL AAAELNTVPE
RIVVRQSDTA TTGYDTGAFG SAGTVVAGLA ILAASRELRT ALVTAAAELT GAAPSSCVLG
RNGVQCDARL VDFGSLPTPM NRTARHDGTP RSVAFNVHAF RVAVDTETGE VRILQSIQAA
DAGVVINPQQ CRGQVEGGVA QAIGSALFEE ILIGPDGAVM TKALRDYHIP QVADVPATEV
YFADTHDDLG PLGAKSMSES PYNPVAPALA NAIARACGAR VCQLPMTPAR VWRAVRASR