Gene Mvan_5183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5183 
Symbol 
ID4645700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5547962 
End bp5550334 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content68% 
IMG OID639808658 
Productcarbon-monoxide dehydrogenase (acceptor) 
Protein accessionYP_955960 
Protein GI120406131 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.395354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGACA CCGCCACGAA CGCCGTCACC GCACGCTACG CCGGACAACG CGTGCCGCGT 
GTCGAAGACA GCCGGCTTCT GACTGGTCAC GGCCGTTTCG TCGATGACAT CAGCCGCCCG
GGGATGTTGC ACGCCTGCTT CGTCCGCAGC CCGTTCGCCC ACGCCACGAT CAACGGCGTC
GACGCGTCCG CGGCGTTGGC GCTACCCGGC GTGCACGCCG TCTTCACCGC CGCCGATCTG
AACCCCGACG TGAAGGAGGC CTGGCACGCC GTCGCGGGCA AGGACGTCCC GGACACCCCG
CGGCCACCGC TGGCCGAGGG CGAGGTGAAG TTCGTCGGCG ACCCGGTCGC GCTCGTCGTC
GCCGACAGCC GCTACCTCGC CGAGGACGCC GTCGACCTGG TCGAAGTCGA CTACGATCCG
CTGCCCGCGG TGGCCGACTT CCGCAAGGCT GTCGGCGGCG CGGAAGCCGG GGTTCCCGTC
GTGCACGCGG CCTACCCCGA CAACGTCGCC GGCGGGATGG GCGGCATGCC ACCGGACGAG
GAGATTTTCG CCACGGCGGC TCACGTTGTC GAAGAGCACA TCTACCAGCA GATGTACGTG
CCGGTGCCGA TGGAGACCCG CGGCATGGTG GTGGAGTGGA CGTCGACGAC CAACGAGTTG
ACGGTCTGGG CGTCGACGCA GACGCCGCAC GAGCTGCGCG CGTTCGCCGC CCGACTGCTC
GGCATCCCGG CCCAAGGGGT GCGGGTCATC ATGCGCGACA CCGGCGGCGC GTTCGGCCAG
AAGGTGGTGC CGATGCGCGA GGACATGTGC ATCCTGCTGG CCGCCCGCAA GATGCCCACC
GCGACGTCGA ACGCGCAATC GGGCGTGGCA GTGAAGTGGA TCGAGGACCG TCGGGAGAAC
CTGATGTCGG CCGGGCAGTC CCGCCATGTC GACGGCAAGG TGCGGATGGC GTTCGACTCC
GACGGCAAGA TCCTGGCCGC CGACATCGAC TTCGTCCAGG ACGTCGGCTC TTACCCGACC
CCGTACCCGG TGCTGACCAC CGCGGCCATC GGCATGTTCT TCCCCGGGCC CTACCGGGTG
CCCAAGGCCA GCTTCAATTA CAAGACGGTG TTCTCCAACA CCCCTGGCCT GCACGCCTAC
CGCGGGCCCT GGCAGTACGA AACCCTCACC AGGGAAATGC TTCTCGACAG TGCCGCACGC
AAGATCGGGA TGGACCCGGT CGAGCTGCGA CGCATCAACA TTCTGCGCGG CGACGAGATG
CCGTTCTTCA ACCCCAACGG CATGCCCTAC GACAACTGCG CCCCCGCCGA CACGTTCGAG
CAGGCGGTGA AGATCCTCGA CCACGAGGGC TTCCGCAAGG AGCAGGCCGA CGCGCTTGCC
GAGGGACGCT ACATCGGGCT CGGCTTCTCG GCGTACATCG AGCCAACCGG TGCGGCGACC
GGCAACCTCG CGACCGAGGG TGCCACCATC CGGATGGAGT CCACCGGCAA GATCAACGTG
TATGTCAACG GCGGTTCGGC GGGCAACAGC ATCGAGACCA CCGTCGTGCA GCTCACCGCG
GATGCGTTGG GCGCCAACAT CGAAGACGTC GCCACCATCC AGGGCGACAC CGCGGTGACG
CCCTACGGCG CGGGCACCCA GGGCAGCCGC AGCGCGCCGA TGACGGCGGG CGCGGTGAAC
GAGGCCGGCG CGATTCTGCG CAGGCAGATC ATCGCGATCG GGGCGCAGAT CCTCGGGGTG
GAGGAGTCCG AGATCGAACT GGCGAATTCC AGGGCCGGCG TGCGCAACGA CCCCGAGAGG
AGCGTCAGCT TCGCCGACAT CGCCTACCGC TCGTACTACG ACCCGGCTCA GCTCGGCGGC
GTCTCCCCCA CCCTGGAAGC CACGGCGCGC TTCAACTCAC AGGCAATGAT CCACTGGGCC
AACGCGACTC ACGTCTGCAC CTGTGAGGTC GACGTCGAGA CCGGTCAGGT GACGCTGACC
CGGTACATCG TCAGCGAGGA CGTCGGCCCG ATGATCAACC CGAACATCGT CGAAGGCCAG
GTCGCCGGCG GCACCGTCCA GGGCATCGGC GGCGCGCTGC TGGAGAAGCT CGCCTACGAC
GACGCGGGCA ACCCGGTCGC CTCGACGTTC GTCGACTACC TGCTGCCGAC GGCCACCGAG
GTGCCGCCGA TCGAGTTCGG GCACGTCGAG ATCCCCGGAC CCGGAGTCGG CGGCTACAAA
GGCGCAGGCG AAGGCGGCGC GATCGGCTCG CCACCGGCCG TCATCAACGC GATCAACGAC
GCGCTGGCAC CGCTGGGCGT CACACTGACT CAACTACCCG CCACGCCGGC CACGATCGTC
GAGCTCATCG AGCGCGCAGG AAAGGACCAC TGA
 
Protein sequence
MTDTATNAVT ARYAGQRVPR VEDSRLLTGH GRFVDDISRP GMLHACFVRS PFAHATINGV 
DASAALALPG VHAVFTAADL NPDVKEAWHA VAGKDVPDTP RPPLAEGEVK FVGDPVALVV
ADSRYLAEDA VDLVEVDYDP LPAVADFRKA VGGAEAGVPV VHAAYPDNVA GGMGGMPPDE
EIFATAAHVV EEHIYQQMYV PVPMETRGMV VEWTSTTNEL TVWASTQTPH ELRAFAARLL
GIPAQGVRVI MRDTGGAFGQ KVVPMREDMC ILLAARKMPT ATSNAQSGVA VKWIEDRREN
LMSAGQSRHV DGKVRMAFDS DGKILAADID FVQDVGSYPT PYPVLTTAAI GMFFPGPYRV
PKASFNYKTV FSNTPGLHAY RGPWQYETLT REMLLDSAAR KIGMDPVELR RINILRGDEM
PFFNPNGMPY DNCAPADTFE QAVKILDHEG FRKEQADALA EGRYIGLGFS AYIEPTGAAT
GNLATEGATI RMESTGKINV YVNGGSAGNS IETTVVQLTA DALGANIEDV ATIQGDTAVT
PYGAGTQGSR SAPMTAGAVN EAGAILRRQI IAIGAQILGV EESEIELANS RAGVRNDPER
SVSFADIAYR SYYDPAQLGG VSPTLEATAR FNSQAMIHWA NATHVCTCEV DVETGQVTLT
RYIVSEDVGP MINPNIVEGQ VAGGTVQGIG GALLEKLAYD DAGNPVASTF VDYLLPTATE
VPPIEFGHVE IPGPGVGGYK GAGEGGAIGS PPAVINAIND ALAPLGVTLT QLPATPATIV
ELIERAGKDH