Gene Information |
Locus tag | Mmcs_0491 |
Symbol | |
ID | 4109337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 546301 |
End bp | 548691 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638029617 |
Product | carbon monoxide dehydrogenase, large subunit apoprotein |
Protein accession | YP_637668 |
Protein GI | 108797471 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
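The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 546301
END_BP = 548691
GENE_LENGTH_BP = 2391
PROTEIN_LENGTH_AA = 796


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


# Both checks pass for the values listed in the table.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 548691 - 546301 + 1 gives 2391 bp, and 2391 / 3 gives 797 codons, one of which is the stop codon, leaving 796 aa.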
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.391536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCGG TGGACGAACG CCCGGACGCG CAGGCGGACA ACGACAGGAA ACCGTGCGGT TACGGCCGGA TGCTGCGCAA GGAGGATCCG CGCTTCGTCC GCGGCCGGGG CAACTACGTC GACGACGTCA CGCTGCCCGG CATGCTGCAC CTGGCGATCC TGCGCTCACC GTATGCCCAT GCGCGGATCG CGAACATCGA CATCTCGGCC GCGGAGGCGC ATCCGAAGGT CAAGGCGGTC GTCACCGGCG CCGACCTGGC GGAGAAGGGC CTGGCCTGGA TGCCGACCCT GTCCAACGAC GTGCAGGCCG TCCTCGCCAC CGACAAGGTG CGATTCCAGG GGCAGGAGGT GGCATTCGTC GTCGCCGAGG ACCGGTATTC GGCTCGTGAC GGTGTCGAGT TGATCGACGT CGACTATGAG CCTCTCGACC CGGTCATCGA CGTCCGCCGG GCACTGGATC CCGACGCGGC GGTGATCCGC ACCGACCTCG ACGGTAAGAC CGACAACCAC TGCTTCGACT GGGAGACCGG CGACGCCGCC GCCACCGACG CGGCGTTCGC CAAGGCCGAC GTCGTGGTCA AACAGGAGAT GGTCTATCCG CGTGTGCATC CCGCGCCGAT GGAGACCTGC GGCGCGGTCG CCGACCTCGA CCCGGTCAGC GGCAAGCTCA CCCTGTGGTC GACCACGCAG GCGCCGCACG CACACCGCAC CCTCTACGCC CTGGTGGCCG GGCTGCCCGA ACACAAGATC CGGGTCATCT CACCGGACAT CGGCGGCGGC TTCGGCAACA AGGTGCCGAT CTATCCGGGA TACGTCTGCG CCATCGTGGG TTCGCTGCTG CTGGGTAAGC CGGTCAAGTG GATGGAGGAC CGCAGCGAGA ACCTGACCTC GACCGGATTC GCCCGGGACT ACATCATGGT CGGCGAGATC GCCGCCACGT CCGACGGCAA GATCCTGGCC GTCCGCTCCA ACGTGCTGGC CGACCACGGT GCATTCAACG GCGTTGCGGC GCCGACGAAG TACCCCGCGG GCTTCTTCGG GGTGTTCACC GGCAGCTACG ACCTCGAGGC CGCGTACTGC CACATGACCG CGGTCTACAC CAACAAGGCG CCCGGCGGCG TGGCCTACGC GTGTTCGTTC CGGATCACCG AGGCGGTGTA CTTCGTCGAG CGGTTGGTGG ACTGTCTGGC CTACGAGATG AAGATGGACC CGGCCGACCT GCGGCTGCGG AACCTGCTGA AACCCGACCA GTTCCCGTAC CACTCGAAAA CCGGCTGGAC CTACGACTCC GGCGACTACG AGACCACCAT GCGCAAGGCC ATGGACATGA TCGGCTACGA GCAGTTGCGC GCCGAGCAGG CCGCCAAGCG GGAGCGCGGT GAGCTGATGG GGATCGGCAT GGCGTTCTTC ACCGAGGCCG TCGGCGCCGG ACCGCGCAAG GACATGGACA TTCTGGGACT CGGCATGGCC GACGGCTGTG AACTGCGGGT GCATCCCACC GGCAAGGCCG TGGTCCGGTT GTCGGTGCAG ACCCAGGGGC AGGGCCACGA AACCACCTTC GCGCAGATCG TCGCCGAGGA GTTGGGGATC CCACCCGACG ACATCGACGT GGTGCACGGC GACACCGACC AGACGCCGTT CGGGCTGGGC ACCTACGGCA GCCGTTCGAC CCCCGTATCC GGTGCCGCGG CCGCGTTGGT CGCGCGCAAG GTGCGGGACA AGGCCAAGAT CATCGCCTCG GGGATGTTGG AGGCGTCAGT CGCCGACCTC GAATGGGACA AGGGCGCCTT CCACGTCAAG 
GGTGACCCGG CGGCGTCGGT CACCATCCAG GACATCGCGA TGCGCGCGCA CGGCGCCGGT GACCTCCCGG AGGGCATCGA GGGCGGCCTG GACGCCCAGA TCTGTTACAA CCCCGAGAAT CTCACCTATC CGTACGGCGC CTACTTCTGC GTGGTCGACG TCGATCCGGG CACCGCGGTG GTCAAGGTCC GGCGGTTCCT CGCGGTCGAC GACTGTGGCA CGCGGATCAA CCCGATGATC ATCGAGGGTC AGGTGCACGG CGGCATCGTC GACGGGATCG GGATGGCGCT GATGGAGATG ATCGCGTTCG ACGAGGAAGG CAACTGCCTC GGCGGGTCGC TGATGGACTA TCTGATCCCC ACCGCGCTCG AGGTGCCGGA GCTGGAGACC GGTCATACCG TCACGCCGTC CCCGCACCAC CCGATCGGCG CCAAGGGCAT CGGGGAGTCC GCCACCGTCG GCTCACCCCC CGCGGTGGTG AACGCCGTCG TCGATGCGTT GGCGCCGTTC GGAATCCGGC ACGCCGACTT GCCGCTGACC CCCTCCCGGG TGTGGGAGGC CATGCAGGGC CGGCCCACCC CGCCGATCTG A
|
Protein sequence | MTAVDERPDA QADNDRKPCG YGRMLRKEDP RFVRGRGNYV DDVTLPGMLH LAILRSPYAH ARIANIDISA AEAHPKVKAV VTGADLAEKG LAWMPTLSND VQAVLATDKV RFQGQEVAFV VAEDRYSARD GVELIDVDYE PLDPVIDVRR ALDPDAAVIR TDLDGKTDNH CFDWETGDAA ATDAAFAKAD VVVKQEMVYP RVHPAPMETC GAVADLDPVS GKLTLWSTTQ APHAHRTLYA LVAGLPEHKI RVISPDIGGG FGNKVPIYPG YVCAIVGSLL LGKPVKWMED RSENLTSTGF ARDYIMVGEI AATSDGKILA VRSNVLADHG AFNGVAAPTK YPAGFFGVFT GSYDLEAAYC HMTAVYTNKA PGGVAYACSF RITEAVYFVE RLVDCLAYEM KMDPADLRLR NLLKPDQFPY HSKTGWTYDS GDYETTMRKA MDMIGYEQLR AEQAAKRERG ELMGIGMAFF TEAVGAGPRK DMDILGLGMA DGCELRVHPT GKAVVRLSVQ TQGQGHETTF AQIVAEELGI PPDDIDVVHG DTDQTPFGLG TYGSRSTPVS GAAAALVARK VRDKAKIIAS GMLEASVADL EWDKGAFHVK GDPAASVTIQ DIAMRAHGAG DLPEGIEGGL DAQICYNPEN LTYPYGAYFC VVDVDPGTAV VKVRRFLAVD DCGTRINPMI IEGQVHGGIV DGIGMALMEM IAFDEEGNCL GGSLMDYLIP TALEVPELET GHTVTPSPHH PIGAKGIGES ATVGSPPAVV NAVVDALAPF GIRHADLPLT PSRVWEAMQG RPTPPI
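The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method the database used: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG), it ignores ambiguous bases, and the helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino acid assignments
# are those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that separate the 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last 21 bases of the gene sequence listed above.
head = "ATGACCGCGG TGGACGAACG CCCGGACGCG"
tail = "CGGCCCACCC CGCCGATCTG A"

print(translate(head))  # MTAVDERPDA, the first block of the protein sequence
print(translate(tail))  # RPTPPI, the last residues before the TGA stop codon
print(round(gc_content(head), 2))
```

Passing the full gene sequence to `gc_content` and `translate` in the same way would allow the GC content and protein sequence fields to be compared against the listed values.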