Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4442 |
Symbol | |
ID | 4649058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 4771528 |
End bp | 4773690 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639807913 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_955224 |
Protein GI | 120405395 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.510451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGGA CCGTGCACAC GTTCTGCCGG TACTGCCTGG CCGCGTGTGG TGTCGAGGTG ACGGTCGAGG GCAACCGCGT GGTGAAGATC TCGGCCGACA AGCAGAACCC GCACAGCTGG CAGGACTTCT GCGCGAAGGG TCGCACCGCG CACCAGGTGG TCGAGCATCC CCGCCGGATC GTCGCCCCGA TGAAGCGGGT GGGCGACCGG TATGTCGAGA CGAGTTGGGA CGAGGCGATC GACGACATCG CCGCGCGCCT GAACGCCGCC ATCGATGCCG ACGGACCGGA TGCCGTCGGC GCGTATTACG GCAACCCGGC AGGCTACTCG TCGTCGAACC TGATGTTCAT GAACGGCTGG CTCGACGCCG TCGGGACCTT CAACCGGTAC GCGGTGGGGT CGGTCGACCA GAACGCCATG CACGTCGTCG CGGAGAAAAT GTACGGGTCC GCGTTGCTGG CCCCGGTCTC CGATGTGGAC AACTGTGACT TCTTCCTGTT GATAGGTGCG AATCCGGCTG TCAGCGCCTG GAACTGGCTG GAATCGGTGC CCGGCGGGTG GCGCCGGGCC CTGGCCCGAC AGCAGCAGGG CGCCACGCTC GTCGTCGTCG ACCCGTTGCG CACCGAGTCC GCCGGGAAGG CCGATGTGCA CGTGGCGGTG CGGCCGGGAC AGGACTGGGC GCTGCTGCTC GCCATGGTCA AGGTCATCCT GGACGGGGGT CTGGAACATC GCGGCGACTG CAGCGACCTG GCAGTCGGCG TCGACAGGTT GCGCACGCTG GTCGCCGAGG CCGACCTCGA CGATCTCGCC ATGCGCTGCG ATGTCGCGCG CACCCAGATC GTGGACGTGG CAACGGCGTT CGCGACATCG CCACGGGCGA TGGCGGTGAC GCGGACCGGG GTTTCGCTAC ATCTCGCCGG TACGGTCGCC GAATGGCTCG GCCATGTCCT CAACGTCATC ACCGGCCGGA TGGACCGTCC CGGCGGACGA CGGTTCGAGC CGGGTTACGT CGACGCGCTA CGACTCGCCG AACTGGCCGG TACGAAACCC CACACCAGCC GGCTGCGCGG ACGGCCGCTG GTGGCAGGCG CGCACGCGTT GAGCGAGCTG CCCGACGAGA TCACCACCCC CGGCCGCGGG CAGATCCGCG CGCTGATCGT CAACAGCGGT AATCCGGTGA TCTCCGGACC CGACGGCGCC AAGCTGGATC ACGCGCTGTC GCAACTGGAT CTGCTGGTGG CGATCGATTT CGTGCAGCGC GAGAGCCACC GCCACGCACA CTGGCTGCTG CCCGCCGTGC ACTGGTTGGA GCGTGACGAC CTGCTCGCGT TCACCAGCAG TCTTCACGAC GAGCCGTACC TGCACTACGG TGTCCGCGCG GTCGACCCGC CCGCACAGGC CAGGCAGGAG TGGCGCATCT TCGTCGACCT CGCGTTGGCG ATGAACCGGC CGCTGTTCGG GGTGCCCGGG CTCAACCGGT TTGTCCGCGC CAGCAGGCGG CTGGCGCGCC TGACGCGCAG GCCCGGCCTC GAGTTCGGGC CACGGTGGGT CGACCGGTTG ATTGTGGCTA CAGGCCGAAA GGTCAACGGA CGCAGGATCA GATGGCGTGA CGTCCTGGGG CATCCGCACG GCTGGGTGCT CGGGCCGCGG GAGTTCGGGC ACTTCCGCGA GGCGTTGCGC ACTCCCGACA AGTTGGTGCA CGTCGCGCCG CCCGAGTTCC TGGCACGCGC GCGCGAGTTG CTCGCCGAAC CGGTGCCGCA GCCACCGTCG GGTTACCCGT TCCAGCTCGC CAACCGCCGG AACCGGCACT CGATGAACTC CTGGCTCAAC GACCTGCCCG GGCTGCATCC GTCGGGCAAG GGCAGCGAGG TCGTCATCCA TCCCGACGAC GCGGCGCGAC TCGGCATCCG GGACGGCGAC CGAGTCAGGG TCTCCTCCCC CGTCGGCGCG GTCGAACTCG GCGCCTCGGT CAGCGACCAG CCGCGCCCCG GTGTGGTGGT CATCGACCAC GGGTGGGGAT CAAGGGTTTT CGACCCGCGC GGCGGAGCGG ACCCGCAGGG CTACGGTGTC AACCGCAACC TGCTCGTCGA CGGCGACCCG GTGGACCCGC TGTCCCAGAC CGCGGCGCTG AACTCGAGCT ACGTGGCGGT CACCCGGGCC TAG
|
Protein sequence | MSRTVHTFCR YCLAACGVEV TVEGNRVVKI SADKQNPHSW QDFCAKGRTA HQVVEHPRRI VAPMKRVGDR YVETSWDEAI DDIAARLNAA IDADGPDAVG AYYGNPAGYS SSNLMFMNGW LDAVGTFNRY AVGSVDQNAM HVVAEKMYGS ALLAPVSDVD NCDFFLLIGA NPAVSAWNWL ESVPGGWRRA LARQQQGATL VVVDPLRTES AGKADVHVAV RPGQDWALLL AMVKVILDGG LEHRGDCSDL AVGVDRLRTL VAEADLDDLA MRCDVARTQI VDVATAFATS PRAMAVTRTG VSLHLAGTVA EWLGHVLNVI TGRMDRPGGR RFEPGYVDAL RLAELAGTKP HTSRLRGRPL VAGAHALSEL PDEITTPGRG QIRALIVNSG NPVISGPDGA KLDHALSQLD LLVAIDFVQR ESHRHAHWLL PAVHWLERDD LLAFTSSLHD EPYLHYGVRA VDPPAQARQE WRIFVDLALA MNRPLFGVPG LNRFVRASRR LARLTRRPGL EFGPRWVDRL IVATGRKVNG RRIRWRDVLG HPHGWVLGPR EFGHFREALR TPDKLVHVAP PEFLARAREL LAEPVPQPPS GYPFQLANRR NRHSMNSWLN DLPGLHPSGK GSEVVIHPDD AARLGIRDGD RVRVSSPVGA VELGASVSDQ PRPGVVVIDH GWGSRVFDPR GGADPQGYGV NRNLLVDGDP VDPLSQTAAL NSSYVAVTRA
|
| |