Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3147 |
Symbol | |
ID | 4646379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 3343538 |
End bp | 3344962 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639806624 |
Product | amidohydrolase |
Protein accession | YP_953955 |
Protein GI | 120404126 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0171456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0352292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGAAT TCCTGGTCCG CGGCCGCTAT GTGCTGTCCA TGGCCGGCCT GCCCGCACCG GCCGAGCGCG TCCGCACCCC CGGCGCGCAT CTGGACGGCG TACTCACCGA CGCGGCCGTG CATGTGCGCG ACGGCGCGAT CGTCGCGGTC GACGGCTACG CCGCACTGGT CGCCGCGCAG GCGCACCTGC CGGTGCACGG CGACGGCACA GGTCTGGTGA TCCCGGGCCT GATCTCCACC CACACCCACC TGTCCGAGTC GCTGGCCACC GGCATGGGTT CGGAGCTGTC CCTGTTCGAG TGGGCCGACG CGATCGTCGC GCCGCTGGGC ATGGTGCTGA CCCGCGAGGA CGCCGCCGAG GGCACCGCGC TGCGCGCGAT CGAGATGCTG CTGTCGGGGG TGACCACCGT CAACGACATG TTCTGCCACA CCAACATCGG CTCGCGGGCC AGCCTCGGTG TGGTCGACGG CCTCACCCGC GCCGGGATGC GCGGCGTCGT CGCCTACGGC GCCGAGGATC TGCCGCTGCT CGAGCGCAGC ACCCTCGCGC CGGGCGACGT CATCGACGAC GTGCTCGCCG AACAGCACGA CCTGGCCGCG CACGCGGCGA CCGCGCCGCT GCTGGACTTC CGCTACGGCG TCGGCACGCT GCTCGGGCAG AGCGACGAGC TGCTCGCCGC CGGGGTCGAG GAGTGCCGCC GCGCCGGCTG GGGTGTGCAC ACCCATCTGG CCGAGGTGCG CGAGGAGGTC ACCACCGCAC GGCACCGGTG GGGGCACCGC ACGGTGGAGC ATTCCTTGCG GGCCGGGCTG TTCGAGCGCC CGCTCATCGC CGGTCACGGC GTGTGGCTCA CCGAGGCCGA CATCGCGACG TTCGCCCGGC ACGGCGCCGC GATCGCCCAC AATCCGGTCG CCAACATGAT CCTGGCCTCC GGGGTGTGCC CGGTGCCGCG GCTGCGCGCG GCCGGGGTGC CCGTCGGCAT CGGCACCGAC GGCGCGGCCT CCAACGACAG CCAGGACATG CTGCAGGCGG TCAAGGCGGC GGCGCTGCTG CAGAAGGTGC ACCACCTCGA CGCGCTGGTG GTCGACGCGC TCGACGTGCT GACGATGGCG ACCATCGACG GCGCGCGGGC GCTGGGCCTG GACCACCTGG TCGGATCGCT GGAGCCCGGC AAACGCGCCG ACATCGTGCT GCTGCAGGAC ACCGTCGACG TCGCGGTGCT GCACGATCCG GTGGCCCAGC TGGTGTACGG CGCGTCGCCG CGGTCGGTGC GCGACGTGTG GGTGGACGGC GTGCAGGTGG TGGCCGATCA CCGGTGCACG ACCGTCGACG AGGCCACCCA GATCGCCCGC TGCCGCCCGC TGGCCGACCG GGTCGGGGTG AAGGCGGGCC TGGTCGCCAC CGGCCACTCC GTGGTGACCG GGTGA
|
Protein sequence | MTEFLVRGRY VLSMAGLPAP AERVRTPGAH LDGVLTDAAV HVRDGAIVAV DGYAALVAAQ AHLPVHGDGT GLVIPGLIST HTHLSESLAT GMGSELSLFE WADAIVAPLG MVLTREDAAE GTALRAIEML LSGVTTVNDM FCHTNIGSRA SLGVVDGLTR AGMRGVVAYG AEDLPLLERS TLAPGDVIDD VLAEQHDLAA HAATAPLLDF RYGVGTLLGQ SDELLAAGVE ECRRAGWGVH THLAEVREEV TTARHRWGHR TVEHSLRAGL FERPLIAGHG VWLTEADIAT FARHGAAIAH NPVANMILAS GVCPVPRLRA AGVPVGIGTD GAASNDSQDM LQAVKAAALL QKVHHLDALV VDALDVLTMA TIDGARALGL DHLVGSLEPG KRADIVLLQD TVDVAVLHDP VAQLVYGASP RSVRDVWVDG VQVVADHRCT TVDEATQIAR CRPLADRVGV KAGLVATGHS VVTG
|
| |