Gene Mvan_3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3147 
Symbol 
ID4646379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3343538 
End bp3344962 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content74% 
IMG OID639806624 
Productamidohydrolase 
Protein accessionYP_953955 
Protein GI120404126 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0171456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0352292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGAAT TCCTGGTCCG CGGCCGCTAT GTGCTGTCCA TGGCCGGCCT GCCCGCACCG 
GCCGAGCGCG TCCGCACCCC CGGCGCGCAT CTGGACGGCG TACTCACCGA CGCGGCCGTG
CATGTGCGCG ACGGCGCGAT CGTCGCGGTC GACGGCTACG CCGCACTGGT CGCCGCGCAG
GCGCACCTGC CGGTGCACGG CGACGGCACA GGTCTGGTGA TCCCGGGCCT GATCTCCACC
CACACCCACC TGTCCGAGTC GCTGGCCACC GGCATGGGTT CGGAGCTGTC CCTGTTCGAG
TGGGCCGACG CGATCGTCGC GCCGCTGGGC ATGGTGCTGA CCCGCGAGGA CGCCGCCGAG
GGCACCGCGC TGCGCGCGAT CGAGATGCTG CTGTCGGGGG TGACCACCGT CAACGACATG
TTCTGCCACA CCAACATCGG CTCGCGGGCC AGCCTCGGTG TGGTCGACGG CCTCACCCGC
GCCGGGATGC GCGGCGTCGT CGCCTACGGC GCCGAGGATC TGCCGCTGCT CGAGCGCAGC
ACCCTCGCGC CGGGCGACGT CATCGACGAC GTGCTCGCCG AACAGCACGA CCTGGCCGCG
CACGCGGCGA CCGCGCCGCT GCTGGACTTC CGCTACGGCG TCGGCACGCT GCTCGGGCAG
AGCGACGAGC TGCTCGCCGC CGGGGTCGAG GAGTGCCGCC GCGCCGGCTG GGGTGTGCAC
ACCCATCTGG CCGAGGTGCG CGAGGAGGTC ACCACCGCAC GGCACCGGTG GGGGCACCGC
ACGGTGGAGC ATTCCTTGCG GGCCGGGCTG TTCGAGCGCC CGCTCATCGC CGGTCACGGC
GTGTGGCTCA CCGAGGCCGA CATCGCGACG TTCGCCCGGC ACGGCGCCGC GATCGCCCAC
AATCCGGTCG CCAACATGAT CCTGGCCTCC GGGGTGTGCC CGGTGCCGCG GCTGCGCGCG
GCCGGGGTGC CCGTCGGCAT CGGCACCGAC GGCGCGGCCT CCAACGACAG CCAGGACATG
CTGCAGGCGG TCAAGGCGGC GGCGCTGCTG CAGAAGGTGC ACCACCTCGA CGCGCTGGTG
GTCGACGCGC TCGACGTGCT GACGATGGCG ACCATCGACG GCGCGCGGGC GCTGGGCCTG
GACCACCTGG TCGGATCGCT GGAGCCCGGC AAACGCGCCG ACATCGTGCT GCTGCAGGAC
ACCGTCGACG TCGCGGTGCT GCACGATCCG GTGGCCCAGC TGGTGTACGG CGCGTCGCCG
CGGTCGGTGC GCGACGTGTG GGTGGACGGC GTGCAGGTGG TGGCCGATCA CCGGTGCACG
ACCGTCGACG AGGCCACCCA GATCGCCCGC TGCCGCCCGC TGGCCGACCG GGTCGGGGTG
AAGGCGGGCC TGGTCGCCAC CGGCCACTCC GTGGTGACCG GGTGA
 
Protein sequence
MTEFLVRGRY VLSMAGLPAP AERVRTPGAH LDGVLTDAAV HVRDGAIVAV DGYAALVAAQ 
AHLPVHGDGT GLVIPGLIST HTHLSESLAT GMGSELSLFE WADAIVAPLG MVLTREDAAE
GTALRAIEML LSGVTTVNDM FCHTNIGSRA SLGVVDGLTR AGMRGVVAYG AEDLPLLERS
TLAPGDVIDD VLAEQHDLAA HAATAPLLDF RYGVGTLLGQ SDELLAAGVE ECRRAGWGVH
THLAEVREEV TTARHRWGHR TVEHSLRAGL FERPLIAGHG VWLTEADIAT FARHGAAIAH
NPVANMILAS GVCPVPRLRA AGVPVGIGTD GAASNDSQDM LQAVKAAALL QKVHHLDALV
VDALDVLTMA TIDGARALGL DHLVGSLEPG KRADIVLLQD TVDVAVLHDP VAQLVYGASP
RSVRDVWVDG VQVVADHRCT TVDEATQIAR CRPLADRVGV KAGLVATGHS VVTG