Gene Mvan_3150 details


Gene Information

Locus tag: Mvan_3150
Symbol:
ID: 4646382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3347518
End bp: 3348996
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 76%
IMG OID: 639806627
Product: amidohydrolase
Protein accession: YP_953958
Protein GI: 120404129
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
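
The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return gene_len_bp // 3 - 1


length = gene_length(3347518, 3348996)
print(length)                  # 1479
print(protein_length(length))  # 492
```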


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0917954
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0611897
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACTGAGC GGCAGGTTCT GCACGGCGGC ACCGTGCTGA CCGGCCCGCA GTGGCGCCCG 
CGGCCCGCCG ACCTTCTCAT CGCCGGTGGC CGCATCGAGG CCGTCGCGGC GCCCGGCAGC
CTCGCCGGCG TCGACGCCGC CACCCACGAC GTGACCGGCC GACTGGTGAT CCCGGGACTG
ATCAACGCGC ACACCCATTC GCACACCGCG CCGGCGCGCG GCGCCGCACG TGCCTGGACG
CTGGAGGACT CGCTGCTCAA CGGCGGCTGG ATGGCCGCGC CCCGCTCCGA GGAACTCACC
GAGCTGGCCG CCCTGCTGAC CGCCACCGAG CTGATCGCCT CCGGGTGCAC CGGCGCCTTC
GATCTCATCG CCCAGGCCGG CGGCCCCGAC CCGGCCGGCT GCCACGCCGC CGCCCGCGGC
TACGCCCGCG CCGGGCTGCG CACCGTGCTG GCGCCGATGG TGGCCGACCG CACGCTGCAC
GAGGCGGTGC CGGCGATCGG GGCATGCTGC GGGGCGCCGT CAACGGGCAA GCCGGGCCCG
TCGACAGCCG ACGTCCTCGC GGCGTGCACG GCCTACGTCC GGAACTTCCC GGCGCTTCAG
GGAGTCATGC CCGCGCTGGC CCCCACGATC CCGGGACACT GCACCCCGGA GCTGACGGTC
GGGTTGGGTC GGCTGGCCGC CGAACACGGG CTGCGGGTGC ACACCCATCT CGCCGAATCC
AAACCGCAGG CACTGGCCGG TGCGTCGCGG TTCGGGCATT CGATCACCCG CGAACTGGCC
CGCCTCGGTG TGCTCGGCGA CCGGCTCACC GTGGCGCACG CCATCTGGGT CGACGACGAG
GACATCAGGA TGCTGGCCGC CTCCGGCGCG GTCGCGGTCA CGGTGCCCGG CAGCAATCTG
CGGCTGGGCT CCGGCATCGC CGACACCCGC GCGATGCTGG CAGCCGGCCT GCGGCTGGCG
GTCGGCACCG ACGGCGCCAA CTCCGCCGAC GCGTTCGACG CGCTCGACGC GGTGCGGCTG
ACCGCGCTGC TGTCGAGGGT CAGTGAGCGG CCGGCCCGCC AGTGGCTGAC CGTCGAGGAG
ACCCTGGACG CCGCCACGGC CGGCGGTGCG GCGGCCTGCG GCTGGACCGA CACCGGTCGG
CTGGCACCCG GCCGGCGCGC CGACTTCGCA CTGCTCGACC TCGGCGCCCG GGCGTTCCGG
CCGCCCACCG ATCTGGCCAA CCAGCTGTTG ACCGCGGCGC GCGCCGCCGA CGTCACCGAT
GTCGTCGTCG GGGGGCGGTT CGTCTACCGC GACCGCGGGT TCCCGCACCT GGACGTCGCG
GCGGCACTGA ACCGGTTCGA CACGCTGGTC GAGGAGTTCC GCGCCCGGGT GGCGCCCGTG
CGTGCCGACG CCGACCGCCA GACCGCCCTG GCCGCGACCG CGCTGGCCGG GCTGCGCCGG
GCGCCGTCTC CGGTGCGACG GCTCATCGGC TGGCGATGA
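
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence length; the record does not say how its figure was derived or rounded. The sequence here is printed in space-separated blocks of ten, so the whitespace is stripped first. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (block separators, line breaks) is removed before counting.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# Usage: paste the full gene sequence into a string and report a percentage.
first_blocks = "ATGACTGAGC GGCAGGTTCT"  # first two blocks of the gene sequence
print(f"{gc_content(first_blocks):.0%}")
```

Running it on the whole gene sequence rather than the two blocks shown gives a figure to compare against the value listed in the record.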
 
Protein sequence
MTERQVLHGG TVLTGPQWRP RPADLLIAGG RIEAVAAPGS LAGVDAATHD VTGRLVIPGL 
INAHTHSHTA PARGAARAWT LEDSLLNGGW MAAPRSEELT ELAALLTATE LIASGCTGAF
DLIAQAGGPD PAGCHAAARG YARAGLRTVL APMVADRTLH EAVPAIGACC GAPSTGKPGP
STADVLAACT AYVRNFPALQ GVMPALAPTI PGHCTPELTV GLGRLAAEHG LRVHTHLAES
KPQALAGASR FGHSITRELA RLGVLGDRLT VAHAIWVDDE DIRMLAASGA VAVTVPGSNL
RLGSGIADTR AMLAAGLRLA VGTDGANSAD AFDALDAVRL TALLSRVSER PARQWLTVEE
TLDAATAGGA AACGWTDTGR LAPGRRADFA LLDLGARAFR PPTDLANQLL TAARAADVTD
VVVGGRFVYR DRGFPHLDVA AALNRFDTLV EEFRARVAPV RADADRQTAL AATALAGLRR
APSPVRRLIG WR
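
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table from the amino acid string that NCBI lists for translation table 11, which assigns the same amino acids as the standard code. It is an illustration, not the pipeline that produced this record, and `translate` is a hypothetical helper. It translates every codon literally, so it does not handle alternative start codons (which table 11 permits and which are read as methionine at the start of a CDS).

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acids of NCBI table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence until the first stop codon.

    Whitespace is stripped; a trailing partial codon is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGACTGAGC GGCAGGTTCT GCACGGCGGC"))  # MTERQVLHGG
```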