Gene Mvan_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1994 
Symbol 
ID4647913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2128756 
End bp2130003 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content64% 
IMG OID639805480 
Productamidohydrolase 2 
Protein accessionYP_952818 
Protein GI120402989 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.580851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCCA CACAGCAGGA AGCGCAGCGG ACCGCGCCTG CGCCCAAGCC CCTCGGCTAC 
CGGGCGATCG ACGTCGACAA TCACTACTAC GAACCGATCG ACTCGTTCAC CCGCTACCTG
CCGAAAGAAT TCAGCCGCCG CGGCGTGCAG ATGTACCAGG ACGGCAAGCG CACCTGGGCG
GTGATGGGCG GGCGGGTCAA CACCTTCATC CCCAATCCCA CCTTCGATCC GATCATCGAA
CCGGGCTGCC TGGACCTGTT GTTCCGCGGC GAGATCCCCG AAGGTGTCGA CCCTGCTTCG
TTGATGAAGG TCGACCGGAT CTCGGACCAT CCCGAGTACC GGAACCGCGA CGCGCGCGTG
AAGATCCTCG ACAAGCAGAA CCTGGAAACC GTGTTCATGT TGCCGACGTT CGCGTGCGGT
GTTGAAGAGG CACTCAAACA CGACGTGGAA GCCACCATGG TGTCGGTGCA CGCCTTCAAC
CTGTGGCTCG ACGAGGACTG GGGATTCAAT CGACCCGATC ATCGGATCCT GGCGGCACCG
ATCATCTCGC TGGCCGACCC CGATAAGGCC GTCGAGGAGG CCGAATTCGT GCTCAGCCGC
GGCGCGAAGG TGGTGTGTGT CCGTCCCGCG CCAGTGCCGG GGGTGGTCAA GCCCCGCTCA
CTCGGCGACC CGGTGCATGA TCCGGTATGG GCACGGCTGG CTGAAGCCGG TGTGCCGGTG
GTCTTCCACC TGTCCGACAG CGGCTACATG GCGATCCCGG CGCTGTGGGG TGGGAAGGAC
ACCTTCGAGG GCTTCGGCAA GCGCGATCCG CTGGACATGG TGGTCATGGA CGACCGCGCC
ATCCACGATT CGATCGCGTC GATGATCGTG CACCAGGTCT TCACCCGGCA CCCCACGCTC
AAGGTGGCCA GCATCGAGAA CGGTTCGTAC TTCGTGTACC GGCTGATCAA GCGGCTCAAG
AAGTCCGCCA ACAACGCGCC GTACCACTAC AAGGAGGACC CGGTCGAGCA GCTGCGCAAC
AACGTCTGGA TCGCGCCGTA CTACGAGGAC GACGTGAAAC TGCTCGCCGA CACCATCGGG
GTGGACAAGA TCCTGTTCGG CTCGGACTGG CCGCACGGTG AAGGGCTGGC CGACCCCACG
TCGTTCACCG CCGACATCCC GCAGTTCCCC GAGTTCAGCC ACGAGGACAC CCGCAAGGTG
ATGCGCGACA ACGCTCTTAC CCTCGTCGGC ACACACGTGT CGGCCTGA
 
Protein sequence
MTATQQEAQR TAPAPKPLGY RAIDVDNHYY EPIDSFTRYL PKEFSRRGVQ MYQDGKRTWA 
VMGGRVNTFI PNPTFDPIIE PGCLDLLFRG EIPEGVDPAS LMKVDRISDH PEYRNRDARV
KILDKQNLET VFMLPTFACG VEEALKHDVE ATMVSVHAFN LWLDEDWGFN RPDHRILAAP
IISLADPDKA VEEAEFVLSR GAKVVCVRPA PVPGVVKPRS LGDPVHDPVW ARLAEAGVPV
VFHLSDSGYM AIPALWGGKD TFEGFGKRDP LDMVVMDDRA IHDSIASMIV HQVFTRHPTL
KVASIENGSY FVYRLIKRLK KSANNAPYHY KEDPVEQLRN NVWIAPYYED DVKLLADTIG
VDKILFGSDW PHGEGLADPT SFTADIPQFP EFSHEDTRKV MRDNALTLVG THVSA