Gene Mvan_3331 details


Gene Information

Locus tag: Mvan_3331
Symbol:
ID: 4644528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3545232
End bp: 3546542
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 70%
IMG OID: 639806808
Product: putative esterase
Protein accession: YP_954134
Protein GI: 120404305
COG category: [R] General function prediction only
COG ID: [COG0627] Predicted esterase
TIGRFAM ID:
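
The coordinate and length fields above are related by simple arithmetic, and a minimal sketch can check that they agree. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(3545232, 3546542)
print(length_bp)                  # 1311
print(protein_length(length_bp))  # 436
```

Both results match the Gene Length and Protein Length fields of this record.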


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0826062
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.299687
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGGGT GGTTGCCTGT CGCGGTGCAG ATCCTGGCCG TCGTCGCGGT GGTCACCGCC 
ATCGGTCGGC GGACCCGGCG CTGGCGGCTG GTGTGGGTGC CGTGGTCGGC GCTGATCGGC
GTGGTGCTCG CCGTGGCGGC CTACTGCTAC ATCGCGTCGG CCGGTGTCGC AGACGACTCG
AACCCGGCAC CGCACACACT CTGGGTCTGG ATCGCGTTGT CCGGGTTTGC TTTCGGTGTC
CTGGTGTCAG GCTGGCGCGG TGCCCGGTGG TGGCGCCGGA CCGCCGCGAC GCTGGCGGTA
CCGCTGTGCG TGCTCTCGGC GGCACTGGTG GTGAACCTGT GGGTCGGCTA CTTCCCCACC
GTGCAGACGG CGTGGAACCA GCTCACCGCG GGGCCGCTGC CGGACCAGAC CGATGCGGTC
ACCGTCGCGG CGATGCAGAA GCAGCACGCG ATGCCCGTCA AGGGCAGCCT CGTCGCCGTC
GACATTCCCG ACACCGCGTC CGGATTCCGG CACCGGCAGG AGTGGGTGTA CCTGCCGCCG
GCCTGGTATG CCAGTGACCC GCCACCCGCA CTGCCCACCG TCATGATGAT CGGTGGCGAG
TTCAACACCC CCGCGGACTG GCTGCGCGCG GGCGGTGCGG CAAGGACGCT CGACGCCTTC
GCGGCCGCCC ATGGCGGATA CGCTCCGGTG GTGGTGTTCG TCGATCCCGG CGGGACGTTC
AACAACGACA CCGAGTGCGT CAACGGCACG CGCGGCAACT CCGCCGACCA TCTGGTCAAA
GACGTGGTGC CGTATATGAA GTCGCATTTC GGGGTGAGCT CGGCCGCGGC CAACTGGGGT
GTGGTCGGCT GGTCGATGGG CGGCACGTGC GCCGTCGACC TGACCGTCAT GCACCCGGAG
GTGTTCAGCG CGTTCGTCGA CATCGCCGGT GACGCCGCAC CGAACGCCGG CACCCAGGCT
GAAACCGTGG ACCGCCTGTT CGGCGGCGAC ACGGCCGCCT ACACGTCGTT CGACCCGACC
GCGGTGATGA CCCGGCACGG CCCGTACCGT GGGGTGGCCG GGTGGTTCGA CGTCAACGGC
GCCGTCGCGG TGTCGGCCAC TGCGCAACCC AACGATCAAG CGGTCGCGGC CAGTTCACTG
TGCGCGACAG GTGGCAGGTC GGGCATCGAC TGCGCGGTGG TGAGTCAGCC GGGCAACCAC
GACTGGCCGT TTGCGTCCAC CGCCTTCACC TCGGCCCTGC CGTGGCTCGC GGGCCGGATC
GGCACCCCCG GTGTGCCGCA GATCGATCTG CCCCGCACCG TGTCCGGGTA G
 
Protein sequence
MHGWLPVAVQ ILAVVAVVTA IGRRTRRWRL VWVPWSALIG VVLAVAAYCY IASAGVADDS 
NPAPHTLWVW IALSGFAFGV LVSGWRGARW WRRTAATLAV PLCVLSAALV VNLWVGYFPT
VQTAWNQLTA GPLPDQTDAV TVAAMQKQHA MPVKGSLVAV DIPDTASGFR HRQEWVYLPP
AWYASDPPPA LPTVMMIGGE FNTPADWLRA GGAARTLDAF AAAHGGYAPV VVFVDPGGTF
NNDTECVNGT RGNSADHLVK DVVPYMKSHF GVSSAAANWG VVGWSMGGTC AVDLTVMHPE
VFSAFVDIAG DAAPNAGTQA ETVDRLFGGD TAAYTSFDPT AVMTRHGPYR GVAGWFDVNG
AVAVSATAQP NDQAVAASSL CATGGRSGID CAVVSQPGNH DWPFASTAFT SALPWLAGRI
GTPGVPQIDL PRTVSG
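
The gene and protein sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence so the two listings can be compared. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and introduced only for this example. The codon table is written out by hand and uses the standard amino acid assignments, which translation table 11 shares; alternative start codons, which table 11 also permits, are not handled here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which are the same as those of translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Any trailing incomplete codon is ignored.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
print(translate("ATGCACGGGT GGTTGCCTGT"))  # MHGWLP
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a quick consistency check between the two listings; `gc_fraction` on the full gene sequence can likewise be compared with the GC content field.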