Gene Mvan_0846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0846 
Symbol 
ID4646179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp880655 
End bp882370 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content70% 
IMG OID639804346 
Producturoporphyrinogen III synthase HEM4 
Protein accessionYP_951690 
Protein GI120401861 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1587] Uroporphyrinogen-III synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.75971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.92956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTGC GAGGACGCAA GGGCAAGCCC GGCCGCATCA CGTTCGTCGG CTCCGGGCCC 
GGGGATCCGG GTCTGCTGAC GACGCGGGCG CGGGCGGTGT TGGCCCACGC CGCCCTGGTG
TTCACCGATC CCGACGTGCC TGAAGCGGTG CTGGAGCTGG TGGGTTCCGA GCTACCGCCG
CCGTCGGGGC CGCTGCCGCC TGTCGCCGCC GGCGCGGCTC CGAAACCCGC CGATGCGGTG
GTGAAACAGG ACGCCGACGC GGCGGAGAAG CCGGCCGAGG GCAGCAAGGC TGCCGAAGAC
GCGGTCATCC CCGGGGGACC CGACGTGCGT CCCGCCCTGG GCGATCCGGT CGAGGTCGCC
AAGACGCTGG CCACCGAGGC CCGCACCGGG GTCGACGTGG TCCGGCTCGT CGCAGGAGAT
CCGCTGTCGA TCGACTCGGT GATCACCGAG ATCAACGCAC TGGCCAAGAC CCAGCTGAAC
TTCGAGATCG TGCCGGGACT GCCCGGTACC TCGGCGGTGC CCACCTACGC GGGTCTGCCG
CTGGGCTCGT CGCACACCGT CGCGGACGTG CGCGATCCCA ATGTCGACTG GGCGGCGCTG
GCCGCCGCAC CGGGACCGCT GATCCTGCAC GCCACGGCAT CGCACCTGCC GGAGGCGGCA
CGCACGCTGA TCGAGTACGG CCTCGCCGAT TCCACCCCGG CCGTGGTCAC GGCCAACGGC
ACCACCTGTC AGCAGCGTTC GATCGAGACG ACCCTTGGTG GGCTGCTCGA CAAGGCGGTG
CTGGACAAGC CGGCTGTCGC CGAACCCGCA GGCCCGCTGA CCGGCACGCT GGTGGTCACC
CTCGGTCGCA CCGTCGCCCA CCGCGCGAAG CTGAACTGGT GGGAGAGCAG GGCACTGTAC
GGGTGGACCG TTCTGGTGCC GCGCACCAAG GATCAGGCGG GTGAGATGAG CGACCGCCTT
GTCGGACACG GCGCGCTGCC CATCGAGGTG CCGACCATCG CGGTCGAGCC GCCGCGCAGC
CCGGCCCAGA TGGAAAGGGC CGTCAAGGGA TTGGTGGACG GCCGGTTCCA GTGGGTGGTG
TTCACCTCCA CCAATGCCGT GCGTGCGGTG TGGGAGAAGT TCAACGAATT CGGTCTGGAC
GCGCGCGCAT TCTCCGGCGT GAAGATCGCC TGCGTCGGTC AGGCGACGGC GGAACGGGTT
CGCGCCTTCG GGATCAACCC CGAGCTGGTA CCCACGGGTG AACAGTCGTC TCTGGGCCTG
CTCGACGAGT TCCCGCCGTA TGACGACATT TTCGATCCGG TGAACCGGGT GCTGTTGCCG
CGCGCGGACA TCGCCACCGA AACGCTGGCC GAAGGTCTGC GCGAGCGCGG CTGGGAGATC
GAGGACGTCA CCGCCTACCG CACGGTGCGC GCGGCACCGC CGCCGGCGCA GACCCGCGAG
ATGATCAAGA CCGGTGGCTT CGACGCCGTG TGCTTCACGT CGAGTTCGAC GGTGCGCAAC
TTGGTCGGCA TCGCGGGTAA GCCGCACGCC CGCACCATCG TGGCCTGCAT CGGACCCAAA
ACCGCCGAGA CCGCAGCGGA GTTCGGGCTG CGCGTGGACG TGCAGCCGGA GGTCGCCGCG
GTGGGACCGC TGGTGGAGGC GCTGGCCGAG CACGCCGCCA GGCTGCGGGC CGAGGGTGCA
TTGCCGCCAC CGCGGAAGAA GAGCCGCCGC CGCTAA
 
Protein sequence
MSLRGRKGKP GRITFVGSGP GDPGLLTTRA RAVLAHAALV FTDPDVPEAV LELVGSELPP 
PSGPLPPVAA GAAPKPADAV VKQDADAAEK PAEGSKAAED AVIPGGPDVR PALGDPVEVA
KTLATEARTG VDVVRLVAGD PLSIDSVITE INALAKTQLN FEIVPGLPGT SAVPTYAGLP
LGSSHTVADV RDPNVDWAAL AAAPGPLILH ATASHLPEAA RTLIEYGLAD STPAVVTANG
TTCQQRSIET TLGGLLDKAV LDKPAVAEPA GPLTGTLVVT LGRTVAHRAK LNWWESRALY
GWTVLVPRTK DQAGEMSDRL VGHGALPIEV PTIAVEPPRS PAQMERAVKG LVDGRFQWVV
FTSTNAVRAV WEKFNEFGLD ARAFSGVKIA CVGQATAERV RAFGINPELV PTGEQSSLGL
LDEFPPYDDI FDPVNRVLLP RADIATETLA EGLRERGWEI EDVTAYRTVR AAPPPAQTRE
MIKTGGFDAV CFTSSSTVRN LVGIAGKPHA RTIVACIGPK TAETAAEFGL RVDVQPEVAA
VGPLVEALAE HAARLRAEGA LPPPRKKSRR R