Gene Mjls_2201 details


Gene Information

Locus tag: Mjls_2201
Symbol: hemE
ID: 4877921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 2297341
End bp: 2298405
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 72%
IMG OID: 640139498
Product: uroporphyrinogen decarboxylase
Protein accession: YP_001070478
Protein GI: 126434787
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
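
The start, end, gene length and protein length fields can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the coordinates are 1-based and inclusive, which is consistent with the reported gene length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2297341
end_bp = 2298405

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Each codon is three bases. The final codon is the stop codon, which is not
# counted in the protein length.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1065 0 354
```

The remainder of 0 shows the coding sequence is a whole number of codons, and the result agrees with the 1065 bp and 354 aa reported above.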


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.145725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.153622
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACCC GCCGTGAACT GCCCGACTCG CCCTATCTCG CCGCCGCCCG CGGCCGCAAA 
CCCGCCCGGG TGCCGGTGTG GTTCATGCGG CAGGCGGGCC GGTCGCTGCC CGAATACCGG
GCGCTGCGGG CGCGCAACAC CATGATGCAG GCCTGCTTCG ACGCCGACCT GATCACCGAG
ATCACGCTGC AGCCGGTGCG CAGGCACGGA GTCGACGCCG CGATCCTGTT CTCCGACATC
GTCGTCCCGC TGCGGGCCTC GGGCATCGCG CTCGACATCG TGCCCGACGT GGGACCGGTG
ATCGACCATC CGGTGCGCAC CGCCGCCGAC GTCGCCGCGA TCCGGCCGCT CGAGCGGCAG
ACGGTCGAAC CCGTCGAGCA GGCGGTGCGG ATGCTCACCG CCGCACTCGG CGACGTCCCG
CTGATCGGGT TCGCCGGCGC CCCCTTCACC CTCGCCTCGT ATCTCGTGGA GGGTGGTCCG
AGCAAGCACC ACGAGCACAC CAAGGCGATG ATGCTCGGTG CGCCCGACAC CTGGCATGCG
CTCATGTCGG CGCTGACCGA CGTGACGATC GCGTTCCTGC AGGCCCAGGT CGACGCGGGC
GTCGACGCGA TCCAGGTGTT CGACTCCTGG GCGGGCACGC TGTCGCTGGC CGACTACCGC
GCCTACGTGC TGCCCCACAG CGCCCGCGTG TTCCAGGCGC TGGCACCGGC CGGGGTGCCG
ATGACGCACT TCGGGGTGGG CACCGCCGAA CTGCTCGGCG CCATGTCCGA GGCCATCGCC
ACCTCCGGGG CACCCGGTGT GGTGGGCGTG GACTGGCGCA CCTCGCTGAC CGACGCCGCG
GGCCGGGTCG AACGCGGCAG CGCGTTGCAG GGGAACCTCG ATCCGGTCGT CCTGCTGGCC
GGGTGGCCCG TGGTCGAACG TGCGGTGCGC GCCGTGGTCG AGGACGGCAG GCGCGCCGTC
GACGCCGGGG CGGCCGGACA CGTGTTCAAC CTCGGGCACG GCGTGCTGCC GGCGACCGAC
CCGGAGATCG TGACCGCCAC GGTGGAGCTG GTGCACTCGC TGTGA
 
Protein sequence
MNTRRELPDS PYLAAARGRK PARVPVWFMR QAGRSLPEYR ALRARNTMMQ ACFDADLITE 
ITLQPVRRHG VDAAILFSDI VVPLRASGIA LDIVPDVGPV IDHPVRTAAD VAAIRPLERQ
TVEPVEQAVR MLTAALGDVP LIGFAGAPFT LASYLVEGGP SKHHEHTKAM MLGAPDTWHA
LMSALTDVTI AFLQAQVDAG VDAIQVFDSW AGTLSLADYR AYVLPHSARV FQALAPAGVP
MTHFGVGTAE LLGAMSEAIA TSGAPGVVGV DWRTSLTDAA GRVERGSALQ GNLDPVVLLA
GWPVVERAVR AVVEDGRRAV DAGAAGHVFN LGHGVLPATD PEIVTATVEL VHSL
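
The GC content and the protein sequence both follow from the gene sequence. The sketch below shows one way to recompute them from the space-separated blocks used in this record. The function names (`clean`, `gc_percent`, `translate`) are hypothetical helpers written for this example, not part of any database tool. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. The sketch does not handle the alternative start codons that table 11 permits, which does not matter here because the gene starts with ATG.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Standard codon assignments, listed in T, C, A, G order for each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: _AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(_BASES)
    for j, b2 in enumerate(_BASES)
    for k, b3 in enumerate(_BASES)
}


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAACACCC GCCGTGAACT GCCCGACTCG"
print(translate(first_blocks))  # MNTRRELPDS
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.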