Gene Arth_2772 details

Gene Information

Locus tag: Arth_2772
Symbol: hemE
ID: 4444571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3124541
End bp: 3125578
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 69%
IMG OID: 639690594
Product: uroporphyrinogen decarboxylase
Protein accession: YP_832251
Protein GI: 116671318
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
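
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts one stop codon after the last residue. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3124541
end_bp = 3125578
gene_length_bp = 1038
protein_length_aa = 345


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def cds_length(protein_len):
    """Coding length in bp, assuming one codon per residue plus one stop codon."""
    return 3 * (protein_len + 1)


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
codons_ok = cds_length(protein_length_aa) == gene_length_bp
print(coords_ok, codons_ok)  # True True
```

Under those assumptions, 3125578 - 3124541 + 1 = 1038 bp and 3 x (345 + 1) = 1038 bp, so the listed coordinates, gene length, and protein length agree with one another.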


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.164918
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGGCC GCACGGCTGA TTCGCCGCTC ATCACCGCCT ACCGCGGCGG CAAGCCGACC 
CGCCGTCCCG TCTGGTTCAT GCGGCAGGCA GGACGCTCCC TGCCCGAGTA CCTGAAGGTG
CGCGAAGGCG TGGCCATGCT GGACTCCTGC CTGCGACCCG AGCTGGCCTC GGAGATCACG
CTGCAGCCGG TGCGACGGCA CGACGTCGAC GCAGCCATCT TCTTTTCGGA CATCGTCATT
CCGCTGAAGC TCGCCGGCGT GGGAGTGGAC ATCGTTCCGG GCGTCGGTCC GGTCCTGGAC
AAGCCGGTCC GCACCGCCGA GGACGTCGCG GCACTCCCGC AGCTGACCTG GGAAGCGCTG
GAGCCCATCC GGGAAGCCGT CCGGCTCACC GTGGCCCAGC TGGGCAAGAC CCCGCTGATC
GGTTTTGCGG GCGCGCCGTT CACCCTCGCC GCATACATGG TGGAAGGCAA GCCTTCCCGC
GACCACCTCG GCCCGCGCAC CATGATGCAC GCAGATCCTG AAACCTGGAA TGCGCTGGCC
AACTGGGCTG CCGACGCGTC CGGCATGTTC CTGCGCGCCC AGCTTGAAGC CGGCGCTTCC
GCGGGCCAGC TGTTCGATTC CTGGGCGGGC TCGCTGGGAC TGGCCGATTA CAAACGCTTC
GTGGCCCCGG CCTCCGCCCG CGCGCTGGAC CACGTCCGCC ACCTCGGCGC GCCACTGATC
CACTTTGGAA CCGGTACATC GGAACTGCTG GTGGCGATGC GCGACGTCGG CGTCGACGTG
GTGGGGGTCG ACTACCGGCT TCCGCTGGAC GAAGCCAACC GCCGGCTGGG CGGCACTGTG
CCGCTGCAGG GCAATATCGA CCCCGCGCTG CTGTCAGCCC CGTGGGCAGT CCTCGAGGCC
CACGTCCGGG AAGTCATCAA GGCGGGATCC TTCGCGCCCG GACACGTCCT GAACCTGGGC
CACGGCGTGC CGCCGGAGAC CGACCCGGAC GTCCTGACGC GCGTCGTCGA ACTCATTCAC
TCCATTTCCC CGGAGTAA
 
Protein sequence
MDGRTADSPL ITAYRGGKPT RRPVWFMRQA GRSLPEYLKV REGVAMLDSC LRPELASEIT 
LQPVRRHDVD AAIFFSDIVI PLKLAGVGVD IVPGVGPVLD KPVRTAEDVA ALPQLTWEAL
EPIREAVRLT VAQLGKTPLI GFAGAPFTLA AYMVEGKPSR DHLGPRTMMH ADPETWNALA
NWAADASGMF LRAQLEAGAS AGQLFDSWAG SLGLADYKRF VAPASARALD HVRHLGAPLI
HFGTGTSELL VAMRDVGVDV VGVDYRLPLD EANRRLGGTV PLQGNIDPAL LSAPWAVLEA
HVREVIKAGS FAPGHVLNLG HGVPPETDPD VLTRVVELIH SISPE
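
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The example below is a minimal illustration, not the method used to produce this record: it builds the standard codon table (translation table 11 uses the same codon assignments for internal codons), strips the 10-character block spacing, and translates until the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first and last lines of the gene sequence are used as input.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq_block):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Table 11 also allows alternative start codons that are read as M;
    that case is not handled here because this gene starts with ATG.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = clean("ATGGACGGCC GCACGGCTGA TTCGCCGCTC")
last_line = clean("TCCATTTCCC CGGAGTAA")
print(translate(first_line))  # MDGRTADSPL
print(translate(last_line))   # SISPE
```

The two outputs match the first ten and last five residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.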