Gene Achl_2490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_2490 
SymbolhemH 
ID7293965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp2807568 
End bp2808833 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content67% 
IMG OID643590899 
Productferrochelatase 
Protein accessionYP_002488544 
Protein GI220913235 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000000132256 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCCGC TCGACCCCCA GACCACCGCC GTCAACCCGG TCACCGAAGC CGGACGCATG 
GCTCCCAAGA ACTACGACGC TGTACTCCTC GCCTCCTTCG GCGGGCCCGA GGGCCAGGAC
GACGTCATCC CGTTCCTCCG CAACGTCACC CGCGGCCGCG GCATCCCCGA CGAACGGCTC
GAGGAAGTTT CGCACCACTA CCGCGCCAAC GGCGGCATCA GCCCCATCAA CCAGCAGAAC
CGCGAGCTGA AAGCTGCGCT CGAGGCTGAG CTCGCTGCCC GCGGCATCGA ACTGCCCGTG
CTGTGGGGCA ACCGCAACTG GGCGCCGTAC ATCCCGGAGA CGCTGCAGGA CGCGTACGAC
GCCGGGCACC GCCGCCTGCT GATGGTCACC ACCAGTGCCT ACTCCTGTTA CTCCAGCTGC
CGCCAGTACC GCGAGGACAT CGGCATGGCC CTGACGGAAA CCGGCCTTGA CGGCCGCCTC
GAAGTGGACA AGGTCCGCCA GTACTTCGAC CACCCCGGTT TCGTGGAGCC GTTCATTGAG
GGCACCGCTG CAGGTCTCGC CGAGGTCCGT GCAAAGCTGA CCGAGGCCGG GACCCCCGAC
GCTCCGGTCC AGATCCTGTT TGCAACGCAC TCCATCCCCA CCCGGGACGC CGAGGCCGCA
GGCCGGTCGG AGGGTGAACC GCGTGAATTC GAGGACGGCT CCGCCTACGT GGCGCAGCAC
CTTGCCACGG CCGCTGCTGT GATGGAGCGC GTGACGGAAG AATCCGGACT TACCGCGGAC
TGGTCCCTGG TGTACCAGTC GCGGTCCGGT GCCCCGCATG TGCCGTGGCT CGAACCGGAC
ATCAACGACG CCATCGAGGA ACTCGCAGGC AAGGGAACCA AGGGAATCGT CATCGTCCCG
CTCGGCTTCG TCAGCGACCA CATGGAGGTG GTCTGGGACC TGGACACGGA GGCCCTGGAA
ACCTGTGCCA ACCTTGGGTT GGCTGCTACC CGGGTCCCCA CTCCGGGCAC GCACCGGAAG
TTCGTCAACG GGTTGGTGGA CCTGATCTCC GAACGCACGC TGGCCAACAA CATCAGCGAC
CGCCCGGCGC TGACCGGCCT TGGACCCTGG TACGACGTGT GCCGGCCCGG CTGCTGCGCA
AACTTCCGTG GCGAGAAGCC GACCATCGCA GGGGCAGACA CCACCGTCGG CACCGGGCAC
GACCCGTATC CCGCTGCCGG ATCCGCAACC TCGTCCGGCC AGTCCGGGGA AGCAGCGCAG
CTGTGA
 
Protein sequence
MSPLDPQTTA VNPVTEAGRM APKNYDAVLL ASFGGPEGQD DVIPFLRNVT RGRGIPDERL 
EEVSHHYRAN GGISPINQQN RELKAALEAE LAARGIELPV LWGNRNWAPY IPETLQDAYD
AGHRRLLMVT TSAYSCYSSC RQYREDIGMA LTETGLDGRL EVDKVRQYFD HPGFVEPFIE
GTAAGLAEVR AKLTEAGTPD APVQILFATH SIPTRDAEAA GRSEGEPREF EDGSAYVAQH
LATAAAVMER VTEESGLTAD WSLVYQSRSG APHVPWLEPD INDAIEELAG KGTKGIVIVP
LGFVSDHMEV VWDLDTEALE TCANLGLAAT RVPTPGTHRK FVNGLVDLIS ERTLANNISD
RPALTGLGPW YDVCRPGCCA NFRGEKPTIA GADTTVGTGH DPYPAAGSAT SSGQSGEAAQ
L