Gene Arth_2768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2768 
SymbolhemH 
ID4444567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3119960 
End bp3121240 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content66% 
IMG OID639690590 
Productferrochelatase 
Protein accessionYP_832247 
Protein GI116671314 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00310189 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCCGC TTGAATCACA GGAGGCACCG GCCTCGGTGA CGGCCGTCAA CCCGGTCACC 
GAATCCGGGC GTATGGCTCC GAAGGAATAC GACGCCGTCC TCCTCGCCTC ATTCGGCGGG
CCTGAGGGCC AGGATGACGT CATCCCCTTC CTCCGCAATG TCACCCGGGG GCGCGGAATC
CCCGACGAAC GGCTTGAAGA GGTTTCGCAC CACTACCGTG CCAACGGGGG CATCAGCCCG
ATCAACCAGC AGAATCGCGA GCTCAAGGCC GGGATCGAAG CGGAACTCTC GGCCAGGGGC
ATCAACCTGC CCGTTTTCTG GGGCAACCGC AACTGGGACC CCTACATTCC GCAGACCCTC
CAGGACGTGT ACGACGCCGG CCACCGCAAG GTCCTCATGG TCACCACGAG CGCCTACTCC
TGCTATTCCA GCTGCCGCCA GTACCGCGAG GACATCGGCA TGGCGCTGAC CGAGACCGGC
CTGGACGGGA AGCTGGAAGT GGACAAAGTC CGCCAGTACT TCGACCACCC GGGCTTCGTG
GAGCCCTTCG TGGAAGGGAC CGCTGCCGGC CTTGCCGACG TCCGCGCCCA GCTTGCCGCG
GTTGGTACTC CGGACGCACC GGTCCACATC CTGTTCGCCA CGCACTCCAT TCCGACGCGT
GACGCTGAAG CTGCCGGACG CTCCGAGGGT GAACCGCGCA CCTTCGCTGA AGGCTCGGCC
TACGTGGCGC AGCACCTGGC ATCCGGCGCC GAGGTCATCC GACGTGTCGA GGAAGAATCG
GGCCTGACCG CCCCATGGTC CCTCGTTTAC CAGTCCCGTT CCGGTGCTCC GTCCGTTCCG
TGGCTCGAAC CGGACATCAA CGACGCCATC GAGGAGCTTG CCGGCGAGGG TGTCAAGGGA
ATCGTGATCG TCCCCCTGGG TTTCGTCAGC GACCACATGG AGGTTGTCTG GGACCTGGAC
ACCGAAGCGC TGGAAACGTG CCGCAACCTT GGCCTGTCCG CAACCCGGGT GCCCACCCCC
GGCACGCACC GCAAATTCGT GAGCGGCATC GTGGACCTGG TCTGTGAGCG CACTGCCGCG
AACAATATTG CCGACCGGCC GCACCTCACC GACCTGGGGC CCTGGTATGA CGTCTGCCGC
CCCGGCTGCT GCGCCAACTT CCGGGGCGAG AAGCCCACCA TCGCAGGAGC TGACACCTCA
GTGGGCACAG GCCACGCCTC CTACCCTTCT GGTTCGGCTG ACACTCCGGC TGCCCAGGCG
GCGGGACAGG ACTCACTGTG A
 
Protein sequence
MSPLESQEAP ASVTAVNPVT ESGRMAPKEY DAVLLASFGG PEGQDDVIPF LRNVTRGRGI 
PDERLEEVSH HYRANGGISP INQQNRELKA GIEAELSARG INLPVFWGNR NWDPYIPQTL
QDVYDAGHRK VLMVTTSAYS CYSSCRQYRE DIGMALTETG LDGKLEVDKV RQYFDHPGFV
EPFVEGTAAG LADVRAQLAA VGTPDAPVHI LFATHSIPTR DAEAAGRSEG EPRTFAEGSA
YVAQHLASGA EVIRRVEEES GLTAPWSLVY QSRSGAPSVP WLEPDINDAI EELAGEGVKG
IVIVPLGFVS DHMEVVWDLD TEALETCRNL GLSATRVPTP GTHRKFVSGI VDLVCERTAA
NNIADRPHLT DLGPWYDVCR PGCCANFRGE KPTIAGADTS VGTGHASYPS GSADTPAAQA
AGQDSL