Gene MCA2089 details


Gene Information

Locus tag: MCA2089
Symbol: cysG
ID: 3103677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2247424
End bp: 2248848
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 68%
IMG OID: 637171243
Product: ferrochelatase
Protein accession: YP_114519
Protein GI: 53803867
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase
        [COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain)
TIGRFAM ID: [TIGR01469] uroporphyrin-III C-methyltransferase
            [TIGR01470] siroheme synthase, N-terminal domain
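
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2247424
end_bp = 2248848

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# Number of whole codons, and any leftover bases (should be 0 for a CDS).
codons, remainder = divmod(gene_length, 3)

# Assumption: exactly one terminal stop codon, not counted as a residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1425 0 474
```

Under these assumptions the computed values agree with the listed Gene Length (1425 bp) and Protein Length (474 aa).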


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0691041
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTACC TGCCGATTTT CCTCAAGCTT CGCGACCTCC CCTGCCTGAT CGTGGGGGGC 
GGCGAGGTCG CCGTCCGCAA ACTCGGCCTG CTGCTCGATG CGGGCGCAGC AGTCACCGTG
ATCGCTGCCA GCGCCGAAGC GGTCATCGTC GAACTCGCCG ACCGCGGCCT GATCCGACTA
CGTACCAAAG TTTTCGAGGC GACCGACAGC GAAGGTTTCC GGCTGATCAT TGCGGCAACC
GACGACCGCG CGGTCAACGC CGCCGTCGCA ACCGCTGCGC GGCGACACGG CATCCCGGTC
AACGTGGTCG ATTGCCCCGA CCTGTGCGAC TTCATCTTCC CGGCCATCAT CGACCGCTCG
CCAGTGCTGG TCGCGGTTTC CACCGGTGGC GCCTCACCCG TGTTGGCCCG GCAGCTTCGG
ACCCGCATCG AGACCTGTAT TCCCTCCCGT TTCGGTACGC TGGCCCGGCT CGCCGCCGAC
TTGCGGGAAC GGGTCCGGCA GGCGATTCCC GAACCGCGCG CCCGCCGCCA TTTCTGGGAG
CGGACACTGG AGGGGCCAGC CGCAGAACTG GCGCTGCAGG GGCGCGCCGA GGACGCAGAG
CGCGTGCTGC TCGAAGCAGC GGACGCAGCC GCCCGGCAAG AAAGACCGGC GTGGGGCTCG
GTCGCGCTGG TCGGCGCGGG ACCCGGCGAT CCGGACCTCC TGACGCTACG CGCTCTGCGC
CTGATCCAGG AAGCCGACGT CATCGTCTAT GACCGTCTGG TCTCGGCCGA GATCCTGGCG
CTGGCGCGCC GGGACGCGCG GCGCATCTAT GCCGGCAAGG AACGCAGCCG CCATAGCATC
CCGCAGGACG ATATCAACGC CCTGCTGGCG AATCTGGCCG CCGAAGGCAA CCGGGTGGTC
CGCCTCAAAG GCGGCGACCC CTTCATTTTC GGCCGGGGCG GCGAAGAGAT CGAAACCCTG
ATGGCCTGCG GCATCCCATT CCAGGTCGTC CCGGGAATCA CGGCGGCGTC GGGATGCGCC
GCCTATGCCG GCATCCCCCT GACCCACCGC GCCCATGCGC ATGCCTGCGT GTTCGTCGCC
GGCCACCTGA AGGATGGAAC CCTGCAGGAT CTGGACTGGT CTCAGCTGGT CCAACCCCAC
CAGACCGTGG TGGTGTACAT GGGCTTACAG GGCCTGCCAC AGATCTGCGC CGAACTGATC
CGCCACGGCG CGCCGCCATC CCGTCCTGCA GCGCTGATCC AGCAGGGCAC CACCCGCGAC
CAGAAGGTCT TGACCGCCAC GCTGGAAACG CTCCCGGACA AGGTCGCCGA CGCGGGAATC
AAGGCCCCCA CCCTGATCAT CATCGGCGAG GTGGTAGGGC TGCGGAAAAA ACTCGCCTGG
TACCGCAGCC GCCAGGAAAC CGAGGGAAGG TCCGGAAACG GCTGA
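
The GC content field in the record can be recomputed from the gene sequence. The sketch below is one plain way to do that. The function name `gc_content` is hypothetical, and the record does not say how its own figure was computed or rounded, so a small difference from the listed value would not be surprising.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Illustration on the first 30 bases of the gene sequence only.
first_block = "ATGGATTACC TGCCGATTTT CCTCAAGCTT"
print(round(gc_content(first_block), 1))
```

Passing the full 1425 bp sequence (spaces and line breaks are ignored) gives a figure that can be compared with the GC content listed in the record.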
 
Protein sequence
MDYLPIFLKL RDLPCLIVGG GEVAVRKLGL LLDAGAAVTV IAASAEAVIV ELADRGLIRL 
RTKVFEATDS EGFRLIIAAT DDRAVNAAVA TAARRHGIPV NVVDCPDLCD FIFPAIIDRS
PVLVAVSTGG ASPVLARQLR TRIETCIPSR FGTLARLAAD LRERVRQAIP EPRARRHFWE
RTLEGPAAEL ALQGRAEDAE RVLLEAADAA ARQERPAWGS VALVGAGPGD PDLLTLRALR
LIQEADVIVY DRLVSAEILA LARRDARRIY AGKERSRHSI PQDDINALLA NLAAEGNRVV
RLKGGDPFIF GRGGEEIETL MACGIPFQVV PGITAASGCA AYAGIPLTHR AHAHACVFVA
GHLKDGTLQD LDWSQLVQPH QTVVVYMGLQ GLPQICAELI RHGAPPSRPA ALIQQGTTRD
QKVLTATLET LPDKVADAGI KAPTLIIIGE VVGLRKKLAW YRSRQETEGR SGNG
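
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library, and assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in the permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order (same amino-acid assignments as the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence; both start in frame.
print(translate("ATGGATTACC TGCCGATTTT CCTCAAGCTT"))  # MDYLPIFLKL
print(translate("TACCGCAGCC GCCAGGAAAC CGAGGGAAGG TCCGGAAACG GCTGA"))  # YRSRQETEGRSGNG
```

The two fragments reproduce the start and end of the protein sequence listed above. Applying `translate` to the full gene sequence is the natural next check.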