Gene Rcas_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1541 
Symbol 
ID5539017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1964853 
End bp1965968 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content57% 
IMG OID640893679 
Productmagnesium-protoporphyrin IX monomethyl ester cyclase 
Protein accessionYP_001431652 
Protein GI156741523 
COG category 
COG ID 
TIGRFAM ID[TIGR02029] magnesium-protoporphyrin IX monomethyl ester aerobic oxidative cyclase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00244092 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAACA TGCCCGGCGA GTACAGTCCG ACGACACGCC ATGCGCTGCG CGAGGCGATC 
CTCAACCCGC GCTTCTACAC AACCGATTTC CGCGCAATTG ATCGTCTGAA TGTGGCGCAC
ATGCGTGATG AGTTCGATTG GATTCGCAAC GAATTCGAGT TCGATTACAA CAAGAAGCAT
TTTGTGCGCA ACGAGGAGTT TCTGGCAAAC TTCGACGATA TGCCAGCCCG CGATCTGTTC
ATCGAGTTCC TCGAGCGTAG TTGCACCGCC GAATTCAGCG GGTGTCTGCT CTATGCCGAG
ATGGTGAAAC ACCTGCACGA TCCGACGCTC AGGGCGATCT TTCGCTGTAT GAGCCGTGAT
GAAGGGCGTC ATGCCGGTTT TCTCAACAAA ACCATGGCGG ATCTGGGCGT CGAGATGAAT
CTTCAGGTGC TTCACACGCG CAAGAAGTAC ACGTATTTTC AGCCGAAATT CATTTTCTAT
AGCGTCTATC TCTCCGAGAA GATCGGCTAT GCCCGGTATA TCACAATTTA TCGCCATTTG
CAGCACCATC CGCAGGGCAT GATCCATCCG ATCTTCAAAT GGTTCGAGAA GTGGTGCAAC
GATGAATATC GGCATGGCGA GTTCTTCTCG CTCTTGATGC GCAGTCAGCC TGATCTGTTG
CGCGGCGGCA ACCTGCGCTG GATCCGATTC TTCCTGCTGG CGGTGTACGC CACGATGTAC
CTGAATGACG CGCGCCGCGC CGGGTTCTAC GAGGCGCTTG GACTGAACTG GCGCGACTAC
GATCAGCGCG TGATCCGGCT GACGAACCAT ATCGCCACGC AAGTGTTCCC GGTGACTCTG
CCGGTGGACG ATCCGCGCTT CTTCCGCCAT CTCGACGCCT GTGTGCGCTA CGATGCTCAG
ATCCGTGCGC TCGAAGGACG AAACGATCCG ATCGCGCAGG TGCAACGGGC GCGTCTGGGC
GCCGGGATTG CTGCGCGCCT CCTGGCGACC TATCGCCTGC CGCCGGCGCC GACCACCGAT
GCCAATCGTT GGAAGGGTCT CGAAGGCTTC CCGAACTATC CCGGTCCGGG CTGGAAACAG
GACGCATCAC TCAACGAACG GGTCATATCA GCATAA
 
Protein sequence
MINMPGEYSP TTRHALREAI LNPRFYTTDF RAIDRLNVAH MRDEFDWIRN EFEFDYNKKH 
FVRNEEFLAN FDDMPARDLF IEFLERSCTA EFSGCLLYAE MVKHLHDPTL RAIFRCMSRD
EGRHAGFLNK TMADLGVEMN LQVLHTRKKY TYFQPKFIFY SVYLSEKIGY ARYITIYRHL
QHHPQGMIHP IFKWFEKWCN DEYRHGEFFS LLMRSQPDLL RGGNLRWIRF FLLAVYATMY
LNDARRAGFY EALGLNWRDY DQRVIRLTNH IATQVFPVTL PVDDPRFFRH LDACVRYDAQ
IRALEGRNDP IAQVQRARLG AGIAARLLAT YRLPPAPTTD ANRWKGLEGF PNYPGPGWKQ
DASLNERVIS A