Gene Cphy_1787 details


Gene Information

Locus tag: Cphy_1787
Symbol:
ID: 5741460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2202084
End bp: 2203577
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 38%
IMG OID: 641292885
Product: uroporphyrin-III C-methyltransferase
Protein accession: YP_001558896
Protein GI: 160879928
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase
        [COG1587] Uroporphyrinogen-III synthase
TIGRFAM ID: [TIGR01469] uroporphyrin-III C-methyltransferase
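
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the Start and End positions are 1-based and inclusive and that the coding sequence ends with a single stop codon (neither is stated in the record). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2202084, 2203577)
print(length_bp)                  # 1494
print(protein_length(length_bp))  # 497
```

Under these assumptions the computed values agree with the Gene Length (1494 bp) and Protein Length (497 aa) fields.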


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000345374
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGCAG GAAAAGTGTG GTTAGTTGGT GCAGGGCCGT CAGACCCTGG ATTGCTCACG 
ATAAAAGGAA AAGCTGTTTT AGAGCAAGCG GAAGTAGTGG TATATGATCA GTTGGTAGGA
GATGGAATTC TTTGTATGAT ACCAAAGTCT GCAAAGAAGA TAAATGTTGG TAAATATGCA
GGAAATCATA CGGTAGTACA AGAGCGTATT AATGAGATTC TATTAGAAGA GGCTTTGGAA
GGTAAAAGGG TAGTAAGGTT AAAAGGCGGT GATCCTTTCT TATTTGGTAG AGGAGGAGAG
GAGCTTGAGT TACTTTGTGA ACATGGTATT CCTTATGAGA TTGTACCAGG CATAACATCT
GCAATCTCTG TACCTGCCTA TGCGGGGATA CCAGTGACTC ATCGTGATTT TACCTCATCC
TTACATATTA TTACCGGACA TAGAAAGAAA GGATGTAAAG AATCCATAGA TTATAAAAGT
TTAGTAGCTC TTGGAGATAC AACGCTTGTG TTTTTAATGG GTGTTGCTGC ATTATTCGAT
ATTTGCAGAG GATTAATTGA TGCAGGAATG GATCAAGCTA CACCGGCTGC AATATTAGAA
CGTGGCACTA CTGCAAAGCA AAGAAAGATT ATTGCAAACC TGGCTACGTT ACCTGAGGAA
GCAAACAATC AGAACATAGG CACACCTGGT ATTATTGTAG TAGGTAAAGT GTGTTCGCTT
TCTCAAACCT TTGCCTGGGC AGAAAAGAGA TTACTTGGCG GGGTAAGAGT AGTTGTAACA
AGACCAAAAG ATAAAGTATC TTCACTAGCA ACGAAACTTT ATGATCTTGG AGCAGAGGTA
GTAATGTTAT CTGGAACTTA TACAAAACCA ATTGAGGATA ATACCCAGTT GATCGAGGTG
CTTGATACAA TCCAGGATAA TAAATGGATT CTATTTGCAA GCGAAGTGGC GGTAGATATT
TTCTTTGATT GTTTATGGCA GCAGCAGATT GACATTCGAA GCTTATGGAA TTGTCATTTT
GCAGCTGTAG GACCTACGAC CCGAAAGGCA ATAGAGAAAA GAGGTATTCG TGTAGACTAT
ATGCCAGAGA AATATTATGG AATGGAACTG GCAAAAGGAA TTGAAGCGAT GGCTAAACCA
CAGGATAAAG TGCTTGTACT AATACCGAAA GAAACAGATA GCGAACTTGC TAATTATTTA
TCCACTACAA AGGTAAATAT GAAGGCTGTT CCGGTGTATG AAATTGTCTA TGAGGATAAT
GAGCAAGTAA ATATAGAGGA ATCAGATATT GTAACATTTA CAAGTGCTTC TACAGTTCGA
AGTTTTGTTA ATACAATGAA AGATATAGAT TTTCATAAGG TTCAAGCAGT ATGTATTGGA
GAGCTAACAG CAAAAGAAGC AGCTTTCTAT GGAATGAAAA TTAATGTTGC AAAAGAAGCT
ACTATTGATA GTTTAATTGA GAGTGTTATA GAAATAAACA AGAGAGAATT GTGA
 
Protein sequence
MMAGKVWLVG AGPSDPGLLT IKGKAVLEQA EVVVYDQLVG DGILCMIPKS AKKINVGKYA 
GNHTVVQERI NEILLEEALE GKRVVRLKGG DPFLFGRGGE ELELLCEHGI PYEIVPGITS
AISVPAYAGI PVTHRDFTSS LHIITGHRKK GCKESIDYKS LVALGDTTLV FLMGVAALFD
ICRGLIDAGM DQATPAAILE RGTTAKQRKI IANLATLPEE ANNQNIGTPG IIVVGKVCSL
SQTFAWAEKR LLGGVRVVVT RPKDKVSSLA TKLYDLGAEV VMLSGTYTKP IEDNTQLIEV
LDTIQDNKWI LFASEVAVDI FFDCLWQQQI DIRSLWNCHF AAVGPTTRKA IEKRGIRVDY
MPEKYYGMEL AKGIEAMAKP QDKVLVLIPK ETDSELANYL STTKVNMKAV PVYEIVYEDN
EQVNIEESDI VTFTSASTVR SFVNTMKDID FHKVQAVCIG ELTAKEAAFY GMKINVAKEA
TIDSLIESVI EINKREL
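
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the start of the gene. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard genetic code; alternative start codons are not handled. All function names are hypothetical helpers, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# also used by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence, in the page's block layout.
start_of_gene = clean_sequence("ATGATGGCAG GAAAAGTGTG G")
print(translate(start_of_gene))  # MMAGKVW
```

The translated prefix matches the first seven residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and `gc_content` is one way to compare against the reported GC content field.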