Gene Cphamn1_1843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1843 
Symbol 
ID6375534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1999872 
End bp2001227 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content49% 
IMG OID642684339 
Producturoporphyrin-III C-methyltransferase 
Protein accessionYP_001960241 
Protein GI189500771 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.572708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.146892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA GTGGTGGATA TGTGTATATC ATAGGAGCCG GTCCCGGTGA TCCTGAATTG 
TTGACCATAA AGGCTGAAAG GGTTCTTCAT GAGGTGGATG TGATACTCTA TGACAATCTT
GTGTCGAGTG AGCTGATAGA GCAGTTTGAT GCGTTGAAGA TTTACACCGG AAAGAGAAAA
GACCGGCATC ATTTTGAACA GGACGCGATT AACGAAGAGA TCGTCAGGCA CGCCCGGCAG
GGGAAAACAG TCGCCAGACT GAAAGGCGGC GATCCATTTG TTTTCGGCAG GGGTGGAGAA
GAAATCGCGT ACATGAGAAA AAAAGGTATC GAGTATGAAA TAATCCCCGG CATCACGGCG
GCTCATGGGG CGAGTGCCTA TACGGAGATA CCGCTGACCA TGAGGAATAT CTCATCGTCG
GTCGCTTTCT GTACCGGACA TCCACTGGAA AAGATTCATG TCCCCGATGC AGACACCATC
GTCTACTATA TGGTTGCGTC AGCTGCAAGT GACGTTTCGA AAGCCCTTAT CGACAAGGGA
AAGGCGGAAA CAACCAAAGT TGCAATAGTT CAGAACGCAA CGAGGTACAA TCAGAAGATT
TTCGCTGGAA CGCTGGGGGA GCTGAAAAAC AGGGACCTGC AGGTTGTTTC TCCCGCCCTT
TTGATCCTTG GCGAAAACAT TAACGAGTTT ATAGCCGACA GCTGGTACGC AAAAAAGAAA
AAGGTGCTGA TTGCCGGAGA TGATTCCGGG AGGTATACAC CGAGGCATCA CGTTATCGTG
CGATATCCCT TTACCGTGGT CAAAAGTATT GACAACAAGG AGGTAAGAAG ATGCATGGAG
AAGATAGGCG ACTATTCCTG TGTTCTGTTT ACCGATTTCC ATGCCGTGGG CTATTTTTTC
AACGCTCTTG TGGAGAGCGG CAGAGACGCG CGTCACCTGG CAAATGCTTC CATCTGCGTC
ACTGGAAAAA AGGTTTCCGC AGAACTGTTG CGGCATGGTG TCAAACCTGA CTATTCGTTT
GATGCGGGAA ACATCGTGGA GGAACTGAGG ACTGTGAGGG GGGATGGCTT TTCCGGAGAA
GATGTGCTGT TGCCGGGTTC GGGCTCTATT GACGATCCGG TTGTCAATGC GTTGACGGCA
TTGCATAACA GGGTGACTCC GTTACAGGTT CATGTTCAAG GCCTCGACTC TTCGGATGAG
CTGATCGATC TGGATTGTAT AGATGAAATT TATTTCTCTG CGCCTTCCTG TGTGGAGAAC
TTCAAATCTT TCTATGAAGC GATTCCTGAA AACATAACAG TGACTACAGC TGACAGGGAG
ACCGGCATGG AGTACAAAAA GCTTTTCAGG GGATGA
 
Protein sequence
MKRSGGYVYI IGAGPGDPEL LTIKAERVLH EVDVILYDNL VSSELIEQFD ALKIYTGKRK 
DRHHFEQDAI NEEIVRHARQ GKTVARLKGG DPFVFGRGGE EIAYMRKKGI EYEIIPGITA
AHGASAYTEI PLTMRNISSS VAFCTGHPLE KIHVPDADTI VYYMVASAAS DVSKALIDKG
KAETTKVAIV QNATRYNQKI FAGTLGELKN RDLQVVSPAL LILGENINEF IADSWYAKKK
KVLIAGDDSG RYTPRHHVIV RYPFTVVKSI DNKEVRRCME KIGDYSCVLF TDFHAVGYFF
NALVESGRDA RHLANASICV TGKKVSAELL RHGVKPDYSF DAGNIVEELR TVRGDGFSGE
DVLLPGSGSI DDPVVNALTA LHNRVTPLQV HVQGLDSSDE LIDLDCIDEI YFSAPSCVEN
FKSFYEAIPE NITVTTADRE TGMEYKKLFR G