Gene Cpha266_0174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0174 
SymbolhemE 
ID4568471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp198999 
End bp200054 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content49% 
IMG OID639764774 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_910665 
Protein GI119356021 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAAAA ATGATCTCTT CCTCCGGGCG TTAAAAAGAC AGCCTTGCTC ACGAACACCA 
ATCTGGGTGA TGCGGCAGGC TGGCCGTTAC TTGCCGGAAT ATCGTGCCGT AAGAGAAAAA
ACAGACTTTT TAACCTTGTG CAAAACCCCG GAACTGGCAG CAGAAGTAAC CATTCAGCCG
GTTGACTTAA TGGGTGTCGA TGCAGCCATT ATTTTCTCGG ATATCCTTGT CGTCAACGAA
GCCATGGGAA TGGACGTCGA AATCATCGAA TCCAAAGGCA TACGGCTCTC ACCTGCTATC
CGCTCGCAGG TCGACATTGA TCGGCTTATC ATCCCTGACA TCAATGAAAA GCTCGGGTAT
GTGATGGATG CCATCCGCCT GACAAAAAAA GAGCTTGACA ACAGAGTCCC GCTTATCGGA
TTTTCCGGTG CAGCATGGAC GCTCTTCACC TATGCCGTCG AAGGTGGTGG GTCAAAGAAC
TACGCCTTTG CCAAAAAGAT GATGTATCGT GAGCCGAAAA TGGCCCATAT GCTCCTCAGC
AAAATTTCCA GCGTCATCAC CGAATATGTC CTGATGCAGA TCGAAGCCGG TGCAGATGCA
ATCCAGATAT TCGATTCATG GGCAAGCGCA CTTTCAGAAG ACGACTATCG CGAATTTGCC
CTTCCCTATA TCAAGGAAAA CGTTCAGGCA ATCAAGACAA AATATCCCGA CACTCCGGTA
ATTGTCTTCT CGAAAGACTG TAACACCATT CTCTCCGAAA TTGCCGATAC CGGCTGCGAT
GCCATGGGCC TTGGATGGAA CATGGATATT GCCAAAGCGC GCAAAGAACT GAACGACAGA
GTCTGCATTC AGGGCAATAT GGATCCGACA GTACTGTACG GCACTCCGGA TAAAATCCGC
TCGGAAGCAG CCAAAATACT CAAGCAGTTT GGCCAGCATA CAGCGACATC AGGCCATGTG
TTCAACCTCG GACACGGCAT TCTTCCGGAT GTCGATCCTG CAAACCTGAA ACTCCTTGTG
GAATTTGTCA AGGAAGAAAG TGTCAAGTAC CACTAA
 
Protein sequence
MLKNDLFLRA LKRQPCSRTP IWVMRQAGRY LPEYRAVREK TDFLTLCKTP ELAAEVTIQP 
VDLMGVDAAI IFSDILVVNE AMGMDVEIIE SKGIRLSPAI RSQVDIDRLI IPDINEKLGY
VMDAIRLTKK ELDNRVPLIG FSGAAWTLFT YAVEGGGSKN YAFAKKMMYR EPKMAHMLLS
KISSVITEYV LMQIEAGADA IQIFDSWASA LSEDDYREFA LPYIKENVQA IKTKYPDTPV
IVFSKDCNTI LSEIADTGCD AMGLGWNMDI AKARKELNDR VCIQGNMDPT VLYGTPDKIR
SEAAKILKQF GQHTATSGHV FNLGHGILPD VDPANLKLLV EFVKEESVKY H