Gene Clim_2341 details


Gene Information

Locus tag: Clim_2341
Symbol: hemE
ID: 6355687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2570962
End bp: 2572017
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 51%
IMG OID: 642669933
Product: uroporphyrinogen decarboxylase
Protein accession: YP_001944343
Protein GI: 189347814
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
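
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (as the stated gene length implies) and that the CDS ends in a single stop codon that is not counted as a residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2570962
end_bp = 2572017
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span matches gene length
print(span_bp % 3 == 0)                          # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```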


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCTCAAAA ATGATCTATT TCTCCGGGCA TTGAAGCGTC AGCCCTGTTC CAGGACGCCT 
ATCTGGGTGA TGAGACAGGC CGGCCGCTAT CTTCCGGAGT ATCGGGCAGT CAGAGAAAAA
ACTGACTTTT TAACCCTTTG CAAAACGCCT GAACTGGCCG CCGAGGTAAC CATTCAGCCT
GTTGATCTGA TGGGTGTCGA TGCCGCGATC ATCTTTTCCG ACATTCTTGT GATCAATGAA
GCCATGGGAA TGAACGTCGA GATTATCGAG ACAAAAGGAA TCAAGCTTTC TCCTGTCATC
CGCAGCAAGG CAGATATCGA CAAGCTCATC GTGCCCGATA TCGATGAGAA GCTCGGCTAT
GTTATGGATG CCCTGCGTCT TACCAAGAAA GAACTCGACA ATCGCGTTCC GCTTATCGGT
TTTTCCGGTG CCGCATGGAC GCTCTTTACC TATGCAGTGG AAGGCGGCGG CTCGAAGAAC
TACGCTTTTG CCAAGAAAAT GATGTACCGT GAGCCGCAGA TGGCCCATCT TCTGCTCGGC
AAGATTTCCG AAACCATCAG CGCCTATCTG CTCAAGCAGG TCGAGGCCGG TGCCGACGCA
ATCCAGATTT TCGATTCATG GGCAAGCGCT CTCTCCGAGG ACGATTATCG TGAATTCGCT
CTTCCTTACA TCAAGCAGAA TGTTCAGGCT GTCAAGGCGA AGTATCCAGA CATTCCCGTT
ATCGTATTTT CGAAAGACTG CAACACCATT CTTTCCGATA TTGCTGATAC CGGCTGCGAT
GCCATGGGTC TTGGATGGGG CATAGATATC GCAAAAGCCC GTGCCGAGCT CAAGGACCGA
GTCGCCCTGC AGGGTAATCT CGATCCGACA GTGCTCTACG GCACCCCTGA AAAGATCAAG
TCGGAAGCAG CAAAAGTCCT GAAACAGTTC GGTCAGCACA CCGAAAGCTC AGGTCATGTT
TTCAACCTCG GACATGGTAT TCTTCCCGAT GTCGATCCGG CAAACCTGAA GCTTCTTGTC
GAATTCGTCA AGGAAGAGAG CGCCAGGTAC CACTGA
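
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any computation such as GC content. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first printed line, so its result is not comparable with the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, as printed in 10-base groups.
first_line = "ATGCTCAAAA ATGATCTATT TCTCCGGGCA TTGAAGCGTC AGCCCTGTTC CAGGACGCCT"
seq = clean_sequence(first_line)

print(len(seq))                        # number of bases on that line
print(round(gc_fraction(seq) * 100))   # GC percentage of that line only
```

Passing the full listing through `clean_sequence` first should give a figure that can be compared with the reported GC content.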
 
Protein sequence
MLKNDLFLRA LKRQPCSRTP IWVMRQAGRY LPEYRAVREK TDFLTLCKTP ELAAEVTIQP 
VDLMGVDAAI IFSDILVINE AMGMNVEIIE TKGIKLSPVI RSKADIDKLI VPDIDEKLGY
VMDALRLTKK ELDNRVPLIG FSGAAWTLFT YAVEGGGSKN YAFAKKMMYR EPQMAHLLLG
KISETISAYL LKQVEAGADA IQIFDSWASA LSEDDYREFA LPYIKQNVQA VKAKYPDIPV
IVFSKDCNTI LSDIADTGCD AMGLGWGIDI AKARAELKDR VALQGNLDPT VLYGTPEKIK
SEAAKVLKQF GQHTESSGHV FNLGHGILPD VDPANLKLLV EFVKEESARY H
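
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a hedged sketch in Python, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch ignores). The `translate` function and the constant names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (bases ordered T, C, A, G,
# first codon position varying slowest). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTCAAAA ATGATCTATT TCTCCGGGCA"))  # MLKNDLFLRA
```

The ten residues printed match the first group of the protein sequence above.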