Gene Csal_0116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0116 
Symbol 
ID4027042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp139207 
End bp140268 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content65% 
IMG OID637965267 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_572179 
Protein GI92112251 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.754255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTGC AAAACGACCG CCTACTGCGT GCCTTGGCGC GCCAACCGGT AGACCGCACA 
CCGGTGTGGA TGATGCGCCA AGCGGGCCGT TATCTGCCCG AATATCGGGA GACGCGCGGC
CAGGCCGGCA GTTTCATGGA CCTGTGCCGC AACGCCGAAC TGGCGTGCGA GGTCACCATG
CAGCCGCTTC GCCGCTATGC GCTCGATGCG GCGATCCTGT TTTCCGACAT CCTCACGATT
CCCGACGCCA TGGATCTGGG GCTGTACTTC GAAACGGGCG AAGGCCCCAA GTTTCGCAAG
ACGGTGCGCA GCGCCGAGGC TGTGGACGCC TTGCCGGTGC CGGATGCCGA GCGGGATCTC
GATTATGTGA TGAACGCGGT GCGCACCATT CGCCACGAAC TGGCGGACAG CGTGCCGTTG
ATCGGCTTTT CGGGCAGCCC CTGGACGCTG GCGACCTACA TGATCGAAGG CGGCTCGAGC
AAGGACTTCC GGCACGCCAA GGCATTGATG TACGGCGATC CCGCGGCGAT GCACGCGCTG
CTCGACAAGC TGGCGCGGTC GGTCACCGAC TACCTCAATG CGCAGATTCG TGCCGGAGCC
CAGATCGTGC AGATCTTCGA CACCTGGGGC GGCGTGTTGT CGACGCCGGC CTACCGCGAG
TTCTCGCTGG CCTACATGGC GCGCATCGTC GAAGGACTGA TCCGGGAGCA CGAGGGGCGC
CACGTGCCGG TGATCCTGTT CACCAAGCAG GGCGGCCAGT GGCTGGAGAC CATCGCCGAC
AGCGGCGCCG ATGCCGTGGG CCTGGACTGG ACCACCGAGC TGAGCGACGC CCGGGCCCGT
GTCGGGGATC GCGTGGCGCT GCAGGGCAAT CTCGATCCCA ATGTGCTCTT CGCCTCGCCC
CAGGCGATTC GCGATGAGGT GGCGCGCATT CTGGCCAGCT ATGGCAGCGG TCCCGGCCAT
GTCTTCAACC TGGGGCATGG TGTCAGCCAA TTCACTGATC CCGATCATGT CGCCGCCTTC
ATCGAGGCAC TGCATGAACT CAGCCCGCGT TATCATGGCT GA
 
Protein sequence
MPLQNDRLLR ALARQPVDRT PVWMMRQAGR YLPEYRETRG QAGSFMDLCR NAELACEVTM 
QPLRRYALDA AILFSDILTI PDAMDLGLYF ETGEGPKFRK TVRSAEAVDA LPVPDAERDL
DYVMNAVRTI RHELADSVPL IGFSGSPWTL ATYMIEGGSS KDFRHAKALM YGDPAAMHAL
LDKLARSVTD YLNAQIRAGA QIVQIFDTWG GVLSTPAYRE FSLAYMARIV EGLIREHEGR
HVPVILFTKQ GGQWLETIAD SGADAVGLDW TTELSDARAR VGDRVALQGN LDPNVLFASP
QAIRDEVARI LASYGSGPGH VFNLGHGVSQ FTDPDHVAAF IEALHELSPR YHG