Gene Caul_5016 details

Gene Information

Locus tag: Caul_5016
Symbol:
ID: 5902478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5419078
End bp: 5420118
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 68%
IMG OID: 641565537
Product: uroporphyrinogen decarboxylase
Protein accession: YP_001686634
Protein GI: 167648971
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
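
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 5419078
end_bp = 5420118
gene_length_bp = 1041
protein_length_aa = 346

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not
# contribute a residue to the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions all three checks agree with the record: 5420118 - 5419078 + 1 = 1041 bp, and 1041 / 3 - 1 = 346 aa.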


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.739471
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCCC TCCCGAAACC CCTCGTCCAG CCTAAGCTCC TTTCCGTCCT GGATGGCACG 
AGCGCCAAAC GCCCGCCGGT GTGGTTCATG CGCCAGGCTG GTCGCTACCT GCCGGAATAC
CGGGCGGTGC GGGCGACGGA ACCGACCTTC ATCGATTTCT GTCTGAATCC GGAGAAGGCC
GCCGAGGTCA CCCTGCAGCC GATGCGGCGG TTTCCCTACG ACGCGGCCAT CGTGTTCGCC
GATATCCTGC TGATCCCCCA GGCGTTGGGC CAAAAGGTGT GGTTCGAAGC CGGCGAGGGA
CCCAAGCTGG GCGAACTGCC GTCGATCGAG TCGATGCGCG AGCTGACGGG CCAGGCCGGC
CAGGCGCTGG GGGCGGTGGG CGAGACCCTG AGCCGCGTGC GTTCGGTCCT GGAACCGGAG
CGCGCCCTGA TCGGCTTCGC CGGCGCGCCG TGGACGGTGG CGACCTACAT GATCGAAGGC
GGATCCAGCG ACCGCTCCGG CGCGCGGACC TTTGCCTATC AGCAGTCCGA CAAGCTTGAC
GCCCTGATCC AGGTGCTGGT CGATGCGACG ATCGACTATC TGGCGATGCA GGCGGCGTCG
GGCGCCCAGG TGCTAAAGCT GTTCGAGAGC TGGGCCGAGG GCCTGTCTGA GCCGCTGTTC
GAGCGGCTGG TGACGAAACC GCATACGGCC ATCGTCGAGG GTCTGCGGGC GAAGGGTGTG
ACCACCCCGA TCATCGGCTT CCCGCGCGGG GCGGGGACGC TGGTCGAGGC CTATGCCAGA
ACCGCGCCGG TGCAGGGCGT GGCGCTGGAC ACCCAGGCCT CGGCGGCGCT GGGCCAGGCG
ATCCAGAAGA CCAAGGCCAT CCAGGGCGCG CTGGATCCGT TGCTGCTGCG GGCCGGCGGC
GACGCCCTGC TGACGCGGGT CGATCAGCTT CTGGAGCAAT GGGGTCACGG CCCCTACATC
TTCAACCTGG GTCACGGCAT CCTGCCTGAT ACGCCGATCG CCCATGTCGA ATCGGTGCTG
GCCCGGGTCA CCGGCAAGTG A
 
Protein sequence
MTSLPKPLVQ PKLLSVLDGT SAKRPPVWFM RQAGRYLPEY RAVRATEPTF IDFCLNPEKA 
AEVTLQPMRR FPYDAAIVFA DILLIPQALG QKVWFEAGEG PKLGELPSIE SMRELTGQAG
QALGAVGETL SRVRSVLEPE RALIGFAGAP WTVATYMIEG GSSDRSGART FAYQQSDKLD
ALIQVLVDAT IDYLAMQAAS GAQVLKLFES WAEGLSEPLF ERLVTKPHTA IVEGLRAKGV
TTPIIGFPRG AGTLVEAYAR TAPVQGVALD TQASAALGQA IQKTKAIQGA LDPLLLRAGG
DALLTRVDQL LEQWGHGPYI FNLGHGILPD TPIAHVESVL ARVTGK
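
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch, not the method the database used: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code for elongation codons. Only the first and last rows of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order.
# Assumption: table 11 uses the same amino acid assignments
# as the standard genetic code for these 64 codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence shown above.
first_row = "ATGACCTCCC TCCCGAAACC CCTCGTCCAG CCTAAGCTCC TTTCCGTCCT GGATGGCACG"
last_row = "GCCCGGGTCA CCGGCAAGTG A"

print(translate(first_row))  # MTSLPKPLVQPKLLSVLDGT
print(translate(last_row))   # ARVTGK*
print(round(gc_content(first_row), 1))
```

The two translated fragments correspond to the start (MTSLPKPLVQ PKLLSVLDGT) and end (ARVTGK) of the protein sequence above, with the trailing `*` standing for the TGA stop codon. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.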