Gene Information |
Locus tag | Caul_5016 |
Symbol | |
ID | 5902478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5419078 |
End bp | 5420118 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641565537 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_001686634 |
Protein GI | 167648971 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
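The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (inferred from the values in this record, not stated by it) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_5016.
length_bp = gene_length(5419078, 5420118)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```

Both results agree with the "Gene Length" and "Protein Length" fields listed above.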
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.739471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCCC TCCCGAAACC CCTCGTCCAG CCTAAGCTCC TTTCCGTCCT GGATGGCACG AGCGCCAAAC GCCCGCCGGT GTGGTTCATG CGCCAGGCTG GTCGCTACCT GCCGGAATAC CGGGCGGTGC GGGCGACGGA ACCGACCTTC ATCGATTTCT GTCTGAATCC GGAGAAGGCC GCCGAGGTCA CCCTGCAGCC GATGCGGCGG TTTCCCTACG ACGCGGCCAT CGTGTTCGCC GATATCCTGC TGATCCCCCA GGCGTTGGGC CAAAAGGTGT GGTTCGAAGC CGGCGAGGGA CCCAAGCTGG GCGAACTGCC GTCGATCGAG TCGATGCGCG AGCTGACGGG CCAGGCCGGC CAGGCGCTGG GGGCGGTGGG CGAGACCCTG AGCCGCGTGC GTTCGGTCCT GGAACCGGAG CGCGCCCTGA TCGGCTTCGC CGGCGCGCCG TGGACGGTGG CGACCTACAT GATCGAAGGC GGATCCAGCG ACCGCTCCGG CGCGCGGACC TTTGCCTATC AGCAGTCCGA CAAGCTTGAC GCCCTGATCC AGGTGCTGGT CGATGCGACG ATCGACTATC TGGCGATGCA GGCGGCGTCG GGCGCCCAGG TGCTAAAGCT GTTCGAGAGC TGGGCCGAGG GCCTGTCTGA GCCGCTGTTC GAGCGGCTGG TGACGAAACC GCATACGGCC ATCGTCGAGG GTCTGCGGGC GAAGGGTGTG ACCACCCCGA TCATCGGCTT CCCGCGCGGG GCGGGGACGC TGGTCGAGGC CTATGCCAGA ACCGCGCCGG TGCAGGGCGT GGCGCTGGAC ACCCAGGCCT CGGCGGCGCT GGGCCAGGCG ATCCAGAAGA CCAAGGCCAT CCAGGGCGCG CTGGATCCGT TGCTGCTGCG GGCCGGCGGC GACGCCCTGC TGACGCGGGT CGATCAGCTT CTGGAGCAAT GGGGTCACGG CCCCTACATC TTCAACCTGG GTCACGGCAT CCTGCCTGAT ACGCCGATCG CCCATGTCGA ATCGGTGCTG GCCCGGGTCA CCGGCAAGTG A
|
Protein sequence | MTSLPKPLVQ PKLLSVLDGT SAKRPPVWFM RQAGRYLPEY RAVRATEPTF IDFCLNPEKA AEVTLQPMRR FPYDAAIVFA DILLIPQALG QKVWFEAGEG PKLGELPSIE SMRELTGQAG QALGAVGETL SRVRSVLEPE RALIGFAGAP WTVATYMIEG GSSDRSGART FAYQQSDKLD ALIQVLVDAT IDYLAMQAAS GAQVLKLFES WAEGLSEPLF ERLVTKPHTA IVEGLRAKGV TTPIIGFPRG AGTLVEAYAR TAPVQGVALD TQASAALGQA IQKTKAIQGA LDPLLLRAGG DALLTRVDQL LEQWGHGPYI FNLGHGILPD TPIAHVESVL ARVTGK
|
| |
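The "GC content" field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as text in space-separated blocks of ten bases, as it is displayed above. The function name `gc_content` is hypothetical, and how this record rounds its reported percentage is not stated, so the comparison with the listed 68% is left to the reader.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the ten-base block spacing) is ignored.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGACCTCCC TCCCGAAACC"
print(gc_content(first_blocks))  # 60.0
```

Passing the full "Gene sequence" value to the same function gives a figure that can be compared with the GC content reported in the Gene Information table.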