Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0118 |
Symbol | hemC |
ID | 5897830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 129164 |
End bp | 130120 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641560602 |
Product | porphobilinogen deaminase |
Protein accession | YP_001681754 |
Protein GI | 167644091 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.522497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCGGC AACCCATCCG CATCGGCGCG CGCGGCTCCA AGCTGTCGCT GGCCCAGTCT GGCCTGATGC AGGCCCGAAT CGCCGCCGCC CTCGGCGCCG GTCCTGGCGA CGACATCGAC GCCTTCGCCC AACTGATTCC GATCGTCACC AGCGGCGACC GCATCCAGGA CCGCCGGCTG ATGGAGATCG GCGGCAAGGG GCTATTCACC AAGGAGATCG AGGAGGCCCT GCTCGACGGC CGCATCGATT GCGCGATCCA TTCGCTCAAG GACATGCCGG CCGAGTTGCC GCCCGGGCTG GTGCTGGCCG CCGTGCCGGA ACGCGAGGAT CCTCGCGACG CCTTCATCAG CCATGTCGCC GAGCGGCTGG AGGACCTGTC CAAGGGCGCG CGCCTGGGCA CGGCCTCGCT GCGCCGCCAG GCCCAGGCCC TGCATGTGCG GCCCGACCTC GAGATCGTCA TGCTGCGCGG CAATGTCGAC ACCCGCCTGG CCAAGCTGGA GCGCGGCGAG GCCGACGCCA TCCTGCTGGC TCAGTCGGGC CTCAACCGCC TGGGCCTGGG TCACATCACC AACAGCTGGC TGGATCCCCT GGCCGCCCCG CCCGCCCCGG GCCAGGGCGC CCTGGTCATC GAGACCCGCG CCGAGGATGT CGACCTGCCC TGGCTGCAAG CCGTGCGCTG CCAGGCCACG ACCCTGGCCG TCGCCGCCGA ACGCGGCGCG CTCTACGCCC TGGAAGGCTC GTGCCGCACG GCGGTCGGGG CCCATGCCCG GCTGGACGGC CTGATCCTGA CGATGATCGT CGAGGCCCTG ACCCCGGACG GCGTCCAGCG TTTCCGCCGC GAGGGCTCGG CGACGCTGTC CAGCCTCGAC GCCGCCGATC AGGCCCGGGC GCTGGGGCTG GAGTTGGGCG GCGCGGTGCG GGCCGAGGGC GGTCCGGCCC TGATCCTGAC CGAGTAG
|
Protein sequence | MSRQPIRIGA RGSKLSLAQS GLMQARIAAA LGAGPGDDID AFAQLIPIVT SGDRIQDRRL MEIGGKGLFT KEIEEALLDG RIDCAIHSLK DMPAELPPGL VLAAVPERED PRDAFISHVA ERLEDLSKGA RLGTASLRRQ AQALHVRPDL EIVMLRGNVD TRLAKLERGE ADAILLAQSG LNRLGLGHIT NSWLDPLAAP PAPGQGALVI ETRAEDVDLP WLQAVRCQAT TLAVAAERGA LYALEGSCRT AVGAHARLDG LILTMIVEAL TPDGVQRFRR EGSATLSSLD AADQARALGL ELGGAVRAEG GPALILTE
|
| |