Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3169 |
Symbol | |
ID | 8334522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3499087 |
End bp | 3500127 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644956315 |
Product | 2OG-Fe(II) oxygenase |
Protein accession | YP_003113918 |
Protein GI | 256392354 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.000235169 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGACGC ACGAGCTCGG CACCGATCCC GGTCTTCCGG TCGATCCCGG TCTGCCGGTC GTCGACCTGT CGAAGGCCGA CGGGTCGGAG GCGGAGCGGG CCGCGCTGCA CGAGGCGCTG CGCACGGCGG CCACCGAGGT CGGCTTCTTC CAGCTGGTCG GGCACGGCAT CACCGAGAGT GAGACCGCCG CGCTCAACGA CGCGATGCGC TCCTTCTTCG CCCTGCCGCA CGCCGACCGC CTGGCGGTCA GCAACCTCAA CTCCCCGCAC TTCCGCGGCT ACACCGGCAC CGGCGACGAG CAGACCGCCG GCGCGCGGGA CTGGCGGGAC CAGATGGACA TCGGGCTGGA GCTGCCGCCG CACGTGCCGG GCGCCGGGGA GCCGGCGTAC TGGTGGCTGC AAGGGCCCAA CCAGTGGCCG GCGAGGCTGC CCCAGTTGCG GGCGGCGACG CTGGGCTGGA TCGACAAGCT CAGCGCGATC TCCCGGCGGC TGCTGCACGA GCTGCTGGCC TCGATCGGCG CGCGCCCGGA TTTCTACGAC GCCGCCTTCG CCGGGCATCC GCATCTGCGG CTGAAGCTGG TGCGCTATCC CGGTACCGCT CCGGACGGCG CGGGTCAGGG CGTCGGGATG CACAAGGACT ACGGCTTCAT CACGCTGTTG CTGCAGGACT CGGTCGGCGG GCTGCAGGTG GCGCGGGCGG ACGGGACGTT CCTGGACGTG CCGCCGATGC CGGGGGCGTT CGTGGTGAAC CTCGGCGAGC TGCTGGAGGT GGCGACCGAC GGGTATCTGA AGGCGACGAG CCACCGGGTG GTCAGCCCCC CGAGGGCGCG GGAGCGGTTC TCGGTGCCGT TCTTCTTCAA CCCGCGGCTG GACGCGCACA TCGAGCCGCT GGAGTTCCCG CACGCGCACC ACGCGCCCGG CGCCGACGAC GATCCGTCGA ACCCGCTGTA CGCGGAGTTC GGACGCAACG AGCTGAAGGG GTATCTGCGG GCGCATCCGG AGGTGACGAG GAAGTTCCAT CCGGATCTGG CGACGGTGTA G
|
Protein sequence | MVTHELGTDP GLPVDPGLPV VDLSKADGSE AERAALHEAL RTAATEVGFF QLVGHGITES ETAALNDAMR SFFALPHADR LAVSNLNSPH FRGYTGTGDE QTAGARDWRD QMDIGLELPP HVPGAGEPAY WWLQGPNQWP ARLPQLRAAT LGWIDKLSAI SRRLLHELLA SIGARPDFYD AAFAGHPHLR LKLVRYPGTA PDGAGQGVGM HKDYGFITLL LQDSVGGLQV ARADGTFLDV PPMPGAFVVN LGELLEVATD GYLKATSHRV VSPPRARERF SVPFFFNPRL DAHIEPLEFP HAHHAPGADD DPSNPLYAEF GRNELKGYLR AHPEVTRKFH PDLATV
|
| |