Gene Caci_3169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3169 
Symbol 
ID8334522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3499087 
End bp3500127 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content71% 
IMG OID644956315 
Product2OG-Fe(II) oxygenase 
Protein accessionYP_003113918 
Protein GI256392354 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.000235169 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGACGC ACGAGCTCGG CACCGATCCC GGTCTTCCGG TCGATCCCGG TCTGCCGGTC 
GTCGACCTGT CGAAGGCCGA CGGGTCGGAG GCGGAGCGGG CCGCGCTGCA CGAGGCGCTG
CGCACGGCGG CCACCGAGGT CGGCTTCTTC CAGCTGGTCG GGCACGGCAT CACCGAGAGT
GAGACCGCCG CGCTCAACGA CGCGATGCGC TCCTTCTTCG CCCTGCCGCA CGCCGACCGC
CTGGCGGTCA GCAACCTCAA CTCCCCGCAC TTCCGCGGCT ACACCGGCAC CGGCGACGAG
CAGACCGCCG GCGCGCGGGA CTGGCGGGAC CAGATGGACA TCGGGCTGGA GCTGCCGCCG
CACGTGCCGG GCGCCGGGGA GCCGGCGTAC TGGTGGCTGC AAGGGCCCAA CCAGTGGCCG
GCGAGGCTGC CCCAGTTGCG GGCGGCGACG CTGGGCTGGA TCGACAAGCT CAGCGCGATC
TCCCGGCGGC TGCTGCACGA GCTGCTGGCC TCGATCGGCG CGCGCCCGGA TTTCTACGAC
GCCGCCTTCG CCGGGCATCC GCATCTGCGG CTGAAGCTGG TGCGCTATCC CGGTACCGCT
CCGGACGGCG CGGGTCAGGG CGTCGGGATG CACAAGGACT ACGGCTTCAT CACGCTGTTG
CTGCAGGACT CGGTCGGCGG GCTGCAGGTG GCGCGGGCGG ACGGGACGTT CCTGGACGTG
CCGCCGATGC CGGGGGCGTT CGTGGTGAAC CTCGGCGAGC TGCTGGAGGT GGCGACCGAC
GGGTATCTGA AGGCGACGAG CCACCGGGTG GTCAGCCCCC CGAGGGCGCG GGAGCGGTTC
TCGGTGCCGT TCTTCTTCAA CCCGCGGCTG GACGCGCACA TCGAGCCGCT GGAGTTCCCG
CACGCGCACC ACGCGCCCGG CGCCGACGAC GATCCGTCGA ACCCGCTGTA CGCGGAGTTC
GGACGCAACG AGCTGAAGGG GTATCTGCGG GCGCATCCGG AGGTGACGAG GAAGTTCCAT
CCGGATCTGG CGACGGTGTA G
 
Protein sequence
MVTHELGTDP GLPVDPGLPV VDLSKADGSE AERAALHEAL RTAATEVGFF QLVGHGITES 
ETAALNDAMR SFFALPHADR LAVSNLNSPH FRGYTGTGDE QTAGARDWRD QMDIGLELPP
HVPGAGEPAY WWLQGPNQWP ARLPQLRAAT LGWIDKLSAI SRRLLHELLA SIGARPDFYD
AAFAGHPHLR LKLVRYPGTA PDGAGQGVGM HKDYGFITLL LQDSVGGLQV ARADGTFLDV
PPMPGAFVVN LGELLEVATD GYLKATSHRV VSPPRARERF SVPFFFNPRL DAHIEPLEFP
HAHHAPGADD DPSNPLYAEF GRNELKGYLR AHPEVTRKFH PDLATV