Gene Caci_5094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5094 
Symbol 
ID8336448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5849697 
End bp5851457 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content71% 
IMG OID644958193 
Product3-ketosteroid-delta-1-dehydrogenase 
Protein accessionYP_003115795 
Protein GI256394231 
COG category[C] Energy production and conversion 
COG ID[COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0693824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.136582 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTGC ACGGAACCCC GGACCTCTCC CGCCGCCGCG TCCTTCAGGG CGCCGCCGCG 
GGCACCGCAG CTCTGTTCGG CGCGGGGGTG GCGACCGGCG CGAACGCCTA CGCCGATGCT
CCCGTCCTGG GCACGTACGA CGTCGTGGTG GTCGGCTGCG GCGCGGCCGG AATGACCGCC
GCGCTGACCG TGGCCGCACG CGGGCTGTCG GTGGTCGTCG TGGAGAAGGC GCCGACGTTC
GGCGGCTCGG CGGCGCGCTC GGGAGCCGGG ATCTGGCTCC CCAACAACTC GGTGATCCTG
GCCGCCGGCG TCCCCGACAC CCCGGCGCTG GCCGCGCAGT ACCTGGCCGC CGTGGTCGGC
GACGGCTCGA CCCCGGCGCG CCAGCACGCT TTCCTGAGCA CCGGACCGGC GATGCTGTCG
TTCGTCATGG CGCACAGTCC GTGCCGCTTC AAGTGGATGG ACGGCTACTC CGACTACTAT
CCGGAGCTTC CCGGCGGGAT GCCCGGCGGC CGCTCGATCG AGCCCGACGT GATCGACGGC
CACATCCTCG GCGCCGAACT CGCGCACCTG AACCCGGCCT ACCTCCCGGT CCCGGCCGGG
CTCGTGGTGT TCAGCCAGGA CTACAAGTGG ATCAACCTCG CCGCGGTCAA CGCCAAGGGC
ACGGCGGTCG CCGCCGAGGC TGCCGCTCGC GGAGCCGCCG CCGCGCTCGC CGGTCAGAAG
CCGCTGACGA TGGGCCAGTC GCTGGCCACC GGGCTGCGCG CCGGACTGAT GCAGGCCGGG
GTTTCGGTTC TGCTGAACAC GCCGCTGACG GACCTGAACA TCGTCAACGG ACGGGCGACC
GGCGTCGTGG TCACGCAGAA CGGCGCGCCC GGCGTGATCA CCGCGCGCCG AGGCGTCATC
GTCGGCTCCG GCGGCTTCGA GCACAACGCC GCGATGCGCG CGCAGTACCA GCAGCAGCCG
ATCGGGACGA CGTGGAGCGT CGGCGCCAAG GAGAACACCG GCGACGGCAT CCTGGCCGGG
CAGCGCGCCG GCGCCGCCTT GGCGCTGATG GACGACGCGT GGTGGGGTCC GACGATCCCC
AGCGGCGACG GGCCGTACTT CTGCCTGGCA GAACGGACGC TGCCCGGCGG GCTGATCGTC
AACCAGACCG GCCACCGGTT CGTCGACGAG GCGGCGCCGT ACGTGGACGT CGTGCACACG
ATGTACCGGC AGAACGCCAC GGCGGCGGAC ATCCCGTCCT GGCTGATCAT CGACCAGAAC
TTCCGCGACC GCTACGTCTT CCGGGACATC CTGCCGACCC TTCCGTTCCC CGCCTCGTGG
TACCAGAACG GCTCGGTCTA CAGGGACCTG ACCCTGTGGG GTCTGGCGAA CCAGATCGGA
GTCTCGCCCT CGACGCTGAC GAACACCGTC GCGCACTTCA ACGGACTGGC GATCACCGGC
AAGGACACCG ACTACGGCCG CGGCGTCAGC GTCTACGACC ACTACTACAC CGACCCGGCG
ATCTCCCCGA ACTCCTGCCT GGCGCCCCTG TGGCTCGCAC CGTTCTACGC ACTGAAGATC
GTCCCCGGCG ACCTCGGCAC GAAGGGCGGC ATGGTCACCG ACGAGCGAGC CCGCGTACTC
CGCGCCGACG GCTCGGTCAT CGGCGGCCTG TACGCGGCAG GCAACGCCAG CGCCGCCGTC
ATGGGACACA GCTACGCCGG CGCCGGCTCG ACGATCGGGC CCGCGATGAC CTTCGGGTAC
ATCGCGGGCA ATGACATCTA G
 
Protein sequence
MPVHGTPDLS RRRVLQGAAA GTAALFGAGV ATGANAYADA PVLGTYDVVV VGCGAAGMTA 
ALTVAARGLS VVVVEKAPTF GGSAARSGAG IWLPNNSVIL AAGVPDTPAL AAQYLAAVVG
DGSTPARQHA FLSTGPAMLS FVMAHSPCRF KWMDGYSDYY PELPGGMPGG RSIEPDVIDG
HILGAELAHL NPAYLPVPAG LVVFSQDYKW INLAAVNAKG TAVAAEAAAR GAAAALAGQK
PLTMGQSLAT GLRAGLMQAG VSVLLNTPLT DLNIVNGRAT GVVVTQNGAP GVITARRGVI
VGSGGFEHNA AMRAQYQQQP IGTTWSVGAK ENTGDGILAG QRAGAALALM DDAWWGPTIP
SGDGPYFCLA ERTLPGGLIV NQTGHRFVDE AAPYVDVVHT MYRQNATAAD IPSWLIIDQN
FRDRYVFRDI LPTLPFPASW YQNGSVYRDL TLWGLANQIG VSPSTLTNTV AHFNGLAITG
KDTDYGRGVS VYDHYYTDPA ISPNSCLAPL WLAPFYALKI VPGDLGTKGG MVTDERARVL
RADGSVIGGL YAAGNASAAV MGHSYAGAGS TIGPAMTFGY IAGNDI