Gene Caci_4948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4948 
Symbol 
ID8336302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5650237 
End bp5651217 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content70% 
IMG OID644958047 
Productaldo/keto reductase 
Protein accessionYP_003115649 
Protein GI256394085 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0576267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCC GGACGATCGC GGGCACGCTG CTGGCCCTGA CCGAGCTCGG GTTCGGGGGA 
TCGGTCATCG GGAACCTGTA CCGGCCGGTC TCCGACGACG ATGCCGAAGC CGCGGTCGCC
GCGGCCTGGG ACGCCGGGAT CCGCTCCTTC GACACCGCGC CGCACTACGG ACTCGGGCTG
TCCGAACGCC GCCTCGGCGC CGTGCTGAGC GACCACCCTC GAGACGAGTA TGTGCTGTCG
TCCAAAGTCG GGCGGCTGCT CGTCCCCAAC GACTACCCCA CCGGCCGTGA CAGCGACGGT
TTCGCCGTCC CGGACACAGT CCGCAGGAAG TGGGACTTCA GCAAGGACGG CGTCCTGCGC
TCCATCGAAG CGAGCCTGGA CCGCCTCGGG ACGGACCGCC TCGACATCGT CTACCTGCAC
GATCCCGACG ACCACTGGCA GCAGGCCGCC GACGAGGCCA TGCCCACCCT CGCGCAGCTG
CGCGACGAGG GCGTCGTCGG CGCGATCGGC GCCGGCATGA ACCAGTCCGC GATGCTCACC
CGCTTCCTGC GGGAGACCGC CGCGGACGTG GTCATGCTGG CCGGCCGGTA CACCCTGCTC
GACCAGAGCG CCCTGGAGGA CGTGCTGCCG GCCGCGATCG AGCAGGGGAA GAGCGTCGTC
GCGGTCGGCG TCTTCAACTC CGGTCTGTTG GCGGGCGACC GGCCGGGCGC CGGGATGAAG
TACGACTACG GCGACGCGCC CGCAGACCTC GTCGACCGCG CCCGGATGAT CGCCGAGGTC
TGCGAAGCCC ATGGGACGAC GCTGCCCGCC GCGGCCATCG CCTTTCCGTT CACCCACCCC
GCTGTCGTCA ACGTCACGCT CGGAATGCGG ACCTCTCGGC AGGTGGCGCG CAACATCGAG
CTCCACCGGT CGAGCGTCCC CCAAGCCTTG TTCGCCTACC TGGGGTCGCT GGGACTGATC
ACCTACAGCG AAGGATCGTA G
 
Protein sequence
MNRRTIAGTL LALTELGFGG SVIGNLYRPV SDDDAEAAVA AAWDAGIRSF DTAPHYGLGL 
SERRLGAVLS DHPRDEYVLS SKVGRLLVPN DYPTGRDSDG FAVPDTVRRK WDFSKDGVLR
SIEASLDRLG TDRLDIVYLH DPDDHWQQAA DEAMPTLAQL RDEGVVGAIG AGMNQSAMLT
RFLRETAADV VMLAGRYTLL DQSALEDVLP AAIEQGKSVV AVGVFNSGLL AGDRPGAGMK
YDYGDAPADL VDRARMIAEV CEAHGTTLPA AAIAFPFTHP AVVNVTLGMR TSRQVARNIE
LHRSSVPQAL FAYLGSLGLI TYSEGS