Gene Caci_1417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1417 
Symbol 
ID8332756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1613593 
End bp1614642 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID644954565 
Productaldo/keto reductase 
Protein accessionYP_003112181 
Protein GI256390617 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.479026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0673448 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATACC GGCGGCTGGG TGACTCGGGG CTCGTCGTGC CCGCTTTAAG CTTCGGGGCG 
GGCACGTTCG GCGGGAAGGG GGCGCTGTTC AGCGCGTGGG GCGACACCGA CGCGCGCGAG
GCGCGGCGGC TGATCGACGT CAGTCTCGAC GCCGGGGTGA CGATGTTCGA CACCGCGGAC
GTCTACTCCG ACGGGGCGTC CGAGGAGGTC CTCGGCGCGG CGATCAAGGG CCGGCGGGAT
CAGGTACTGC TGTCGACCAA GGCCGGGCTG CCGGTCGGCG ACGGACCACA AGACGCGGGC
ACATCGCGTT CGCGGCTGGT CAAGGCGACG GAGAGCGCAC TGCGCCGACT CGGCACCGAC
TACATCGATC TGTTCCAGCT ACACGCCTTC GACGCCCGCA CCCCGGTTGA CGAGACGATC
TCAGCCCTCG ACGATCTGGT CCGCCAGGGC AAGATCCGCT ACGTCGGCGC GTCCAACTAC
TCAGGCTGGC AGCTGATGAA GTCACTGGCG GCAGCCGACC GCCTGCACGC CCCACGCTAC
GTGGCGCACC AGGTCTACTA CTCGCTCGTG GGGCGCGACT ACGAGTGGGA ACTGATGCCG
CTCGCCGCCG ATCAGGGCGT CGGCGCGCTG GTCTGGAGCC CGCTGGGATG GGGACGGCTG
ACCGGTCGGA TCCGGCGCGG GCAGGCGCTC CCAGCGGGAA GCCGACTGCA TCAGACTGCT
GACTTCGGCC CGCCGGTCGA CAACGAGTTG CTGTACGACG TGGTCGACGT CCTGGACCAG
ATCGCCGAGG AGACCGGCAA GGCGGTACCG CAGATCGCCA TCAACTGGCT CCTGCGCCGG
CCAACCGTCG CCTCAGTCAT CATCGGCGCC CGCAACGAGG AGCAGCTACA GCAGAACCTC
GGCGCCATCG GCTGGGAACT GACGGCGGAG CAGGTCGCGC GACTGGACGC GGCAAGCGGG
AAGGAGGCGC CGTATCCGTA CTTTCCTTAC GAGCGACAGG AGGCTTTCGC CCGGTTGAAC
CCGCCGATGT TTGGCAGCGC TCGGCTGTAG
 
Protein sequence
MEYRRLGDSG LVVPALSFGA GTFGGKGALF SAWGDTDARE ARRLIDVSLD AGVTMFDTAD 
VYSDGASEEV LGAAIKGRRD QVLLSTKAGL PVGDGPQDAG TSRSRLVKAT ESALRRLGTD
YIDLFQLHAF DARTPVDETI SALDDLVRQG KIRYVGASNY SGWQLMKSLA AADRLHAPRY
VAHQVYYSLV GRDYEWELMP LAADQGVGAL VWSPLGWGRL TGRIRRGQAL PAGSRLHQTA
DFGPPVDNEL LYDVVDVLDQ IAEETGKAVP QIAINWLLRR PTVASVIIGA RNEEQLQQNL
GAIGWELTAE QVARLDAASG KEAPYPYFPY ERQEAFARLN PPMFGSARL