Gene Caci_4192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4192 
Symbol 
ID8335546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4746271 
End bp4747287 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content69% 
IMG OID644957295 
Productaldo/keto reductase 
Protein accessionYP_003114897 
Protein GI256393333 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.957663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.276407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATACA CGACCTTCGG GAACACCGGC CTGCGCGTTT CAGAGGCCTT CCTCGGCACG 
ATGGGCTTCG GCGAGGACTG GGGCTGGGGC GTCTCCGTCG AGGACTGCCG GAAGATCTTC
ACCGCCTACG CCGAGGCCGG CGGCAACGTC ATAGACACCG CGAACCGCTA CACCGACGGC
TCCAGCGAGC GCATCGTCGG CGAACTGCTC GGCACGGACC GCGACCGCTT CGTCCTGGCA
ACGAAATACA CCCTGAGCAT GGACGACACC GACCCCAACG CTGCGGGCAA CCACCGCAAG
AACCTGCGAC GCTCGGTCGA GGACAGCCTG AGCCGCCTGA ACACCGACTA CCTGGACGTG
CTCTGGGTCC ACATCTGGGA CGCGCACACC CCGCTGGAGG AGACCATGCG CGCCCTGGAC
GACCTCGTCC GCTCCGGCAA GGTCCTCTAC CTCGGCCTGT CCGACGCCCC CGCCTGGGTC
GCCGCCCGTG CCCAGACCAT GGCCGAACTC CGCGGCTGGA CCCCCTTCGC AGGCCTGCAA
CTCAACTACA GCCTGCTGGA ACGCGGCATC GAGCGCGAAC TCCTGCCCAT GGCCGAATCC
CTGAACCTCT CAGTCGCCGC CTGGGCACCC CTGGCCCGCG GCGTCCTCAC CGGCAAGTTC
ACCCGCCACG GCGCCACCGA AGGCTCTCGC ACCAGCCGCG ACAAACTGAC TGAGCACGAC
CTGCACATCG CCGCCACCCT CGACGCGGTA GCCGACGACC TCGGCATCAC TTCCTCCCAA
GCCGCCGTGG CTTGGACCCG CGCCCACCAC CGCTGGATCC ACCCCATCAT CGGCGCCCGC
ACCGTCGACC AGCTGAACGA CAGCGTCGCA GCCCTCGACG TCCGCCTCCC CGCCGACGCG
GTGCGGCGCC TGGAGGAAGC GACGTCGTTC GACCTCGGCT TCCCCCAGGA ATTCATCGCC
GAGGCACGGG AGTTCGTCTA CGGACCGGGC ATCGAGCGGT TCGAGCCGCG CCGGTGA
 
Protein sequence
MRYTTFGNTG LRVSEAFLGT MGFGEDWGWG VSVEDCRKIF TAYAEAGGNV IDTANRYTDG 
SSERIVGELL GTDRDRFVLA TKYTLSMDDT DPNAAGNHRK NLRRSVEDSL SRLNTDYLDV
LWVHIWDAHT PLEETMRALD DLVRSGKVLY LGLSDAPAWV AARAQTMAEL RGWTPFAGLQ
LNYSLLERGI ERELLPMAES LNLSVAAWAP LARGVLTGKF TRHGATEGSR TSRDKLTEHD
LHIAATLDAV ADDLGITSSQ AAVAWTRAHH RWIHPIIGAR TVDQLNDSVA ALDVRLPADA
VRRLEEATSF DLGFPQEFIA EAREFVYGPG IERFEPRR