Gene Caci_3661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3661 
Symbol 
ID8335014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4094527 
End bp4095702 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content62% 
IMG OID644956801 
ProductEpoxide hydrolase domain protein 
Protein accessionYP_003114404 
Protein GI256392840 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.543395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAT CGACGTCCGC ATCCTCAGAC TCACCCTCAG ACCTGGTCCG TCCGTTCACC 
GTCGCGATCT CCGACGCCGA GATCGAGGAC CTGAAGCAGC GGCTGGCCAG GACGCGCTGG
CCGAATCCGG AGACCGTCCC CGACTGGTCG CAGGGAGTCC GCCTGGAGAA CGCCAGATCG
CTCGTCGACT ACTGGGAGCG AGAATACGAC TGGCGCCGAT TCGAAGCGGA ACTTAATAGT
TTTCCCCATT TCCTGACCAC GATCGATGGG CTCGACATTC ACTTCATTCA TGTCAAGTCC
AAGAATCCGA ATGCGATGCC TCTGATCTTG ACGCACGGCT GGCCGGGGTC GATCGTCGAA
TTCCTGAAAC TGATCGGCCC GCTGACCGAC CCGGTGTCCT TCGGAGGAAC CATCGAAGAT
TCCTTCGACG TCGTCATCCC GTCGCTGCCC GGGTTTGGGT TCAGTCAAAA GCCGACCGAT
ACGGGCTGGA CTGTTTCCCG TATCGCAGGC GCGTGGGCGG AACTCATGAA GCGTCTTGGC
TATACGAGCT GGGCTGCTCA AGGCGGCGAT TGGGGCGCGG TCGTTACTAC CGCCCTCGGA
GCGATGCAGC CTGAGGGCCT TCTCGGGATT CACTTGAACA CTCAATACGC TTTCCCTGCG
CAGATACCTG ACACGCTGTC GCCCGAAGAG CGCTACGCCG TGGACACCCT CGCGCATTAC
CTCGGTGATC TCGGCGGATC CAACCACCTT CAGGGCACGA AGCCGGAGAC CGTCGGCATC
GCTCTCGCGG ACTCCCCGGC CGGGCAAGCC GCCTGGATCT ACGAAAAATT CCAATCCAAG
ACGGACAATC AGGGACTCGC CGAACAGGCT ATCGGCATCG ACGACATGCT CGATGCGATA
TCTCTGTACT GGTTCACCAA CAGCGCCGCG TCGTCCGCCC GCATCTACTG GGAGAACAAG
GCGAGCAGCA TGGCCGGCCC GAAGCTGGCG CTGCCCGTGG CGGTGACGGT CTTCCCCCGC
GACATCCCGC GCCTTCCGCG AACCTGGATC GAAGACACCT ACACGAACCT GATCCACTAC
GGCGAGGCTG CCCAGGGCGG ACACTTCGCA GCATTGGAAC AGCCCGAGAT TTTGATCGGC
GAAATCCGCG CCGGCCTCAG GAGCCTCCGT TCCTGA
 
Protein sequence
MTASTSASSD SPSDLVRPFT VAISDAEIED LKQRLARTRW PNPETVPDWS QGVRLENARS 
LVDYWEREYD WRRFEAELNS FPHFLTTIDG LDIHFIHVKS KNPNAMPLIL THGWPGSIVE
FLKLIGPLTD PVSFGGTIED SFDVVIPSLP GFGFSQKPTD TGWTVSRIAG AWAELMKRLG
YTSWAAQGGD WGAVVTTALG AMQPEGLLGI HLNTQYAFPA QIPDTLSPEE RYAVDTLAHY
LGDLGGSNHL QGTKPETVGI ALADSPAGQA AWIYEKFQSK TDNQGLAEQA IGIDDMLDAI
SLYWFTNSAA SSARIYWENK ASSMAGPKLA LPVAVTVFPR DIPRLPRTWI EDTYTNLIHY
GEAAQGGHFA ALEQPEILIG EIRAGLRSLR S