Gene Caci_6458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6458 
Symbol 
ID8337822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7449242 
End bp7450342 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content68% 
IMG OID644959557 
Productamidohydrolase 2 
Protein accessionYP_003117150 
Protein GI256395586 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0823861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.151509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACGC GTCGTCAAGC ACTGACCGGA TTGGGCGCAC TGGCGGTGAC CGGAGTCGGC 
ATGTCCCAGC TCACCCCTGC GATGGCGGCA CCGAAGGGAA CAAAGAGCAC ATCGGCGCAA
GCGGTGCGCT CAGCGGCCGC GGGCGGGCCC TTGGTGGGGG CGGTCGACGT TCACGCCCAC
TACCTCACAC CCACCTACCG CCAGGCCCTG ATCAACGCGG GCATCACCCA GCCGGACGGT
ATGCCGTCCA TCCCGCAGTG GAGCGCCGAC AGCGCGCTGG CCACGATGGA CACCACTGGT
ATCGCCGTGG CGATGCTGTC CGTTTCCTCG CCGGGATTCG ACTTCGGCGA GGCGGGCAAG
GTGAGTGACC TGGTCCGCCA GGTCAACGAG GAGGGCGCCG CGATCGTCAA AGCCCATCCC
ACCCGCTTCG GGCTGATGGC GTCGTTGCCG CTGCCGGACA TCAACGCCGC CGTCGCCGAG
GTGAACTACG CGTTCGATGT GCTGAAGGCC GACGGGATCG CCCTGGAGAC CAACTACGGC
GGCACCTACC TGGGGGACCC CTCTTTCAGC CCGGTCCTGG CCGAGTTGCA CAAGCGGAAT
GCCGTCGTCC ATCTGCACCC GACCTCGCCG GCCTGCTGGG AAGCCACGTC GCTCGGCGCA
CCCCGCCCCA TGATCGAGTT CCTCTTCGAC ACGACGCGGA CGATCACGCA GCTGATCCTC
GGGGGCGTCC TGCTGAAGTA CCCCGGCATC CGCTTCATCG TTCCCCACAC CGGCGCCGCG
CTGCCTGTCC TCGCCGACCG GATCTCCGCG TTCGACCTGA CTCAGCCTTC GCCGGTCGAT
GTCATCGGCG CGCTCAAGCG CCTGCACTAC GACGTCGCCG GCTTCGCTCT GCCTCGGGCG
CTGCCCGCGC TGCTCAATCT CGTCGGCCCG GAGACGCTTC TCTACGGCAG TGACTTCCCG
TTCACCGAGG ACCCCATCGT CAAGCTGCTG GCAGCACAGC TGGCGGGCAC CACCGTCCTG
ACGCCGCAGC AGAAGCAGGC CATGCTCAAC GGCAACGCCG CCGGACTCTT CCCACGGCTG
AAGAACATGG CGCGACTGTA G
 
Protein sequence
MATRRQALTG LGALAVTGVG MSQLTPAMAA PKGTKSTSAQ AVRSAAAGGP LVGAVDVHAH 
YLTPTYRQAL INAGITQPDG MPSIPQWSAD SALATMDTTG IAVAMLSVSS PGFDFGEAGK
VSDLVRQVNE EGAAIVKAHP TRFGLMASLP LPDINAAVAE VNYAFDVLKA DGIALETNYG
GTYLGDPSFS PVLAELHKRN AVVHLHPTSP ACWEATSLGA PRPMIEFLFD TTRTITQLIL
GGVLLKYPGI RFIVPHTGAA LPVLADRISA FDLTQPSPVD VIGALKRLHY DVAGFALPRA
LPALLNLVGP ETLLYGSDFP FTEDPIVKLL AAQLAGTTVL TPQQKQAMLN GNAAGLFPRL
KNMARL