Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6458 |
Symbol | |
ID | 8337822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 7449242 |
End bp | 7450342 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644959557 |
Product | amidohydrolase 2 |
Protein accession | YP_003117150 |
Protein GI | 256395586 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0823861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.151509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACGC GTCGTCAAGC ACTGACCGGA TTGGGCGCAC TGGCGGTGAC CGGAGTCGGC ATGTCCCAGC TCACCCCTGC GATGGCGGCA CCGAAGGGAA CAAAGAGCAC ATCGGCGCAA GCGGTGCGCT CAGCGGCCGC GGGCGGGCCC TTGGTGGGGG CGGTCGACGT TCACGCCCAC TACCTCACAC CCACCTACCG CCAGGCCCTG ATCAACGCGG GCATCACCCA GCCGGACGGT ATGCCGTCCA TCCCGCAGTG GAGCGCCGAC AGCGCGCTGG CCACGATGGA CACCACTGGT ATCGCCGTGG CGATGCTGTC CGTTTCCTCG CCGGGATTCG ACTTCGGCGA GGCGGGCAAG GTGAGTGACC TGGTCCGCCA GGTCAACGAG GAGGGCGCCG CGATCGTCAA AGCCCATCCC ACCCGCTTCG GGCTGATGGC GTCGTTGCCG CTGCCGGACA TCAACGCCGC CGTCGCCGAG GTGAACTACG CGTTCGATGT GCTGAAGGCC GACGGGATCG CCCTGGAGAC CAACTACGGC GGCACCTACC TGGGGGACCC CTCTTTCAGC CCGGTCCTGG CCGAGTTGCA CAAGCGGAAT GCCGTCGTCC ATCTGCACCC GACCTCGCCG GCCTGCTGGG AAGCCACGTC GCTCGGCGCA CCCCGCCCCA TGATCGAGTT CCTCTTCGAC ACGACGCGGA CGATCACGCA GCTGATCCTC GGGGGCGTCC TGCTGAAGTA CCCCGGCATC CGCTTCATCG TTCCCCACAC CGGCGCCGCG CTGCCTGTCC TCGCCGACCG GATCTCCGCG TTCGACCTGA CTCAGCCTTC GCCGGTCGAT GTCATCGGCG CGCTCAAGCG CCTGCACTAC GACGTCGCCG GCTTCGCTCT GCCTCGGGCG CTGCCCGCGC TGCTCAATCT CGTCGGCCCG GAGACGCTTC TCTACGGCAG TGACTTCCCG TTCACCGAGG ACCCCATCGT CAAGCTGCTG GCAGCACAGC TGGCGGGCAC CACCGTCCTG ACGCCGCAGC AGAAGCAGGC CATGCTCAAC GGCAACGCCG CCGGACTCTT CCCACGGCTG AAGAACATGG CGCGACTGTA G
|
Protein sequence | MATRRQALTG LGALAVTGVG MSQLTPAMAA PKGTKSTSAQ AVRSAAAGGP LVGAVDVHAH YLTPTYRQAL INAGITQPDG MPSIPQWSAD SALATMDTTG IAVAMLSVSS PGFDFGEAGK VSDLVRQVNE EGAAIVKAHP TRFGLMASLP LPDINAAVAE VNYAFDVLKA DGIALETNYG GTYLGDPSFS PVLAELHKRN AVVHLHPTSP ACWEATSLGA PRPMIEFLFD TTRTITQLIL GGVLLKYPGI RFIVPHTGAA LPVLADRISA FDLTQPSPVD VIGALKRLHY DVAGFALPRA LPALLNLVGP ETLLYGSDFP FTEDPIVKLL AAQLAGTTVL TPQQKQAMLN GNAAGLFPRL KNMARL
|
| |