Gene Information |
Locus tag | Caci_0460 |
Symbol | |
ID | 8331787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 521339 |
End bp | 524392 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644953626 |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003111253 |
Protein GI | 256389689 |
COG category | |
COG ID | |
TIGRFAM ID | |
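The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming one trailing stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length = gene_length_bp(521339, 524392)
print(length)                     # 3054
print(protein_length_aa(length))  # 1017
```

Both results agree with the Gene Length (3054 bp) and Protein Length (1017 aa) fields.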
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGACTCC GCACACCCCG GTTCCGCATC ACGGGCCCGA CCGCGACGCT GCTCGCGGGC CTGCTGGTCG CCAACGTGCT GACGCTGGCG ATACCGACCG CAGCGCACGC CGGCACGCCC TCGGCCACCG GCGAGTTCGA CTACGCCGAG GCGTTGCAGG ACTCGATGCT CTTCTACGAG TCGCAGCGCT CCGGCCCGCT CCCGGCCGAC AACCGCGTGT CCTGGCGCGG ACCGTCGGAC CTCACCGACG GCGCCGACCA CGGCCTGGAC CTGACCGGCG GCTACCACGA CGCCGGCGAC GAGGTGAAGT TCGGCCTGCC CGAGGCGTAC TCGATGACCG CGCTGGCCTG GGGAGCGATC GACGACAAAT CCGGGTACCA GAAGTCCGGC CAGTGGCAGT ACCTGGAACG CGACCTGCGG TGGGGCGACG ACTACATCAT CAAGGCGCAC CCCTCGCCGC ATGTGTTCTA CGGCCAGGTC GGCGACGGCA GCAGCGACCA CAGCTTCTGG GGACCGGCCG AGGTCAACCC CGAGCCGAGG CCTTCCTACG CGGTGACCGA GTCCTGCCCG GGCTCGGACC TGGTGGGCCA GGCGTCCGCG GCGATGGCCG CGTCCTCGAT CGTGTTCCAG ACCGATGATC CCTCGTACTC CGCGAAGCTG CTCGCGCAGG CGAAGTCCCT GTACGAGTTC GCCGACGACT ACCGCGGCAA GTACGACGCC TGCATCACCG GCGCTTCCAG CTTCTACACC TCGTTCAGCG GCTACTGGGA CGAGCTGGTG TGGGGCGCCA TCTGGCTCTA CAAGGCGACC GGCGACACCG CGTACCTGAC CAAGGCCGAG ACGTACTTCG CCAACCTGAA CAAGGCGAAT CAGACCACGA CGCCGGAATA CGCCTGGACG ATCAGCTGGG ACGACTCCTC GTACGCGTCG TACATCCTGC TCGCCGAGAT CACCGGCCAG CAGCAGTACA TCGACGACGC CGAGCGCAAC CTGGACTGGT TCACCACCGG CTACAACGGC CAGCACGTGA GCATGTCCCC CGGCGGTGAG GCGCAGGTCG ACGTCTGGGG CACGGCGCGC TACTCCGCGA ACGAGGCATA CCTGGCCCTG GACTTCGAGA ACTGGCTCAA ATCGCAGAGC CTTGATACGG CGAGGCAGGC CACGTACCAC GACTTCGCGG TGCGCCAGAT GAACTACATC CTCGGCGACA ACCCGAACAA GGAAAGTTAC GAGGTCGGCT TCACCAACGG CGGCACGAAC ACCGCCTGGC CGCAACAGAT CCACAACCGG CCCGCCCACG ACTCGTGGGA CCAGAGCATG AGCGACCCGC CGAACACCCG GCACCTGGAC TACGGCCTGC TGGTCGGCGG CCCGACCTCC GGCGACGGCT TCACCGACAG CCGCCAGAAC TACCAGCAGA CCGAGGGCGC GCTGGACTAC AACGCCCTGT TCTCCGGCGC GCTCGCCGAG CTGACCACCG AGTACGGCGG CACGCCGCGC GCGAACTTCC CGCCGACCGA GACGCCCGAC GGTCCCGAGG AGCTCATGCA GGCCTCGCCG AACCAGACCG GCTCGAACTT CATCGAGATC AAGGCCGAGG TGGTCAACAA GTCCGGCTGG CCGGCCCGGC ACCTGACCAA CGGCTCGTTC CGCTACTACT TCACCCTCGA CGCCGGCGAG ACCGCCTCGC AGCTCCAGCT GACCTCGCCC TACAGCCAGT GCAACGCGCC GGGCCCGATC ACGCAGTACT CCGGCTCGAC CTACTACGTG ACGATCAGCT GCGCCGGCGA CGACGTCGCC 
CCGGCCGGGC AGTCGCAGTT CCACCGCGAG GTGCAGTTCC GCATCACCTT CCCGGCGGCG CACGACTACA CCAAGGACTG GTCCTACCAG GACCTGGTGG GCATGGCGAC CAACTCGACG CCCGTGAACA CCAGCCACAT CGAGCTCTAC GACGGCAGCA CGAAGGTGTG GGGCACCGCC CCGGGCAGCG GCACGCCGGT CACCCCGCCC GGAACGCCCG GGACGCCGAC CGCCTCGGCG ATCACGGCCA CCGGCGCGAC GCTGGCGTGG GCCGCCTCGA CTCCGGGCAC GAACGCGGTG GCCGGCTATG ACGTGTACAG CGTGTCGGGC GCGACCTCGA CCAAGGTGGC GTCCTCGACC ACGACGTCGG CGCCGCTGAC CGGGCTGACG CCGGGCACGG CGTACACGTT CGACGTCGTG GCGCGGGACA GTGCGGGCAA CCAGTCCCCG GCATCGCCGA CGGTCGCGGT GACCACCACG TCCTCGTCCG CGACGCCGCC GAGCGCCCCG ACCGCGCTGA CGGTCACCGC GACCGGGTCC ACGAGCGTCG GGCTGAGCTG GACCGCCGCC AAGGCGGGGA CGTCGGCGAT CGCGAGCTAC ACCGTCTACA AGACCGGTTC GCCGGCGACC GCGGTCGCGA CCGTGGCAGC TCCGGCGATC ACCGCGACTG TCAGTGGCCT GACGCCCTCG ACGGTGTACC AGTTCTATGT CGTGGCGACC GATGTCAATG GCCTGAGTTC CGCTGCGTCA TCGAGCGTAT CGGCGACCAC CACAGCGGGC ACTCCGCCGC CGCCGTCGTC CGTCTCGGTG CAGTACGAGA CCAGCGTCAC CACAGCCACC ACGCAGTCGT TCCAACCATT GCTGAACGTG GTCAACAACG GGACCAGCGC GGTGCCGCTG TCATCGGTGA CCATCCGCTA CTGGTTCACC TCCGACGGCG GCTCCAGCAC CTTCGCCACG AACTGCTGGT ACGCCGTCAT CGGCTGCGCG AAGGTCACGC AGTCAGTGTC CTCGGTAACC GCGACAACCG GCGCCGACCA CTACGTCCAG GTCGGCTTCA CCACCGCGGC CGGCAGCCTC GCCCCCGGCG CCTCGACCGG CCAGGTCCAA AGCGCGATCA ACAAGAGCGA CTGGTCGAAC TTCACCCAGA CCAACGACTA CAGCTTCAAC GCAGCCGACA CGGCGTGGAC GGCGAACACG AACGTCACCG TCTACGTCAA CGGAACGCTG GTCTGGGGAA CCGAACCGCA CTGA
|
Protein sequence | MRLRTPRFRI TGPTATLLAG LLVANVLTLA IPTAAHAGTP SATGEFDYAE ALQDSMLFYE SQRSGPLPAD NRVSWRGPSD LTDGADHGLD LTGGYHDAGD EVKFGLPEAY SMTALAWGAI DDKSGYQKSG QWQYLERDLR WGDDYIIKAH PSPHVFYGQV GDGSSDHSFW GPAEVNPEPR PSYAVTESCP GSDLVGQASA AMAASSIVFQ TDDPSYSAKL LAQAKSLYEF ADDYRGKYDA CITGASSFYT SFSGYWDELV WGAIWLYKAT GDTAYLTKAE TYFANLNKAN QTTTPEYAWT ISWDDSSYAS YILLAEITGQ QQYIDDAERN LDWFTTGYNG QHVSMSPGGE AQVDVWGTAR YSANEAYLAL DFENWLKSQS LDTARQATYH DFAVRQMNYI LGDNPNKESY EVGFTNGGTN TAWPQQIHNR PAHDSWDQSM SDPPNTRHLD YGLLVGGPTS GDGFTDSRQN YQQTEGALDY NALFSGALAE LTTEYGGTPR ANFPPTETPD GPEELMQASP NQTGSNFIEI KAEVVNKSGW PARHLTNGSF RYYFTLDAGE TASQLQLTSP YSQCNAPGPI TQYSGSTYYV TISCAGDDVA PAGQSQFHRE VQFRITFPAA HDYTKDWSYQ DLVGMATNST PVNTSHIELY DGSTKVWGTA PGSGTPVTPP GTPGTPTASA ITATGATLAW AASTPGTNAV AGYDVYSVSG ATSTKVASST TTSAPLTGLT PGTAYTFDVV ARDSAGNQSP ASPTVAVTTT SSSATPPSAP TALTVTATGS TSVGLSWTAA KAGTSAIASY TVYKTGSPAT AVATVAAPAI TATVSGLTPS TVYQFYVVAT DVNGLSSAAS SSVSATTTAG TPPPPSSVSV QYETSVTTAT TQSFQPLLNV VNNGTSAVPL SSVTIRYWFT SDGGSSTFAT NCWYAVIGCA KVTQSVSSVT ATTGADHYVQ VGFTTAAGSL APGASTGQVQ SAINKSDWSN FTQTNDYSFN AADTAWTANT NVTVYVNGTL VWGTEPH
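The relationship between the gene sequence and the protein sequence can be checked with a minimal translation sketch. It assumes the gene sequence is shown in the coding direction (it begins with ATG and ends with TGA although the gene lies on the minus strand), and it uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names `clean`, `translate` and `gc_fraction` are illustrative only.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Translation table 11 shares these assignments with the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-direction DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last three blocks of the gene sequence above.
print(translate("ATGAGACTCC GCACACCCCG GTTCCGCATC"))  # MRLRTPRFRI
print(translate("GTCTGGGGAA CCGAACCGCA CTGA"))        # VWGTEPH
```

The two printed fragments match the first and last residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields.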