Gene Information |
Locus tag | Caci_6865 |
Symbol | |
ID | 8338231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7927021 |
End bp | 7930071 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644959954 |
Product | glycoside hydrolase family 31 |
Protein accession | YP_003117545 |
Protein GI | 256395981 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
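The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 7927021
end_bp = 7930071
gene_length_bp = 3051
protein_length_aa = 1016


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon (assumed included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the values listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the listed gene length and protein length are consistent with the listed coordinates.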
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATCCG ACTCTATGGG AGACGTGATG AAAGACTATT CGATCAAGCG CAGAACCGTC CTTTCCACGG CCGTCGGCGC GCTGGCCGTG AGCACCGTGA ACGCGGTCCC CGCCTTCGGG GCGAGCGCGG ATCAGCTCAC GGACTACGCC TCGCACACCA CTGACAGCCG TAGCATCACC GTGACCAGCA CGACCGGTCA GCAGCTGAGG ATCACCGCGT ACGGAGACCA GATCGTCAGG GTGCACGCGG TCCGTTCCGG GGAGAGCTTC TTCTCCGACA CCCGGTACGA GATGGTGGTC CCCGCGAACC ACACCTCCAT GGGCGGCAGC CTGACCGTGA CCGTCACCAC GGACACGATC GAGATGCACA CCGCCGCGGC GGACGGTCTG CGCATCGTCC TGCACCGCAA GCCCCTGAGG CTGGAGTTCT ACAACCGGGC CACCGGCGCG CTGCTGGCCA AGGAGGACGC GACCCGGGGC ATCACGTGGA GCGGCACCAA CTCCACCGTC GTGGCCGAGG CCTTCGTCCC CTCGTCCTCC GGTGAGCGCT TCCTGAAGGC CGGACACGGC ATCCTCGGGC GCGTGCCGTC ACTGGATCGC ACCGGTACCA CGGTCTCGGA GAACTACGCC GACGCCAACG CCGCCGCTCA CAACCCTCAG GAACAGGCGC CGGGCATCGT GCCGTTCTAC CTCTCCAACC TGGGGTATGG GGTGTTCTTC AACACCACCT TCGACACGAC CTTCACCTTC AACAGCAGCA ACGGGTACGG GTTCTCCGCC ACCGGGTACG GCGTCAGCGG CATCCGGCCC CAGGTCGACT ACTTCCTGAT CAACGGTCCC CAGTTCACGC AGCTCTTCGA CCGCTACACC CAGCTCACCG GCCGTCCGCG GCTGCCCCAG CGGTCCATCT TCGGGCTCCA CATGACCGAC CACAGCTTCC CCGACACCAG CGACGAGAAC TGGTGGCGTC AGAAGATCAC CCAGCACCGC GCGGCCGGCT TCCCGTTCGA CCACCAGGTC AACGACAACC GGTGGCGGGC CGGCTCCGGC GCCTGGTCCG GCTCGTATTT CGAGTTCAGC TCCGTCCGCT GGCCCGACCC CGCCGGCTAC GCGAAATGGG CCGCCACCAA CGGTGTCACC GTGACGCTGG ACTACAACCG CAACAACTCC GACCTCATGG AGAACTGGAA GGCGGGGCCG CCCCCCGGCT ACAGCTTCGC GTCGGCCGAC ATTTCCAGCG TGCCGCAGAA CAACGCCGTC CCCGACTGGT CCTACCCCGC CACCCGCGCC TGGGTGTGGA AGGTCTTCTG GGACAAGGCC CTCAACCCGA GCCTGAAGTA CCCCTGTGAC GGCCTGTGGA TCGACGAGAC CGACGAGATG GGCGGGATCC CGTACCCTGC GAAGATGGCC GACGGCCACA CGTGGGCCGA AGGGCGGAAC GCCTACCTGC TGAACCTGCA CAAGGGCATC GGCGAAGAAG GCTGGGACCC GGCCGGCAGC GGCCACATCG GCTCCGCGAA GCGCCCGTGG ACCTGGAGCC GCGGCGCCAC CGCGGGCCAG CAGCGCTACG GCCACTACTG GACCGGCGAC ATCCCCTCGA CCTACGACGA GATGCGCTCC CAGATCAGGG GCATGCTGAC GGCGGGCCTC GGCGGCTTCC CGTTCGCCAA CATCGACGGC GGCGGCTACG GCAACGGCAG CGTGATTTCC GACGCTTTCT ACCGCAACTG GCCGGTCGCG TGGTCCAGCC TCGCGCCGAT CTGGCGCCCG CACACCTCCG CCACGGTCCC GTCGAAGGGC 
ACGCTCGCCT CACGCTGGCC GCTCGACCAG GGCACGCAGG CGCAGGCGGA CTTCGCCCGG TACGGCCGGC TGCGCTACAC CCTGATGCCC TACATCTACT CGCTCGCCCA CCAGTCCGCC GCAACCGGTA TGCCGATGGC TCGGGCCATG GTGATCGACT ACCAGAGCCG CTCCCAGGCT TACACCCACG ACCTGCAGTA CATGTGGGGC CCTTCGCTGC TGGTCGCGCC CTGCACCAAC GACGGCGGGG CCGTCCAGCA GATCTGGCTG CCGGCCGGTT CGACCTGGTA CAACTTCTGG GCCGACATCA AGCACACCGG TTCCGACTCC GGGGACTTCG CCTACACCAC CCGCACCGGC GAGACTCCGT TGTTCGTCAA GGCGGGGGCG ATTCTGCCCA AGTACCCGTA CGCGCAGAGC GCCGCCTACT TCACCAAGCA GCAGCTTGAG ATGGATGTCT ACGCGGGGGC CGACGGCACC TTCTCAGTCA TCGAAGACGA CGGAGTGACC GAGTCCTATC GGAGCGGCGC CCAGAGCACC ACGCAGCTCA CCTACACCGA CGCGGCGACC CGCGTCGCTG TCGCCCATCC GCAGGGGACG TACGCGGGCG CGCCCACCAG CCGCCGCTAC ATCGTCCGCT TCCACGGATT GGCGAATCCG GTGGGGATGC GGGTCAACGG CGGGGCGACC CTGCCGGCCT TCACCAGCGA AGCCGCAGCG CTGATCAGCT CGGGTGGAGC CGGCAGCGTG TGGAACGCGT CTACGAAGGT CCTGAGCGTC GTCACCTCGC AGATAGCCGT GGTCGCGAAC GGCGGCACCG CCGCGACGGT CGAACCGAGC GGCGCCGCCT TCCCCGCCGT CAGCGGCGGC ACGGTCTACG AGGCCGAGAC GGCCCATCTC GACAGCGCGT TCATCATCGA CACCAGCCAC CCCGGCTACA CCGGGACCGG CTATGCCGAC TTCAACGGAT CGTCCTCGGG CCCCGGCATC AGCTGGACGG TCACGGCGGC CGCGGCGGGG AAGAAGCAAC TCTCGATCCG CTATGCCAAC GGGGGCACCA CGAACCGCCC GATGGCCGTC GCGGTCAACG GCACCACTGT CGCCACGCTC ACTATGGCGC CCACTGGTGC GTGGGACAGC TGGGCGACTG TGTCTTGTAC TGCCACGCTT CCGCAGAGTA CGACGATCAC TGTTCGAGCT ACGGTCACCA CGGCTAATGG GGCGAACATC GACAGCTTGA TTGTGGGGTA G
|
Protein sequence | MGSDSMGDVM KDYSIKRRTV LSTAVGALAV STVNAVPAFG ASADQLTDYA SHTTDSRSIT VTSTTGQQLR ITAYGDQIVR VHAVRSGESF FSDTRYEMVV PANHTSMGGS LTVTVTTDTI EMHTAAADGL RIVLHRKPLR LEFYNRATGA LLAKEDATRG ITWSGTNSTV VAEAFVPSSS GERFLKAGHG ILGRVPSLDR TGTTVSENYA DANAAAHNPQ EQAPGIVPFY LSNLGYGVFF NTTFDTTFTF NSSNGYGFSA TGYGVSGIRP QVDYFLINGP QFTQLFDRYT QLTGRPRLPQ RSIFGLHMTD HSFPDTSDEN WWRQKITQHR AAGFPFDHQV NDNRWRAGSG AWSGSYFEFS SVRWPDPAGY AKWAATNGVT VTLDYNRNNS DLMENWKAGP PPGYSFASAD ISSVPQNNAV PDWSYPATRA WVWKVFWDKA LNPSLKYPCD GLWIDETDEM GGIPYPAKMA DGHTWAEGRN AYLLNLHKGI GEEGWDPAGS GHIGSAKRPW TWSRGATAGQ QRYGHYWTGD IPSTYDEMRS QIRGMLTAGL GGFPFANIDG GGYGNGSVIS DAFYRNWPVA WSSLAPIWRP HTSATVPSKG TLASRWPLDQ GTQAQADFAR YGRLRYTLMP YIYSLAHQSA ATGMPMARAM VIDYQSRSQA YTHDLQYMWG PSLLVAPCTN DGGAVQQIWL PAGSTWYNFW ADIKHTGSDS GDFAYTTRTG ETPLFVKAGA ILPKYPYAQS AAYFTKQQLE MDVYAGADGT FSVIEDDGVT ESYRSGAQST TQLTYTDAAT RVAVAHPQGT YAGAPTSRRY IVRFHGLANP VGMRVNGGAT LPAFTSEAAA LISSGGAGSV WNASTKVLSV VTSQIAVVAN GGTAATVEPS GAAFPAVSGG TVYEAETAHL DSAFIIDTSH PGYTGTGYAD FNGSSSGPGI SWTVTAAAAG KKQLSIRYAN GGTTNRPMAV AVNGTTVATL TMAPTGAWDS WATVSCTATL PQSTTITVRA TVTTANGANI DSLIVG
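The relationship between the two sequences above can be spot-checked by translating the ends of the gene sequence. This is a minimal sketch using only the standard library. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. The helper name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 and last 21 nucleotides, copied from the Gene sequence field.
gene_prefix = "ATGGGATCCG ACTCTATGGG AGACGTGATG AAAGACTATT CGATCAAGCG CAGAACCGTC"
gene_suffix = "GACAGCTTGA TTGTGGGGTA G"

# Matching ends of the Protein sequence field.
protein_prefix = "MGSDSMGDVMKDYSIKRRTV"
protein_suffix = "DSLIVG"

print(translate(gene_prefix) == protein_prefix)
print(translate(gene_suffix) == protein_suffix)
```

The same function can be applied to the full gene sequence once its whitespace is removed. Only the two ends are checked here to keep the example short.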