Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1994 |
Symbol | |
ID | 8333337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 2254832 |
End bp | 2258455 |
Gene Length | 3624 bp |
Protein Length | 1207 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644955143 |
Product | glycoside hydrolase family 31 |
Protein accession | YP_003112755 |
Protein GI | 256391191 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCAG GCGCTCCCGG CACAGACCTC CCCGTTCCGC CGCCCCGCGG CGCCCGACGC CGCCGCCGAA AGTCGGCGCG GCTGGCGGCC GCCGTCGGCC TTCTCGCGCT GGCGGGTACA GCCCTGTACT CGCCCGCGGC GGCGACAGCC GCAACGGCTG CCAGCAGCGG AAAAGCGAAT GCGACCGCCG CCTCGACCCC CGTACTGGAC GGCAGCGCGC GGTTCGAAGT CCTCACCCCG ACGTTGATCC GCATGGAGTA CGCCGGGGAC AACCAGTTCC AAGATGCCGC GACTTTCAAC GCTGTAAACC GGTCGTTCCC CGCGGTCTCG TACACGACGT CGGTCTCCGG CGGGTACCGC GTCATCACCA CCAGCTCGAT CACCTTGCGC TACAAGGAGG GCAGCGGACC GTTCACCGCG GCGAACACCT CGATCACGGT CGCGGGCTCC AGCGCGACCG CCGCGCCGTC GTTCCCGTCC TACTGCGCGT TCGGCACGGC GTGCGAGGCT GAGGACGGCC TTCTCAGCTG GGGCGCGTCG GCCTCCTACG ACCACGCCAA CCACACCGGA TCCGGCTTCG TCGCCGGCTT CAACCAGGTC CACAGCGCGG TCCAGCAGGA TGTCTCGGCG GTTCCGTCCA GCGGGACCTA CCAACTCAGC GTCCGCTACG CCAACGCCAC CGGCGGCGAC GGAAAGAACG TCACCCGCAC CCTGTCGACC ACGGTCAACG GTGCGTCCGG ACCGACACTC TCCCTGCCAG TCACCGGTTC CTGGGACACG TGGTCGACGG TATCCGTGCC GGTCACGCTG AAGGCCGGCG TCAACACGGT CACCGTCGTG CAGAACGCCT CCGACAGCGC CAACGTGAAC CTCGACAGCC TGGCCGTCAC CGCCTCCGGC GCCGCCTACC CCGCGCCCGG CGCCTCCAGC GCGCTGCTGA CCACCGCCTA CGGCGCCGGC CCCACCGACA CCCTCGGCGG CTGGTCCGGC TCGCTGGACA ACCAGAACCC GGCCACTGCG ACAGAGCGGC CGGGGATCCT GGACCGCGAC GGCTGGTACC TGCTGGACGA CTCGCGCTCG GCGCTGCTGA ACGCGAACGG CACCGTGACC GACCGTCCCG CCCACGGCGG CCAGCCGTAT CAGGACGGGT ACTTCTTCGG CTACGGCACC AACTACAAGC AGGGTCTTTC CGATCTCAAT GCACTGACCG GCAGCGCGAA CCTCCTGCCG GAATCGGCTT ACGGCGTCTG GTATTCGCGC TACTACGCCT ACAGCGCCTC GGACTACGAG AACACGCTGC TGCCGACCTT CCGCAGCACG TTCACGCCGC TGGACTGGCT GGTCGTCGAT ACCGACTGGA AGGCGCCCAA CCAGTGGGAC GGCTGGAACT GGAACTCCTC CCTGTTCCCA GATCCCACCG GATTCATGAA CTGGACTAAA CAACAGGGCC TTGAGACGTC CTTGAACATC CACACCGCCA TCTCCGGCTC CGACCCGCAG TTCGCCGCCG CGAACAGCAC CGCCGGCGGC CTGTCCCCCG ACGTCACGCG CTCCGGCGAC TACGAGTTCG ACTGGTCGAA CCCGAACCAG CTCGCGGCGT ACTTCAACCT GCACAAGCCC TTCGAGCAGC AGGGCGTCCG CGAGTGGTGG CTGGACTACT GCGGCGGCTG CGGCAACTCC ACGGCCGGCG ACCCGCACGT CGCCCCGGGC AACTTCATCA ACCAGGCCTA CGCCCAGGAC GGCACCGCCC GCGGCCTGCG CGGCTTCTCC TTCGGCCGCA TCGGCTCCTC CTCGCAGGCC GGCGACAACG GCAACTACGC CCTCGGCCCG TGGTCCGAGC GGCGCAACAC CATGCAGTTC ACCGGCGACA CCGAGGCCAC CTGGGCCATG ATGGCGCTGG AGGCGCAGTT CGCCGGGGAC GAGGCGGCAG CCGGGATCAC CAACGTCAGC AACGACATCG GCAGCTTCCA CGGCAACCAC CTGGCCGACG ACATGTACGC GCGCTGGGTG CAGCTGGGCA CCTTCCAGCC GGTCGATCGC CTGCACTCCG ACCACGGCGA CCGGCTGCCG TGGAACTACG GCGCGGCGGC CGACGCCAGC AGCGAGCGGT TCCTGCGGCT GCGCGAAGCG CTGGTCCCGT ACACCTACAC GCTGGCCGAC CAGGCGCACA CCACCGGCGT GCCGATCATC AGGCCTCTGT ATCTCGACTA TCCCTCGAAC AACGAGGCGT ACACCTTCAA GCAGGAGTAC CTCTACGGCG ACAACGTCCT CGTCGCCCCG ATCACCACTC CCGACGACGC CAACGGAAAC GGTTCCGTCA GCGCGTGGAT CCCGCCGGGC ACCTGGACCG ACTACTTCAC CGGCACCAGC TACACCGGCC CGACCACCGT CACCATCACC GACCCGCTGT CGCAGATGCC AGTCCTGATC AAGAGCGGCG GCATCATGCC GACCCGCACC AACTACGTGA ACGACGCCAA CTCCTCGCCG CTGACCCAGG TGACGCTCTC CGTCGCCGCC GGCGCCGACG GCTCGTTCCC GCTCTACCAG GACGCCGGCG AAGGCAACGG CTACCAGAGC GGCCAGTCCA CCACGACCCC GATCTCCTGG AGCAACGCCT CCCGCACGCT GACCATCGGC GCCGACAGCG GCAGCTTCAC CGGCGAGGCG ACGCAGCGCT CGTACACGCT GCGGCTGTCC AACACCGTCG CCCCGACCGC CGTCTCCATC GACGGCACGC AGGTCCCCGA AACCGCGTGG GCCTACAACC CGAACGAGCG CACCACGACC GTGACCACCG CCGCGCTGCC CGTCGGCACG CAGCACACGA TCAGCCTGAC CGGCAGCGCC ACCGCCAACC CGGCCGCCGG CGAGGTCGTC GGCGACGCCG GCCTGTGCCT GGACACCCGC GGCGGCACCA CCGCCAACGG CACCGCGATG CAGCTGTACA CCTGCAACCA CACCGCCGGC CAGCAGGTCG CCTACACCCC CGGCGGCGCC CTGCAGGTCC TCGGCAAGTG CCTGGACGCC GCCAACGCCG GCACCGCCAA CGGAACGCTC ATCCAGCTCT ACGACTGCAA TTCCACCGGC TCGCAGAACT GGACGGCGCA GAGCAACGGC GAACTGATCA ATCCCCAATC AGGACGCTGT CTCACTGTCC CCGGTGGCAA CACCACCCCC GGCGCGGTCC AACTGCAACT CCAGGACTGC ACCGACGCCG CATCGCAGAT CTGGAAGCTC CCGCCCGGAC CGCTCAAGGG ACCCGGCGGG CTGTGCGCGG ACGTGGCCAA CGCCGATCCG TCCTACGCGA CCAGCGTCCA ACTCTGGGGC TGCAACCAGA GCGACGCCCA GCGCTGGTAC ACCCCAGGCG ACAGCACGAT CCGCGTCTTC GGCAAGTGCC TCGACGTGAC CAACGGCGGC ACCGCCAACG GCACCCACGT CCAGCTGTTC GACTGCAACG GCAGCGGATC GCAGAACTGG ACGACGCAGG CGAACGGATC GCTGGTCAAC CCGCAGTCCG GCCGCTGCCT CGACGACCCC AACAACACCG AGAAGGCAGG AGATCTGCTC GAGATCTACG ACTGCAACAA CTCCGCCGCA CAGCAGTTCA GTCTCGGCGG CTGA
|
Protein sequence | MKPGAPGTDL PVPPPRGARR RRRKSARLAA AVGLLALAGT ALYSPAAATA ATAASSGKAN ATAASTPVLD GSARFEVLTP TLIRMEYAGD NQFQDAATFN AVNRSFPAVS YTTSVSGGYR VITTSSITLR YKEGSGPFTA ANTSITVAGS SATAAPSFPS YCAFGTACEA EDGLLSWGAS ASYDHANHTG SGFVAGFNQV HSAVQQDVSA VPSSGTYQLS VRYANATGGD GKNVTRTLST TVNGASGPTL SLPVTGSWDT WSTVSVPVTL KAGVNTVTVV QNASDSANVN LDSLAVTASG AAYPAPGASS ALLTTAYGAG PTDTLGGWSG SLDNQNPATA TERPGILDRD GWYLLDDSRS ALLNANGTVT DRPAHGGQPY QDGYFFGYGT NYKQGLSDLN ALTGSANLLP ESAYGVWYSR YYAYSASDYE NTLLPTFRST FTPLDWLVVD TDWKAPNQWD GWNWNSSLFP DPTGFMNWTK QQGLETSLNI HTAISGSDPQ FAAANSTAGG LSPDVTRSGD YEFDWSNPNQ LAAYFNLHKP FEQQGVREWW LDYCGGCGNS TAGDPHVAPG NFINQAYAQD GTARGLRGFS FGRIGSSSQA GDNGNYALGP WSERRNTMQF TGDTEATWAM MALEAQFAGD EAAAGITNVS NDIGSFHGNH LADDMYARWV QLGTFQPVDR LHSDHGDRLP WNYGAAADAS SERFLRLREA LVPYTYTLAD QAHTTGVPII RPLYLDYPSN NEAYTFKQEY LYGDNVLVAP ITTPDDANGN GSVSAWIPPG TWTDYFTGTS YTGPTTVTIT DPLSQMPVLI KSGGIMPTRT NYVNDANSSP LTQVTLSVAA GADGSFPLYQ DAGEGNGYQS GQSTTTPISW SNASRTLTIG ADSGSFTGEA TQRSYTLRLS NTVAPTAVSI DGTQVPETAW AYNPNERTTT VTTAALPVGT QHTISLTGSA TANPAAGEVV GDAGLCLDTR GGTTANGTAM QLYTCNHTAG QQVAYTPGGA LQVLGKCLDA ANAGTANGTL IQLYDCNSTG SQNWTAQSNG ELINPQSGRC LTVPGGNTTP GAVQLQLQDC TDAASQIWKL PPGPLKGPGG LCADVANADP SYATSVQLWG CNQSDAQRWY TPGDSTIRVF GKCLDVTNGG TANGTHVQLF DCNGSGSQNW TTQANGSLVN PQSGRCLDDP NNTEKAGDLL EIYDCNNSAA QQFSLGG
|
| |