Gene Caci_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1994 
Symbol 
ID8333337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2254832 
End bp2258455 
Gene Length3624 bp 
Protein Length1207 aa 
Translation table11 
GC content69% 
IMG OID644955143 
Productglycoside hydrolase family 31 
Protein accessionYP_003112755 
Protein GI256391191 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCAG GCGCTCCCGG CACAGACCTC CCCGTTCCGC CGCCCCGCGG CGCCCGACGC 
CGCCGCCGAA AGTCGGCGCG GCTGGCGGCC GCCGTCGGCC TTCTCGCGCT GGCGGGTACA
GCCCTGTACT CGCCCGCGGC GGCGACAGCC GCAACGGCTG CCAGCAGCGG AAAAGCGAAT
GCGACCGCCG CCTCGACCCC CGTACTGGAC GGCAGCGCGC GGTTCGAAGT CCTCACCCCG
ACGTTGATCC GCATGGAGTA CGCCGGGGAC AACCAGTTCC AAGATGCCGC GACTTTCAAC
GCTGTAAACC GGTCGTTCCC CGCGGTCTCG TACACGACGT CGGTCTCCGG CGGGTACCGC
GTCATCACCA CCAGCTCGAT CACCTTGCGC TACAAGGAGG GCAGCGGACC GTTCACCGCG
GCGAACACCT CGATCACGGT CGCGGGCTCC AGCGCGACCG CCGCGCCGTC GTTCCCGTCC
TACTGCGCGT TCGGCACGGC GTGCGAGGCT GAGGACGGCC TTCTCAGCTG GGGCGCGTCG
GCCTCCTACG ACCACGCCAA CCACACCGGA TCCGGCTTCG TCGCCGGCTT CAACCAGGTC
CACAGCGCGG TCCAGCAGGA TGTCTCGGCG GTTCCGTCCA GCGGGACCTA CCAACTCAGC
GTCCGCTACG CCAACGCCAC CGGCGGCGAC GGAAAGAACG TCACCCGCAC CCTGTCGACC
ACGGTCAACG GTGCGTCCGG ACCGACACTC TCCCTGCCAG TCACCGGTTC CTGGGACACG
TGGTCGACGG TATCCGTGCC GGTCACGCTG AAGGCCGGCG TCAACACGGT CACCGTCGTG
CAGAACGCCT CCGACAGCGC CAACGTGAAC CTCGACAGCC TGGCCGTCAC CGCCTCCGGC
GCCGCCTACC CCGCGCCCGG CGCCTCCAGC GCGCTGCTGA CCACCGCCTA CGGCGCCGGC
CCCACCGACA CCCTCGGCGG CTGGTCCGGC TCGCTGGACA ACCAGAACCC GGCCACTGCG
ACAGAGCGGC CGGGGATCCT GGACCGCGAC GGCTGGTACC TGCTGGACGA CTCGCGCTCG
GCGCTGCTGA ACGCGAACGG CACCGTGACC GACCGTCCCG CCCACGGCGG CCAGCCGTAT
CAGGACGGGT ACTTCTTCGG CTACGGCACC AACTACAAGC AGGGTCTTTC CGATCTCAAT
GCACTGACCG GCAGCGCGAA CCTCCTGCCG GAATCGGCTT ACGGCGTCTG GTATTCGCGC
TACTACGCCT ACAGCGCCTC GGACTACGAG AACACGCTGC TGCCGACCTT CCGCAGCACG
TTCACGCCGC TGGACTGGCT GGTCGTCGAT ACCGACTGGA AGGCGCCCAA CCAGTGGGAC
GGCTGGAACT GGAACTCCTC CCTGTTCCCA GATCCCACCG GATTCATGAA CTGGACTAAA
CAACAGGGCC TTGAGACGTC CTTGAACATC CACACCGCCA TCTCCGGCTC CGACCCGCAG
TTCGCCGCCG CGAACAGCAC CGCCGGCGGC CTGTCCCCCG ACGTCACGCG CTCCGGCGAC
TACGAGTTCG ACTGGTCGAA CCCGAACCAG CTCGCGGCGT ACTTCAACCT GCACAAGCCC
TTCGAGCAGC AGGGCGTCCG CGAGTGGTGG CTGGACTACT GCGGCGGCTG CGGCAACTCC
ACGGCCGGCG ACCCGCACGT CGCCCCGGGC AACTTCATCA ACCAGGCCTA CGCCCAGGAC
GGCACCGCCC GCGGCCTGCG CGGCTTCTCC TTCGGCCGCA TCGGCTCCTC CTCGCAGGCC
GGCGACAACG GCAACTACGC CCTCGGCCCG TGGTCCGAGC GGCGCAACAC CATGCAGTTC
ACCGGCGACA CCGAGGCCAC CTGGGCCATG ATGGCGCTGG AGGCGCAGTT CGCCGGGGAC
GAGGCGGCAG CCGGGATCAC CAACGTCAGC AACGACATCG GCAGCTTCCA CGGCAACCAC
CTGGCCGACG ACATGTACGC GCGCTGGGTG CAGCTGGGCA CCTTCCAGCC GGTCGATCGC
CTGCACTCCG ACCACGGCGA CCGGCTGCCG TGGAACTACG GCGCGGCGGC CGACGCCAGC
AGCGAGCGGT TCCTGCGGCT GCGCGAAGCG CTGGTCCCGT ACACCTACAC GCTGGCCGAC
CAGGCGCACA CCACCGGCGT GCCGATCATC AGGCCTCTGT ATCTCGACTA TCCCTCGAAC
AACGAGGCGT ACACCTTCAA GCAGGAGTAC CTCTACGGCG ACAACGTCCT CGTCGCCCCG
ATCACCACTC CCGACGACGC CAACGGAAAC GGTTCCGTCA GCGCGTGGAT CCCGCCGGGC
ACCTGGACCG ACTACTTCAC CGGCACCAGC TACACCGGCC CGACCACCGT CACCATCACC
GACCCGCTGT CGCAGATGCC AGTCCTGATC AAGAGCGGCG GCATCATGCC GACCCGCACC
AACTACGTGA ACGACGCCAA CTCCTCGCCG CTGACCCAGG TGACGCTCTC CGTCGCCGCC
GGCGCCGACG GCTCGTTCCC GCTCTACCAG GACGCCGGCG AAGGCAACGG CTACCAGAGC
GGCCAGTCCA CCACGACCCC GATCTCCTGG AGCAACGCCT CCCGCACGCT GACCATCGGC
GCCGACAGCG GCAGCTTCAC CGGCGAGGCG ACGCAGCGCT CGTACACGCT GCGGCTGTCC
AACACCGTCG CCCCGACCGC CGTCTCCATC GACGGCACGC AGGTCCCCGA AACCGCGTGG
GCCTACAACC CGAACGAGCG CACCACGACC GTGACCACCG CCGCGCTGCC CGTCGGCACG
CAGCACACGA TCAGCCTGAC CGGCAGCGCC ACCGCCAACC CGGCCGCCGG CGAGGTCGTC
GGCGACGCCG GCCTGTGCCT GGACACCCGC GGCGGCACCA CCGCCAACGG CACCGCGATG
CAGCTGTACA CCTGCAACCA CACCGCCGGC CAGCAGGTCG CCTACACCCC CGGCGGCGCC
CTGCAGGTCC TCGGCAAGTG CCTGGACGCC GCCAACGCCG GCACCGCCAA CGGAACGCTC
ATCCAGCTCT ACGACTGCAA TTCCACCGGC TCGCAGAACT GGACGGCGCA GAGCAACGGC
GAACTGATCA ATCCCCAATC AGGACGCTGT CTCACTGTCC CCGGTGGCAA CACCACCCCC
GGCGCGGTCC AACTGCAACT CCAGGACTGC ACCGACGCCG CATCGCAGAT CTGGAAGCTC
CCGCCCGGAC CGCTCAAGGG ACCCGGCGGG CTGTGCGCGG ACGTGGCCAA CGCCGATCCG
TCCTACGCGA CCAGCGTCCA ACTCTGGGGC TGCAACCAGA GCGACGCCCA GCGCTGGTAC
ACCCCAGGCG ACAGCACGAT CCGCGTCTTC GGCAAGTGCC TCGACGTGAC CAACGGCGGC
ACCGCCAACG GCACCCACGT CCAGCTGTTC GACTGCAACG GCAGCGGATC GCAGAACTGG
ACGACGCAGG CGAACGGATC GCTGGTCAAC CCGCAGTCCG GCCGCTGCCT CGACGACCCC
AACAACACCG AGAAGGCAGG AGATCTGCTC GAGATCTACG ACTGCAACAA CTCCGCCGCA
CAGCAGTTCA GTCTCGGCGG CTGA
 
Protein sequence
MKPGAPGTDL PVPPPRGARR RRRKSARLAA AVGLLALAGT ALYSPAAATA ATAASSGKAN 
ATAASTPVLD GSARFEVLTP TLIRMEYAGD NQFQDAATFN AVNRSFPAVS YTTSVSGGYR
VITTSSITLR YKEGSGPFTA ANTSITVAGS SATAAPSFPS YCAFGTACEA EDGLLSWGAS
ASYDHANHTG SGFVAGFNQV HSAVQQDVSA VPSSGTYQLS VRYANATGGD GKNVTRTLST
TVNGASGPTL SLPVTGSWDT WSTVSVPVTL KAGVNTVTVV QNASDSANVN LDSLAVTASG
AAYPAPGASS ALLTTAYGAG PTDTLGGWSG SLDNQNPATA TERPGILDRD GWYLLDDSRS
ALLNANGTVT DRPAHGGQPY QDGYFFGYGT NYKQGLSDLN ALTGSANLLP ESAYGVWYSR
YYAYSASDYE NTLLPTFRST FTPLDWLVVD TDWKAPNQWD GWNWNSSLFP DPTGFMNWTK
QQGLETSLNI HTAISGSDPQ FAAANSTAGG LSPDVTRSGD YEFDWSNPNQ LAAYFNLHKP
FEQQGVREWW LDYCGGCGNS TAGDPHVAPG NFINQAYAQD GTARGLRGFS FGRIGSSSQA
GDNGNYALGP WSERRNTMQF TGDTEATWAM MALEAQFAGD EAAAGITNVS NDIGSFHGNH
LADDMYARWV QLGTFQPVDR LHSDHGDRLP WNYGAAADAS SERFLRLREA LVPYTYTLAD
QAHTTGVPII RPLYLDYPSN NEAYTFKQEY LYGDNVLVAP ITTPDDANGN GSVSAWIPPG
TWTDYFTGTS YTGPTTVTIT DPLSQMPVLI KSGGIMPTRT NYVNDANSSP LTQVTLSVAA
GADGSFPLYQ DAGEGNGYQS GQSTTTPISW SNASRTLTIG ADSGSFTGEA TQRSYTLRLS
NTVAPTAVSI DGTQVPETAW AYNPNERTTT VTTAALPVGT QHTISLTGSA TANPAAGEVV
GDAGLCLDTR GGTTANGTAM QLYTCNHTAG QQVAYTPGGA LQVLGKCLDA ANAGTANGTL
IQLYDCNSTG SQNWTAQSNG ELINPQSGRC LTVPGGNTTP GAVQLQLQDC TDAASQIWKL
PPGPLKGPGG LCADVANADP SYATSVQLWG CNQSDAQRWY TPGDSTIRVF GKCLDVTNGG
TANGTHVQLF DCNGSGSQNW TTQANGSLVN PQSGRCLDDP NNTEKAGDLL EIYDCNNSAA
QQFSLGG