Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4883 |
Symbol | |
ID | 8336237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5561031 |
End bp | 5564669 |
Gene Length | 3639 bp |
Protein Length | 1212 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957982 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003115584 |
Protein GI | 256394020 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0885799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTGA TCGGTTCATC GCGACGAGTG CTCACGCTGG GCGCCGTCGC CGCGTTGGCG GCGTCTGTCA TGGCCGCGAC ACCCGCACCC CCTGCGGCGG GTGCCGCGGG CAGCGGCGGG TCGACACCGA TCTACTTGCA AACCCGCTAC TCGTTCGCCG AGCGCGCCGC CGACCTCGTC TCGCGGATGA CGCTGCCGGA GAAGGTGGCA CAGCTGCACA CCAACAGCGC GCCGGCGATT CCGCGCCTGG GCGTGCAGTC GTATACCTAC TGGAGTGAAG GCCAGCACGG CATCAACCTG CTGGGTGCGG ACTCGAACAA CGGCGGTGCG GCCGGCGGAC CGCACGCCAC CAGCTTCCCG ACCAATCTGT CCTCGACGAT GTCCTGGGAT CCGGCCCTGG TCTACCAGGA GACGACGGCC ATCTCCGACG AGGTCCGCGG AGAACTGGAC AAGTCCTTGT GGGGCGTCGC CCAGAACAAC ATCGGCCCGT CTGCGGACGA CTACGGCTCC CTGACCTACT GGGCGCCAAC CGTCAACATG GACCGCGACC CGCGCTGGGG GCGCACCGAC GAAGCCTTCG GAGAGGACCC CTACCTGGTG GGGAAGATGG CCGGCGCGTT CGTCGCCGGC TACCAGGGTG AGACGATCGA CGGCACGCCG ACCAGCCCGT ATCTGAAGGT CGCGGCGACG GCCAAGCACT TCGCGCTCAA CAACAACGAG AACGACCGGC ACGCCGACTC CGCCGACGCC AGCGAGTCCG ACATCCGCGA CTACTACACC GCGCAGTTCC GCAGCCTGGT GGAGGACTCG CACGTAGCGG GCCTGATGAC GTCGTACAAC GCCATCAACG GAACCCCGTC GCCGGCCGAC ACCTACACCA CCGACGCGCT GGCGCAGCGC ACCTGGGGCT TCGACGGCTA CATCACCTCG GACTGCGGCG CGGTCGGCGA CGTCACCGCC TCCTCCAGCC ACGACTGGGC ACCGCCGGGA TGGACTGTCT CGGTCGTCAA CGGCACCAGT ACGTGGACGA ACACCGCGAC CGGCGTCCAG GTGCCGGCCG ACGCGGGCGG CCAGGCCTAC GCGCTGCGCG CCGGAACCGA TGCCAACTGC ACCGGCGGCG ACGCCACACT CGGCAACATC GAGGCCGCGA TCAAGGCGGA GATCCTCAGC GAGGGCGTGA TCGACCATGC CCTCGTCCAG CTGTTCACCG TGCGCATGCA GACCGGCGAG TTCGACCCGG CGAACAAGGT CGCCTACACC CGCATCACCA AGGCGCAGAT CCAAAGCCCT GAGCACCAGG CTCTAGCTGA GAAGGTGGCG GCCAACTCCC TGGTGCTGCT CAAGAACGAC CCGATGCCGG GCTCCGCAGC CAAGGTGCTG CCGGCGAACC CCGCGAGCCT GAACAACGTC GTGGTCGTCG GCGACCTGGC GAACACGGTC ACCCTCGGCG GGTACTCCGG TGACCCGACG CTCCAGGTGA ACGCCGTGCA GGGCATCACT TCCGCGGTCA AGGCGGCCAA CCCGAACGCC ACCGTCACCT TCGATGCCTG CGGCACCTCC ACGACCGCCA CCGCAGCGGC GTCCTGCTCC GCGGCGACCC AGGCCGCGAT CAAGACCGCG GACCTGGTCG TGGTGTTCGT CGGTACCGAC GGGAGCACCG CGGGGGAGAG CAACGACCGC GCGAGCCTGG CCATGCCCGG CAACTACGAC TCGCTGATCA GCCAGGTCGC CGCGCTCGGC AACCCCCGCA CCGTGCTGTC GATGCAGACC GACGGCCCGG TCGACATCGA GAACGTCAAG GGCGACTTCC CCGCCATCGT CTACAGCGCC TACAACGGCG AGAGCCAGGG CACCGCGCTG GCCGACGTCC TGTTCGGCAA GCAGAACCCG AGCGGGCACC TGGACTTCAC CTGGTACAAG GACGACTCGC AGCTGCCGTC GATCAAGAAC TACGGTCTGA ACCCGGCGGA CACCGGTGGC CTGGGCCGCA CCTACCAGTA CTTCACCGGC ACGCCGACCT ATCCCTTCGG CTACGGCCTG AGCTACACCG ACTTCGCCTA CTCCAAGGTC CAGGCGACCG ACCACGCCGA CGCTCAGGGC AAGGCCACCG TCCGGTTCGA CGTCACCAAC ACCGGCAAGA CGCCGGGAGC CACAGTCGCA CAGCTCTACA TCACCCCGCC CAGCGTGCCC GGAACGCAGC AGCCCGCTGA GCAGCTCGAA GGCTTTGCGA AGACGGCCGT CCTCAAGCCC GGCCAGACCC AGCACCTGTC CGTCTCCGTC AACATCGCCG ATCTGGCGAC CTGGGACGCG CAGAACGCGA AGAACGCCGT CACCGACGGC ACGTACACGC TGCGGCTCGC CACCGACGCC GCCGACACCG TCGCCTCCCG CCCGCTGCGG GTCACCGGGG CGATCAAGCC GCGGATCCAG TACGTGACGG TGCAACCCGA CCAGGTGGTC TTCACCCCCG GGAACACGCT CGACCTGACC GGGAAGAACC CGTGGATCGC GGACGACACC GCGCAGGCCG CGCAGCATCC GAGCGCGGAC AGCGTCGTCG AGGCGGTGAA CAACGACGAG TCCTTCGCCG ACCTGAGCAG CGAGCACGTC AAGTACAGCA GCAGCGATCC GGCGGTCGCC TCGGTCAGCC GCACCGGCGT CGTCACCACG CACGCCGTCG GCACCGCGAC GATCCGCGTG AGCGTCGACG GCGTCACCGG CTCGACGCCG ATCGCCGTGC ACGAGCCGTT CGCGATGAGC GCGCCGGCCG TCGCCGTGCC CGGCGGCAGC TTCACCGTCA CCACCACCTC GGCGAACCCG AGCGGCGGGG AGAAGCTGCG CAACGCCGCC TTCCACCTCA CCGTGCCCGC GGGCTGGACG GCCACCGCGA GCACTCCGGC GACGTTCCCC AGCGTGGCGG CCGGCCAGAC CATCACCACC ACCTGGACGG TCGGCGTCCC CGCGGACGCC AGCCCGACCG CGGACGCGCC GGCGCTGACC GCGCAGTACA CGTTCACCGA CGGCACCGGC ACCCACAGCG ACACGACCGG AGCCACCGTG TCGATCCCCT ATTCCTCGAT CGCCGCCGCC TCCACCAACC CCGGCGTCAG CGACGACTCC GACACCGCCG TCGGCAACCT CGACGGCGGC GGCGCCAGCT ACTCCGCGCA GACCCTCGCC ACGGCCACCC CGAGCATCAC CCCCGGCGGC ACGTTCACCC ACGACGGTCT GACGTTCACC TGGCCCGCCC CGGCGCCCGG CACGCCGGAC AACATCGTCG CCACCGGGCA GACCATCCCG GTCACCGGCA CCGGATCGAC CCTCGGAATC GTCGGCACGG CCGACTACGG AGCCGCCTCC GGCACCGCCG TGATCACCTA CACCGACGGC ACCACCCAGT CGTTCAGCCT CGCCTACAGC GACTGGTGGA CCAACGCGGC GGCATCCGGC GGCGACGTCC TGGCCACCTT CCCCTACCTG AACAACGCCA GCGGAGCGCT CCACAACCAG GTGAGCCTCT ACACCGACAC GGTCCCGCTG ACCCCGGGCA AGACCATCAA ATATCTGACG CTCCCGAACG TCGGCACGGC CCTGATCAAC CAGACGGCGA TGCACATCTT CGCGATCGCC GTCGGCTGA
|
Protein sequence | MTLIGSSRRV LTLGAVAALA ASVMAATPAP PAAGAAGSGG STPIYLQTRY SFAERAADLV SRMTLPEKVA QLHTNSAPAI PRLGVQSYTY WSEGQHGINL LGADSNNGGA AGGPHATSFP TNLSSTMSWD PALVYQETTA ISDEVRGELD KSLWGVAQNN IGPSADDYGS LTYWAPTVNM DRDPRWGRTD EAFGEDPYLV GKMAGAFVAG YQGETIDGTP TSPYLKVAAT AKHFALNNNE NDRHADSADA SESDIRDYYT AQFRSLVEDS HVAGLMTSYN AINGTPSPAD TYTTDALAQR TWGFDGYITS DCGAVGDVTA SSSHDWAPPG WTVSVVNGTS TWTNTATGVQ VPADAGGQAY ALRAGTDANC TGGDATLGNI EAAIKAEILS EGVIDHALVQ LFTVRMQTGE FDPANKVAYT RITKAQIQSP EHQALAEKVA ANSLVLLKND PMPGSAAKVL PANPASLNNV VVVGDLANTV TLGGYSGDPT LQVNAVQGIT SAVKAANPNA TVTFDACGTS TTATAAASCS AATQAAIKTA DLVVVFVGTD GSTAGESNDR ASLAMPGNYD SLISQVAALG NPRTVLSMQT DGPVDIENVK GDFPAIVYSA YNGESQGTAL ADVLFGKQNP SGHLDFTWYK DDSQLPSIKN YGLNPADTGG LGRTYQYFTG TPTYPFGYGL SYTDFAYSKV QATDHADAQG KATVRFDVTN TGKTPGATVA QLYITPPSVP GTQQPAEQLE GFAKTAVLKP GQTQHLSVSV NIADLATWDA QNAKNAVTDG TYTLRLATDA ADTVASRPLR VTGAIKPRIQ YVTVQPDQVV FTPGNTLDLT GKNPWIADDT AQAAQHPSAD SVVEAVNNDE SFADLSSEHV KYSSSDPAVA SVSRTGVVTT HAVGTATIRV SVDGVTGSTP IAVHEPFAMS APAVAVPGGS FTVTTTSANP SGGEKLRNAA FHLTVPAGWT ATASTPATFP SVAAGQTITT TWTVGVPADA SPTADAPALT AQYTFTDGTG THSDTTGATV SIPYSSIAAA STNPGVSDDS DTAVGNLDGG GASYSAQTLA TATPSITPGG TFTHDGLTFT WPAPAPGTPD NIVATGQTIP VTGTGSTLGI VGTADYGAAS GTAVITYTDG TTQSFSLAYS DWWTNAAASG GDVLATFPYL NNASGALHNQ VSLYTDTVPL TPGKTIKYLT LPNVGTALIN QTAMHIFAIA VG
|
| |