Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0861 |
Symbol | |
ID | 8332191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 999428 |
End bp | 1001356 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644954011 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003111635 |
Protein GI | 256390071 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.025002 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCTCA TCCGTGTGCG TGTCCTGATC GCGCTGCTGG CCCTGCTGAC CTGCGCGGTA CTGCTCCCGG GGTCCGCGCG CGCCGCCTCG CGACTCCCCT CAGCCGCTGC CGTTCCCGCC GGCTCGCTCG CCGCGTCCTG GACCGGTCCG CTGAGCACCA GCGGCCGCTA CGTCGTCGAC GCGAACGGCA ACCGCTTCAA GCTGATCGGC GGCAACTGGG ACGGCGCGCA GGGCCACTGG CTCGGCAGCG GCTCGGCGAC CGATCCGGCG CAGAACCACG CCGGCGAGGT GTCCTACAAC GTCCCGCTGG CCCTGGACCG CAAGCCGATC CCGCAGATCC TGGCGGACTT CCACAGCCTG GGCATCAACA CCATCCGTCT GCCGTTCGCG GACGCGATGA TCCACGACAC CTCGACCGTC CCGGACGCCG CCGTCACCGC CAACCCGCAG CTGCGCGGAC TGACCGCGCT CCAGGTCTAC GACGCGGTCG TCAGCGCCCT GACGGGCGAC GGCTTCGCGG TGATCCTGAA CAACCACACC ACGAGCTACC GCTGGTGTTG CGGCCTGGAC GGCAACGAGC GCTGGAACAG CGGACAGAGC ACGCAGCAGT GGGAGTCCGA CTGGCTGTTC ATGGTGAACA GGTACAGGGC GAACAAGCGC GTCGTCGGCG CGGACCTGCG CAACGAAGTC CGGCGCGACA CCTGGGACGA CCCGAACTGG GGCTGGTACG ACGCCCACGA CGAATACGCC GCCTTCGAGG AGGCCGGCAA CCAGATCCTG GCCGCGGACC CGGACATGCT GATCGTCATG GAGGGCATCA ACTGGTACGG CATCCCCGCC GCCGGCTTCT CCCACGGCCG CCCGATGCTC ACCCCGGCCG CGAACCTCTC CGCCACCCTG ATCGCCTCGA ACAAGCTGGT GTACTCGGCG CACTTCTACA GCTACACCGG CCCGAACAAC TCCGGCGCCG CCGCGGGCTC GGCCGGCTCG ACCAGCGACC CGCGCTACGA GGACATGACC CCGGACCAGC TGGCGTCGGC GGTGAACCAG GAGGCCCTGT TCGTCACCCA ATCCGGCCAG CACTTCACCG CACCGGTCTG GGTCAGCGAA TTCGGCGCCG CCGGACGCGG CGAGACCGAC ACCAAGGAAC AGACCTGGCT CGACACCTTC ACCACCATCC TGGCCGCCAA CGACACCGAC TTCGCCATCT GGCCGCTGAT CGGCTACACC GCCACCAACG GAACCCTCCA GGACAACTGG GCCCTCCTGT CCTACGACCC CGCCGGCAAC CGCACCAGCA TCACCGACCC CGGCGACTGG CGCCTCCCCG ACTGGCAGAA GCTGACCTCC GCCCCCACCA CGACCGGCCA CATCCCCGCC TCCCCCCACT GGAACATGCT CGACCTGGAC CACGCCGACT ACAACGTCTC CACCACGATG CTGGCCCAAC CCGACTGGTC CCCCGGCAAC CGCAAGGGCA ACTGCCCCGA CACCGAACGC CTGACAGGCC TAGGCCGCGG CAGCAGCCGA GGACTATGCA CAGACTCCTC AGAACCGACC AAGAGCACAG CCACCCAGAC CGTCGTCACC AACGAGACCT ACGTAACCGA AGGCGACTGG GCACCCGGCT ACACCAAGCT CCAATGCCCC GACAACACCT TCGCCACCGG CTACAGCGTC CACAACAACG CCATGGCAGC CCTACTGTGC GCCCCCGCCG CAGCGCCTCT ACCCACCACC AGCCACACCA TCTGGTTCAA CCAAGGCGAC AACCGCCCAA CCACCGGCGG CTCCACACCC TCCGACTGGG CACCCGGCTC CTACAAGGGC CAGTGCCCCG ACAACGAGTA CTTAGCGGGC ATCGCGTACA CCTGGCAGCG CGCTGAAGGC GGCGTTCCGG ATGCGCTGCT GTGTCGGGCC CTGGTCTGA
|
Protein sequence | MSLIRVRVLI ALLALLTCAV LLPGSARAAS RLPSAAAVPA GSLAASWTGP LSTSGRYVVD ANGNRFKLIG GNWDGAQGHW LGSGSATDPA QNHAGEVSYN VPLALDRKPI PQILADFHSL GINTIRLPFA DAMIHDTSTV PDAAVTANPQ LRGLTALQVY DAVVSALTGD GFAVILNNHT TSYRWCCGLD GNERWNSGQS TQQWESDWLF MVNRYRANKR VVGADLRNEV RRDTWDDPNW GWYDAHDEYA AFEEAGNQIL AADPDMLIVM EGINWYGIPA AGFSHGRPML TPAANLSATL IASNKLVYSA HFYSYTGPNN SGAAAGSAGS TSDPRYEDMT PDQLASAVNQ EALFVTQSGQ HFTAPVWVSE FGAAGRGETD TKEQTWLDTF TTILAANDTD FAIWPLIGYT ATNGTLQDNW ALLSYDPAGN RTSITDPGDW RLPDWQKLTS APTTTGHIPA SPHWNMLDLD HADYNVSTTM LAQPDWSPGN RKGNCPDTER LTGLGRGSSR GLCTDSSEPT KSTATQTVVT NETYVTEGDW APGYTKLQCP DNTFATGYSV HNNAMAALLC APAAAPLPTT SHTIWFNQGD NRPTTGGSTP SDWAPGSYKG QCPDNEYLAG IAYTWQRAEG GVPDALLCRA LV
|
| |