Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4225 |
Symbol | |
ID | 8335579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4791619 |
End bp | 4793274 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957328 |
Product | Carbohydrate-binding CenC domain protein |
Protein accession | YP_003114930 |
Protein GI | 256393366 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3469] Chitinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0637754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAGGA ATCTCAGTTC TCGCCGACGA CAACTTCTCG CCTCCGCCGG GGCGGCGGTC GGCGCGCTCG CGCTCGCGGG AGCAGCCGCG GGCCTCGCGA CGAACGCGAC GGCCGCCACG GGCGCGCCCG CCGCGGCGCC GGCGGCGTCC GGCAACCTGC TGACCAACCC CGGGTTCGAA ACCGGGGACT TGTCCGGTTG GACGTGTGAC GCCGGTACAG GAGCCGTCGT CGCCTCGCCC GTGCGCTCCG GAACCCATTC GCTGGCCGGC ACGCCGTCGG GGTCGGCCGA CGCGCAGTGC ACGCAGACCG TCTCGGTGCA GCCGAACACG GCGTACACGT TCTCAGGGTA CGTCGAAGGG TCCTACGTCT ACATCGGCGT GACCGGCGGC ACCTCGACGT GGACGCCGTC GGCGACCAGC TGGCAGCAGC TGTCGGTGGC GTTCACGACC ACCGCGTCGC AGACCTCTGT CCAGGTCTAC GTCCACGGCT GGTACGGGCA GCCGGTGTAC CACGCCGACG ACCTGGCGCT GACCGGCCCG GCCGGACCGC CGCCGACGAC TCCGCCGACG ACTCCGACCA CACCGCCCAC GACGCCGTCC ACGACGCCGA CGACCGTGCC GACGTCGAGC ACGCCCACGA CGCCGACCAG CTCCACGCCG AGCACCCCGC CGAGCAGCTC CTCCTCCGCG CCGGGCGGCA CGACCTGCCC GGTCAAGTCG CGCCCGGCGG GCAAGGTGAT CCAGGGCTAC TGGGAGAACT GGGACGGCGC GCTCAACGGC GTGCACCCCG GGCTCGGCTG GATCCCGATC AACGACCCGC GCATCCAGCA GCACGGATAC AACGTGATCA ACGCCGCGTT CCCGGTCATC CTGTCCGACG GCACCGCCGA ATGGCAGGAC GGCATGGACA CCAACGTCAA GGTCGACACC CCGGCCGACT ACTGCGCGGC GAAGGCCTCC GGCGCGACGA TCCTGATGTC GATCGGCGGC GCGGCCGCGG GCATCGACCT CAACTCCAGC ACGGTCGCCG ACAAGTTCGT CGCCACCATC GTGCCGATCC TGAAGGCGTA CAACTTCGAC GGCATCGACA TCGACATCGA GACCGGCCTG ACCGGCAGTG GCAGCATCAA CACCCTGTCG ACCTCGCAGG CCAACCTCGA GCGCATCATC GACGGCATCC TCGCCCAGAT GCCCTCGAAC TTCGGCCTGA CCATGGCGCC GGAGACCGCC TACGTGACCG GCGGCAGCGT CACGTACGGC TCGATCTGGG GCTCGTACCT CCCGATCATC AAGAAGTACA TGGACAACGG CCGCCTGTGG TGGCTGAACA TGCAGTACTA CAACGGCAGC ATGTACGGCT GCTCCGGCGA CTCCTACCAG GCCGCGACAG TCCAGGGCTT CCAGGTCCAG ACCAACTGCC TGAACAGCGG CCTGACCATC CAGGGCACCA CCATCAAGGT CCCCTACGAC CACCAGGTCC CAGGCCTGCC CGCCCAACCC GGCGCCGGCG GCGGCTACAT GACCCCGTCC CTGGTCTCCC AGGCATGGAG CAGCGTCGGC GGCCAAGTCA AGGGCCTGAT GACCTGGTCG GTCAACTGGG ACGGCTCCCT GGGCTGGACC TTCGGGAACA ACGTGAAGGG GTTGGAGGGA CGCTGA
|
Protein sequence | MLRNLSSRRR QLLASAGAAV GALALAGAAA GLATNATAAT GAPAAAPAAS GNLLTNPGFE TGDLSGWTCD AGTGAVVASP VRSGTHSLAG TPSGSADAQC TQTVSVQPNT AYTFSGYVEG SYVYIGVTGG TSTWTPSATS WQQLSVAFTT TASQTSVQVY VHGWYGQPVY HADDLALTGP AGPPPTTPPT TPTTPPTTPS TTPTTVPTSS TPTTPTSSTP STPPSSSSSA PGGTTCPVKS RPAGKVIQGY WENWDGALNG VHPGLGWIPI NDPRIQQHGY NVINAAFPVI LSDGTAEWQD GMDTNVKVDT PADYCAAKAS GATILMSIGG AAAGIDLNSS TVADKFVATI VPILKAYNFD GIDIDIETGL TGSGSINTLS TSQANLERII DGILAQMPSN FGLTMAPETA YVTGGSVTYG SIWGSYLPII KKYMDNGRLW WLNMQYYNGS MYGCSGDSYQ AATVQGFQVQ TNCLNSGLTI QGTTIKVPYD HQVPGLPAQP GAGGGYMTPS LVSQAWSSVG GQVKGLMTWS VNWDGSLGWT FGNNVKGLEG R
|
| |