Gene Information |
Locus tag | Caci_6685 |
Symbol | |
ID | 8338049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 7703200 |
End bp | 7704801 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644959779 |
Product | cellulose-binding family II |
Protein accession | YP_003117372 |
Protein GI | 256395808 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.713256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGCCA GACAGAGGCT GAGACCGCGG CCAAGACTGC GGGCGGCCCT CGGCGCGACG GCCGCCGTGG CGCTCATGGC CGGTTCGGCG ACCGCGGCGG TGCTCGTACC CGGGACCGCG AGCGCCGCGA CCACCGTGCA GTGTCAGGTC ACGTATACGG TCGCCAACGA CTGGGGTTCC GGGTTCACCA CGAACATCAC GATAGCCAAC GTCGGCACGA GCGCGTGGAC CGGGTGGACG CTGGGCTACA GCTATTCCGG CAACCAGACG CTGCAAAGCG GCTGGAACGG AACCTGGTCG CAGTCGGGCA AGGCGGTGAC GGTCGCCAAC GTCTCCTATA ACGGTTCAGT CGCCGCCGGC GCCACCACCA GCATCGGAGC CAATTTCACG TACAGCGGCG CGAACACCGC GCCGACCGTG TTCAGCGTCA ACGGCACTAC GTGCAACGGC GTGGGCTCGC CGTCCTCGCC GACGGGTACG ACGCCGAGCA CCACTCCCAG CACGACGCCG AGCACTACTC CCAGCACGAC CCCGAGCACC ACCCCGAGTA CGACGCCCAG CACCACCCCG AGCTCCCCTC CCCCGACGTC CGGCGCCCCG GCGCTGCACG TCTCGGGCAA CCAGCTCCTG GACAGCACCG GCAAGGTCTT CGTCCCGCAC GGGGTGAACC GCTCCGGCGC GGAGTTCGCG TGCGTGCAGG GCAAGGGCAT CTTCGACGGT CCGGTCGACG ACGCCTCGGT GGCGGCGATC GCCTCCTGGA AGGTGAACGT CGTCCGGGTT CCGCTGAACG AGGACTGCTG GCTCGGTGAG TCCACCGTGC TGCCGCAGTA CGGCGGAGCC ACGTACCAGG CGGCGATCGA GAGCTTCGTC AGTCTGCTGC ACAAGCACGG CATGGCGGTC ATCCTGGACC TGCACTGGAC CGACGGCGTC TACACCGGCC AGTCCTCGGC ATGTTCGGTG GCGACGGCGA CCTGCCAGAA GCCGATGCCC GACGCGGCGA ACGCCCCGGC GTTCTGGGCT TCGGTGGCCG GGGCGTTCAA GAACGACCAG TCGACCGTCT TCGACCTGTT CAACGAGCCC TACCCGGACT TCGCCGCGGG CTTCAACGCC GCGCTGGGCT GGTCCTGCTG GCAGAACGGC GGGACCTGCA CCGGCATCAA CTACCAGGTC GCCGGAATGC AGTCGCTGGT CAACGCGGTG CGCGGCGCCG GGGCGGGCAA CGTCCTGATG CTCGGCGGCG TGGCGTACTC CAACGACCTG AGCCAGTGGC TGGCCCACGA GCCGACGGAC CCCTCGCACA ACCTCGTGGC GTCCTGGCAC TCGTACAACT TCAACTCGTG CTCGTCCTCG TCGTGCTGGG ACTCCCAGGT CGCACCGGTC ATCGCGCAGG TGCCGGTGAT CCCTGGGGAG ATCGGCGAGA ACGACTGCGG CCACTCCTAC ATCGACACGC TGATGGCGTG GCTGGACAGC CACCACACCG GCTACGCGGC GTGGACGTGG AACACCTGGG ACTGCTCCTC GGGTCCCTCG TTGATCAGCG CGTACGACGG AACGCCCACG AACTACGGCG CCGGATACAA GGCGCACCTG GGCACCTTCT GA
|
Protein sequence | MFARQRLRPR PRLRAALGAT AAVALMAGSA TAAVLVPGTA SAATTVQCQV TYTVANDWGS GFTTNITIAN VGTSAWTGWT LGYSYSGNQT LQSGWNGTWS QSGKAVTVAN VSYNGSVAAG ATTSIGANFT YSGANTAPTV FSVNGTTCNG VGSPSSPTGT TPSTTPSTTP STTPSTTPST TPSTTPSTTP SSPPPTSGAP ALHVSGNQLL DSTGKVFVPH GVNRSGAEFA CVQGKGIFDG PVDDASVAAI ASWKVNVVRV PLNEDCWLGE STVLPQYGGA TYQAAIESFV SLLHKHGMAV ILDLHWTDGV YTGQSSACSV ATATCQKPMP DAANAPAFWA SVAGAFKNDQ STVFDLFNEP YPDFAAGFNA ALGWSCWQNG GTCTGINYQV AGMQSLVNAV RGAGAGNVLM LGGVAYSNDL SQWLAHEPTD PSHNLVASWH SYNFNSCSSS SCWDSQVAPV IAQVPVIPGE IGENDCGHSY IDTLMAWLDS HHTGYAAWTW NTWDCSSGPS LISAYDGTPT NYGAGYKAHL GTF
|
| |
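The derived fields in this record can be cross-checked against its coordinates and sequence. The sketch below is illustrative and is not the database's own code. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions agree with the values reported above (1602 bp, 533 aa, sequence ending in TGA). The function names are hypothetical.

```python
# Minimal sketch for sanity-checking the derived fields of a gene record.
# Assumptions (not stated by the record itself): coordinates are 1-based
# and inclusive, and the CDS length includes the terminal stop codon.

STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete, unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the spacing between 10-letter blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last codon of the cleaned sequence is a stop codon."""
    return clean_sequence(seq)[-3:] in STOP_CODONS


if __name__ == "__main__":
    length = gene_length(7703200, 7704801)
    print(length)                   # 1602
    print(protein_length(length))   # 533
```

To compare against the reported GC content, pass the full text of the "Gene sequence" field to `gc_fraction`; the record rounds the value to a whole percent.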