Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3707 |
Symbol | |
ID | 8335060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4160733 |
End bp | 4163735 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956847 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_003114450 |
Protein GI | 256392886 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.176968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.212982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCCA CACCCACCAC CCCACCCGCA GCATCCCCCT CATCGGCGAT ACCTCGAGGA CGCAGACGCG CCCTGGCGGC GTCGACCGCC GGAGCTGTCA CGTTCGCCAC CATCGCCGCC TGGTCCATGG CGCACTCGGC ATCGGCCGCC GGTACGACCT ATGAGGCGGA GAAGGCGGCG TTGTCCGGCG GTACGGTCGT CGCCAGCGAT CACGCGAACT ACACCGGGAC CGGTTTCGTC GGCGGATACA CCGATGCGAA TAAGGGCGCT GCGAACACCA CGTTCACCGT GAACGCCTCC GCTGCGGGGA ACGAGTCCGT GGCGCTGCGT TATGCGAACG GCACCGGTGC GCAGATGTCG CTGTCGTTGT ACGTCAACGG CACCAAGCTG AAGCAGATCC TGCTTCCGGC GTCGGCCAAC TGGGACACCT GGACGACCGA GACCGAGTCG GTCGCGCTGA AGGCCGGGAC CGACACGATC TCGTACAAGT TCGACTCCGC CGATCTCGGC AACGTCAACG TGGACAACAT CACCGTCACT CCGGCGGCAC CGCCTCCGGC CGGTCAGTTC GAGGCGGAGA GCGCGGCGCT TTCCGGCGGT ACGGTCGTCG CCAGCGACCA TGCGAACTAC ACCGGCACCG GTTTCGTCGG CGGATACACC GATGCGAATA AGGGCGCTGC GAACACCACG TTCACCGTCT CGGAATCCGG TGCCGGATCC ACGACGACGA CGCTGCGCTA CGCGAACGGC ACCGGCGCGC AGATGTCGTT GTCGTTGTAC GTCAACGGCA CCAAGATCAA GCAGATCCTG CTTCCGGCCA CGGCGAACTG GGACACGTGG GGGACCGAGA CCGAAAGCGT CAGCCTGAAC GCGGGCAGCA ACGCGGTCTC CTACAAGTTC GACTCCTCCG ACCTGGGCAA CGTCAACATC GACAACATCG TCGTCGGTGC GATCACCCCG CCGACCACGA CGCCGTCGAC CTCTCCCAGC TCCAGCTCGC CGCCGCCGTC CGGCACGCCC TATGAGGCCG AGACCGCCTT CACCGCCGGC GGCCCGAGCG TCGCGACGTC TATCAGCGGG TACAGCGGCA CCGGCTACCT GACCGGCTTC ACCACCCAGG GCGCCGAGAC CGTCATCGAC ACCGACGTCC CGGCTGCGGG TTCGGATGCC GTGACCCTTC GCTATGCGAA CTCCACCGGC TCGGCGCAGA CGATCTCGCT GTATGTCAAT GGCCTGAAGA ACGCACAGCT CTCGCTGCCG GCCGGCAGCG GCTGGCTGAC GTCGTCGCGG ACCATCGCGC TGCGCTCCGG GGAGAACCTC ATCGGCGTCC AGCACGACAG CGGCGACACC GGCAACGTCG CCATCGACGA CGTCACCGTC GCCAACGGCA CCGCTCTGGC CGCGGTCGGC GCGACGCTCC CGTACACCGA GTACACCGCC ACGAGCTCGC AGACGCAGAC CAACGGCACG GTCCTGGCCG CCAGCACCGC CTACCCGAGC ATCCAGGCCG AATCGACCGG CCGCCGGGCC GTCCAGCTGA CCGCCACCGG CCAGTACATG CAGGTCACGT TGGCGCACCC GACCAACTCG ATCGTGGTCC GCTATTCGAT CCCTGACAAT GGCGACGGTT CCGCGGCGAG CGCCCCGATC GCGTTGTACG CCAACGGGAA CAAGATCCAG GATCTGACCC TCACCACCAA GTACTCCTGG CTCTACGGCG GCGGCTACTA CGACACAAAC ACGCCGAGCA GCGGTCCCGC GCACCACTTC TATGACGAGA CCAGGGCCCT GATAGGCAAC TGGCCGGCGG GAACGGTGCT GAAGCTCCAG AAGGACTCCG GCGACACGGC CGCCTCGTAC ACCTTCGACG TCATCGACAC CGAACAGGTG GACCCTGCCT TCGCGATACC GGCGAACTTC GTCCCGATCA CCAACTACGG CGTCACTCCG AACAACGGGG CGGACGACAC CAACGCGATC AACAGCGCGC TGAGTGCTTT GGCCGGAACG GGCAAGGGCT TGTTCTTCCC GTCCGGAACC TACGACATCT CGGGCCGCAT CAACATCAAC GGCGTGCCGG TGCGCGGCGC CGGCGAGTGG TACACGACGA TCCAGTCCAC GGCCGTGAAC GGCAGCGGCG GTCTGTACAC CACCGCCGGC GTGAACCAGA TCGCCGACCT GACGATCTCC GGCGATCAGA CCTCGCGGAA CAACGACTCC GGCGCGGCCG CGATCGAGGG GACCTTCGCG CAGGGCTCGC TGCTGTTCGA CGTGTGGATG GAGCACACGA AGGTCGGGCT GTGGGCGGTT CCGGGCGTCG GGCTCTACGC CTCCGGGCTG CGCGTCCGCG ACGTCTTCGC CGACGGTCTC CACGTCCACG GCGGCAGCAA CGGCACCCGG ATCGACCAGT CGCAGGTGCG CAACAGCGGC GACGACAACA TCGCGCTGGA CACCGAGGGC GGCGACGTCG TCCGCTGCTC GCTGGTGCAC AACACCGTTC AGAGTCCGAT CCAGGCCAAC GGCATCGGTG TCTACGGCGG CAACGGCAAC GCCGTCGTCG CCAATCAGGT CTCTGACACC GTCGCGTTCG GCGCGGGCAT CACCGTCAGC ACCCGGTTCG GAGGCGGGTT CACCGGCCCG ACCACGGTGT CCGGCAACGC GCTGACACGC GCCGGATCGT ATGAGTACAA CTGGGGTTCG AGCCTCGGCG CACTGTGGAT CTACGCGAGC CAGTCCGACA TCACCCAGCC GGTGACCGTC TCCACCAACA CGATCACCAG CGCCACCTAC GACGCCCTGC TTCTGGGTGA CAGCAAGCAG ATCGCCAACC TGACGCTCGA TCACCTCGCG ATCAGCGGCG CGGGCGGATA CGGCATCAAC ATCAAGAACC TGACCGGCGG GATGACGGCG AACTATGTGA CCGTCACCGG CGCCGCGTCC GGCGGACTGA ACAACCCCTC GAACTACCCG ATCACGCGCG GTCCGGGGGA CAGCGGCTGG TAG
|
Protein sequence | MPPTPTTPPA ASPSSAIPRG RRRALAASTA GAVTFATIAA WSMAHSASAA GTTYEAEKAA LSGGTVVASD HANYTGTGFV GGYTDANKGA ANTTFTVNAS AAGNESVALR YANGTGAQMS LSLYVNGTKL KQILLPASAN WDTWTTETES VALKAGTDTI SYKFDSADLG NVNVDNITVT PAAPPPAGQF EAESAALSGG TVVASDHANY TGTGFVGGYT DANKGAANTT FTVSESGAGS TTTTLRYANG TGAQMSLSLY VNGTKIKQIL LPATANWDTW GTETESVSLN AGSNAVSYKF DSSDLGNVNI DNIVVGAITP PTTTPSTSPS SSSPPPSGTP YEAETAFTAG GPSVATSISG YSGTGYLTGF TTQGAETVID TDVPAAGSDA VTLRYANSTG SAQTISLYVN GLKNAQLSLP AGSGWLTSSR TIALRSGENL IGVQHDSGDT GNVAIDDVTV ANGTALAAVG ATLPYTEYTA TSSQTQTNGT VLAASTAYPS IQAESTGRRA VQLTATGQYM QVTLAHPTNS IVVRYSIPDN GDGSAASAPI ALYANGNKIQ DLTLTTKYSW LYGGGYYDTN TPSSGPAHHF YDETRALIGN WPAGTVLKLQ KDSGDTAASY TFDVIDTEQV DPAFAIPANF VPITNYGVTP NNGADDTNAI NSALSALAGT GKGLFFPSGT YDISGRININ GVPVRGAGEW YTTIQSTAVN GSGGLYTTAG VNQIADLTIS GDQTSRNNDS GAAAIEGTFA QGSLLFDVWM EHTKVGLWAV PGVGLYASGL RVRDVFADGL HVHGGSNGTR IDQSQVRNSG DDNIALDTEG GDVVRCSLVH NTVQSPIQAN GIGVYGGNGN AVVANQVSDT VAFGAGITVS TRFGGGFTGP TTVSGNALTR AGSYEYNWGS SLGALWIYAS QSDITQPVTV STNTITSATY DALLLGDSKQ IANLTLDHLA ISGAGGYGIN IKNLTGGMTA NYVTVTGAAS GGLNNPSNYP ITRGPGDSGW
|
| |