Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2490 |
Symbol | |
ID | 8333839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 2814532 |
End bp | 2816112 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644955643 |
Product | Carbohydrate-binding CenC domain protein |
Protein accession | YP_003113249 |
Protein GI | 256391685 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGGG AGCTCGGTAC AGCAGGAGGC AGAGACCGGC CGCCGGTGCG TTCGCGGCGG GCCGCGGGTG TGGTCGGGGT GGTCGCGGCG GGCGCGCTGG CCGCCGCGGG GGTGGCGGTG GCGGCCGGCG GTGCCAATGC CGCCGCTGCG AATCTGGTGG TGAATCCGGG CTTTGAGGCC GGGGCGCTCA CCGGGTGGAC GTGCAGTGCC AACAGCGGGT CCGTGGTGAG CACTCCGGTG CACTCCGGGT CCTACGCCCT CAGTGCCACG CCGGCGGGTT CGGACGACGC GCAGTGCACG CAGAGCGTGA GCGTGCAGCC GAACTCGTCC TACACGCTGT CCGCTTGGGT ACAGGGCAGC TACGTCTACC TCGGCGCCAG CGGAACCGGG GGGACCGATC CGAGCACCTG GAGCAGCAAC GCGGCGTGGA ACCAGCTCAC CACGACGTTC ACCACCGGCG CCTCGACCAC CAGCGTCACG GTCTACCTGC ACGGCTGGTA CGGCCAGGGC ACCTACCACG GCGACGACGT CAGCCTGCTG GGACCGGCCG GGACCGGCTC GTCGAGCACC CCGCCGACCA CGCCGAGCAG CCCGACCACG CCGTCGACCA CGCCCACGAC GCCGCCGACC ACCCCGACGA CGCCGTCCTC GTCGCCGAGC TCGGTTCCCC CGCCGCCGCC GAACTCCGGC TTCAACCACC CGGCGTACTT CATGCCGCTG GACAACAGCC CGCAGGCGAT CTCCGCCATC GTCAACGCCG GGGAGAAGGA GCTGAACCTG GCCTTCGTCC TGGACTCCGG CGGCTGCACC CCGGCCTGGG GCGGCAACGC CTCGACCCCG GTGTCCTCCG ACACCACGGT CCTGGGCGAC ATCAGCGCCC TGCGGGCGGC CGGCGGTGAC GCCGCGGTCT CCTTCGGCGG CTACAACGGC ACCGAGCTCG GGTCCTCGTG CGGCAGCGCC AGCTCGCTGG CGGCGGCGTA CCAGAAGGTG ATCACCAAGT ACCAGCTGAA GCACGTCGAC TTCGACTACG AGAACACCGC GCTGGACAGC AACACCGCCG TCCGCTTCGG CGCCATCAAG ATCCTGGAGC AGAACAACCC GAATCTGGTG GTCTCGCTGA CCATCCCGAT GACCACGGTC GGTTTCCCGG GCTCCGGGGT CGATGAGATC AAGCAGGCGG TGGCGGCCGG CGCGCGGCTG GACGTCATCA ACATCATGGA CTTCGACACC GGTCTGACCT CCGGTACCGA GGTCGGCCAG ACCGAGGCGA TCGCCAACGA CGCCATCTCG CAGCTGCAGT CCATCTACGG CTGGAGCACG TCGCAGGCTT GGTCGCACCT CGGCCTGCAG ATCATGAACG GCCACACCGA CCAGCCCTCC GAGCTGTTCC AGGAGAGCGA CTTCTCCGCG CTGCTCGGCT TCGCCAAGGC GAACCACCCG GCGTGGTTCT CCTACTGGTC GGCCAACCGG GACCGCGTGT GCGACCCGAG CGTGCCGCAC AACTGGGCCG ACGGGACGTG CTCGAGCGTC TCGCAGAACC CGTGGGACTT CACCAAGATC CTGGTGCAGT ACACCGGCTG A
|
Protein sequence | MNRELGTAGG RDRPPVRSRR AAGVVGVVAA GALAAAGVAV AAGGANAAAA NLVVNPGFEA GALTGWTCSA NSGSVVSTPV HSGSYALSAT PAGSDDAQCT QSVSVQPNSS YTLSAWVQGS YVYLGASGTG GTDPSTWSSN AAWNQLTTTF TTGASTTSVT VYLHGWYGQG TYHGDDVSLL GPAGTGSSST PPTTPSSPTT PSTTPTTPPT TPTTPSSSPS SVPPPPPNSG FNHPAYFMPL DNSPQAISAI VNAGEKELNL AFVLDSGGCT PAWGGNASTP VSSDTTVLGD ISALRAAGGD AAVSFGGYNG TELGSSCGSA SSLAAAYQKV ITKYQLKHVD FDYENTALDS NTAVRFGAIK ILEQNNPNLV VSLTIPMTTV GFPGSGVDEI KQAVAAGARL DVINIMDFDT GLTSGTEVGQ TEAIANDAIS QLQSIYGWST SQAWSHLGLQ IMNGHTDQPS ELFQESDFSA LLGFAKANHP AWFSYWSANR DRVCDPSVPH NWADGTCSSV SQNPWDFTKI LVQYTG
|
| |