Gene Information |
Locus tag | Caci_3818 |
Symbol | |
ID | 8335171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4317502 |
End bp | 4319874 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644956957 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003114560 |
Protein GI | 256392996 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.224381 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGTC AGATGTCCGC CGACACGGAT TCGGCGCCCG GCGTCCCTCC GTTATGGCGC AATCCCCGAC TCACGCCGCA GGAACGCGCC GACGCCCTCA TCCCTGTGAT GACTCTCGAG GAGAAGGTCG CGCAGTTGGC CGGGGTATGG GTCGGCGCCG ATGCTTCCGG CGGCGGCGTC GCCCCGCATC AGCAGGACAT GGCCCCCCTC GCGTGGGAGG ACGTGATCCG GCACGGGCTG GGACAGCTGA CCCGGCCGTT CGGGACGGCT CCGGTCGATC CGGTGGCCGG GGCCCGGTCG CTGGCCGCCT CCCAGGCGCA GATCGCCGCT GCCAGCCGCT TCGGGATCCC GGCGCAGGTG CACGAGGAAT GCCTGACCGG TTTCGCGACC TGGGGCGCGA CGGCCTACCC GGCGCCGCTG GCCTGGGGCG CGTCCTTCGA CCCGGAACTC GTGGAGCAGA TGACCGGCCG GATCGGCCGC TCGATGCGGG CGGTCGGTGT CCACCAGGGG CTGGCTCCGG TGCTGGACGT GACTCGCGAC TACCGCTGGG GCCGCACCGA GGAGACCATC GGGGAGGATC CTCATCTGGT CGGCGTGATC GGCGCGGCCT ATGTCAGAGG CCTGGAGAAC GCCGGGATCG TCGCGTCGCT CAAGCACTTC GCCGGCTACT CCGCCTCCCG CGGCGGACGC AACCTCGGCC CGGTGCCCAT GGGACGCCGT GAGCTGGCCG ACGTCGTCCT GCCGCCGTTC GAGGCGGCAC TGCGCCTGGG CGGCGCCCGC TCGGTGATGA ACTCCTACTC CGAGATCGAC GGGGTGCCCG CCGCGGCCGA CGAACAACTG CTGACCGGGC TGCTCCGCGA CCAATGGGGA TTCACCGGCA CGCTTGTCTC GGACTACTTC GCGGTGCGCT TCCTTCAGTC ATTGCACGCG GTGGCCGGCG ACGCGGCACA CGCCGCGGAT TTGGCCCTGC GGGCCGGCAT CGACGTGGAA CTTCCAACGG TGGACGTCTT CGGCACGCCG CTCACCGAGG CGGTCCGCGC CGGCGCGGTC GAGGAGGCTC TGATCGACCG GGCTTTACGC AGAGTCCTGA TCCAGAAGGC GGAACTCGGC CTGCTCGACC CGGACTGGCG GGCGCTGCCC GAGGAGATCG AAGCCGGTCC GGTGAGCCTG GACAGCGAGG AGGACCGCGA GATCGCCCTG CGCTTGGCCC GCCGGTCGGT AACGGTGCTG CGCAACGAGA ACGGAATCCT GCCGCTGATG CCGGACCGGC GCGTCGCGCT GATCGGCCCG GTCGCCGACG ATCCGATGGC GATGCTGGGA TGCTATTCCT TTCCCGCGCA CGTTAGCGGT AACACCGAAC ACGGCCTCGG TCTGGAAATC CCCTCGCTCC GCGAGGCGCT CAGCGCGACG ATCGCCGACC TGCTCTACGA GCCCGGATGC GCGATATCCG ATGACGACAC CTCCGGCATC GCCGCCGCGG CCGGCGTGGC GGCCGCGGCC GACGTATGCG TCCTGGCCGT CGGCGACCGC GCCGGCCTGT TCGGCCGCGG CACCTCGGGC GAGGGCTGCG ATGCGGCCGA CCTGAATCTG CCGGGGGTGC AGGCCGAACT GGTCCGCGCC GTTCTCGCCA CAGGAACACC GGTGGTCCTG GTCCTGCTGG CCGGCCGGCC CTACGCGCTC GGCGAAGACG TCGCAGACGC CGCCGGCATC GTCTACGCCT TCTTCGCCGG CCAGCTCGGC GGACAGGCGA TCGCCGAGGT CCTGACCGGC GCGGTGAACC CCTGCGGCCG GCTTCCGGTC 
AGCGTCCCGC GCGATTCCGG CGGTCTGCCG GTGACCTATC TCGCCCCGCC GCTGGGCCGC CGCTCGCAGG TCTCCTCGGT GGATCCGACG CCGGCGTTCC CGTTCGGGCA CGGCCTGAGC TACACGACGT TCGCCTGGAG CGGCGCCGCG GCCGACAGCG CCGAATGGCC GGTCGACGGC GAGGCCACAG TCCGGATCAC CGTCAGCAAC AGCGGCGAGC GCGCGGGCAC CGAAGTCGTC CAGCTGTATC TGCACGATCC GGTCGCGCAG ACCTCGCGAC CCGTCGTGCG ACTGGTCGGC TTCGCGCGCG TCGATCTGGC GCCCGGCGAG AGCGCCGAGG TGGCCTTCGA AGTCCCGGCC GACCTCGCCT CGTTCACCGG CCTGCGCGGC ACCCGCGTCG TGGAGCCCGG CGACGTCGAG CTGCGGTTCG GCCGCTCCAG CGGCGAGGCG GCCGCGATCG TGCCGCTGCG CATGACCGGT GCCGAACGCG AGGTGGGCCC GGGCCGCCGT CTGACCTCAC CTGTCCGCGT CGAGCGCTCG CCGGCGCCCG CGCCGTCGAA GCCGGGGAGA TGA
|
Protein sequence | MTSQMSADTD SAPGVPPLWR NPRLTPQERA DALIPVMTLE EKVAQLAGVW VGADASGGGV APHQQDMAPL AWEDVIRHGL GQLTRPFGTA PVDPVAGARS LAASQAQIAA ASRFGIPAQV HEECLTGFAT WGATAYPAPL AWGASFDPEL VEQMTGRIGR SMRAVGVHQG LAPVLDVTRD YRWGRTEETI GEDPHLVGVI GAAYVRGLEN AGIVASLKHF AGYSASRGGR NLGPVPMGRR ELADVVLPPF EAALRLGGAR SVMNSYSEID GVPAAADEQL LTGLLRDQWG FTGTLVSDYF AVRFLQSLHA VAGDAAHAAD LALRAGIDVE LPTVDVFGTP LTEAVRAGAV EEALIDRALR RVLIQKAELG LLDPDWRALP EEIEAGPVSL DSEEDREIAL RLARRSVTVL RNENGILPLM PDRRVALIGP VADDPMAMLG CYSFPAHVSG NTEHGLGLEI PSLREALSAT IADLLYEPGC AISDDDTSGI AAAAGVAAAA DVCVLAVGDR AGLFGRGTSG EGCDAADLNL PGVQAELVRA VLATGTPVVL VLLAGRPYAL GEDVADAAGI VYAFFAGQLG GQAIAEVLTG AVNPCGRLPV SVPRDSGGLP VTYLAPPLGR RSQVSSVDPT PAFPFGHGLS YTTFAWSGAA ADSAEWPVDG EATVRITVSN SGERAGTEVV QLYLHDPVAQ TSRPVVRLVG FARVDLAPGE SAEVAFEVPA DLASFTGLRG TRVVEPGDVE LRFGRSSGEA AAIVPLRMTG AEREVGPGRR LTSPVRVERS PAPAPSKPGR
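The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch of that arithmetic, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA, a stop codon under translation table 11). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 4317502
END_BP = 4319874
GENE_LENGTH_BP = 2373
PROTEIN_LENGTH_AA = 790

# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-base display blocks."""
    return "".join(blocked.split()).upper()


def ends_with_stop(sequence: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return sequence[-3:] in STOP_CODONS


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # Only the last few bases of the gene sequence are needed for the stop check.
    print(ends_with_stop(clean_sequence("GCCGGGGAGA TGA")))
```

Under these assumptions the three checks agree with the record: 4319874 - 4317502 + 1 = 2373 bp, and 2373 / 3 - 1 = 790 aa.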