Gene Information |
Locus tag | Caci_1433 |
Symbol | |
ID | 8332772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 1630647 |
End bp | 1633658 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644954581 |
Product | glycoside hydrolase family 2 TIM barrel |
Protein accession | YP_003112197 |
Protein GI | 256390633 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
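The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, but both match the reported values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1630647, 1633658)
print(length)                     # 3012, matching "Gene Length"
print(protein_length_aa(length))  # 1003, matching "Protein Length"
```

The strand field does not affect this calculation, because the length depends only on the span between the two coordinates.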
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCAGTG ACACTGAGAC CTCCTACCAG CGTTTCGCCT ACGTCGAGGA CCGCTCCCCC GGGACCGGCC GGCTCGCACC GCGCGCGGCG TTCGCCTCCG ACGCCGCGGT CCTCGGGCTC GACGGCCGGT GGCGCTTCCG CCTGGCCGCC GGGCTGCACG ACACGACAGA GGCATTCCAG GCGCCGGACT TCGACGACGC CGCCTGGGAC GAGATCGCCG TCCCGTCGTG CTGGCAGATG GACGGCCTGC CCGGCGAGCC GCGCTACGGC GCGCCGGCGT ACACGAACGT CACCTATCCG ATCCCGCTGA ACCCGCCGCA CGTCCCGCGC GAGAACCCGA CCGGCGAGTA CCGCTACGCC TTCGACGTGC CCGGCGACTT CCACGCCTCG GGTGCGCGCT TACGCTTCGA AGGCGTCGAT TCCTGCTTCG CGGTCTGGCT GAACGGCGCG CTGCTCGGCG ACGGCAAGGG CTCGCGGCTG CCCACCGAGT TCGACGTCTC CTCCGTGCTG GAACCGGGGC GGCAGAACGT GATCGCGGTC CGTGTCCACC AATGGTCGGC GGGAACCTAC CTGGAAGACC AGGACATGTG GTGGCTGTCC GGCATCTTCC GCTCGGCGGC GGTCCTGGAG CGGCCGCGCG AGGGTATCTC AGACTTCTTC GTCCACGCCG ACTACGACCC GGCGACCGGC GCCGGCACGC TGCGGATCGA CGTCACCGGC ACGGCCCGCC TCACCGTCCC CGACCTCGGC ATCGCCGACG CCGACCCGGC CGGGCCGTTC GTCATCAGGC GGGTCGAGCC CTGGAGCGAC GAGCGGCCGC GCCTGTACGC CGGCGAGCTG GTCAGCGCCG GCGAGCGCGT ACCAATCCGT ATCGGCTTCC GCCGCGTCGA GGCCGCCGAC GGCGTCCTGC GCGCCAACGG CAAGCCGCTG AAGTTCCGCG GCGTGAACCG CCACGAGTGG CATCCGCTCA CCGGCCGCAC GCTGAGCCCG GAGACGATGC TGGAGGACGT GCTGCTGATG AAGCGGCACA ACATCAACGC CGTCCGCACC TCGCACTACC CGCCGGACTC CCGCTTCCTG GACCTGTGCG ACGAATACGG GCTCTGGGTC ATCGACGAGT GCGACCTGGA GACGCACGGC TTCGCCGTGG TCGGCTGGCG CGAGAACCCC GTCGCCGACC CCGCGTGGCG CGAGGCGCTG TTGGACCGCG CCGAGCGCAT GGTCGAGCGG GACAAGAACC ACCCGAGCGT GGTGATCTGG TCGCTGGGCA ACGAATGCGG CAGCGGCGAG AACCTGGCCG CGATGGCCGC GTGGATCCGG GAGCGCAACC CCGAGCGCCT GATCCATTAC GAGGGCGACC ACGACTCCTC CTACGTCGAC CTCTACTCGC GGATGTACTC CGACTACGAC CACGTCGCCG CCATCGGGGT GTATCAGGAG CCGACGACGG TCGATCCGAC GGCCGACGCG CACCGGCGCT CCATCCCCTT CATGCTCTGC GAATTCGCCC ACGCGATGGG CAATGGACCC GGCGGCCTGC TGGAATACCG CGACCTGTTC GAGGCCCATC CCCGGCTGGC CGGCGGCTTC GTCTGGGAGT GGATCGACCA CGGCGTCGCG CAGGGCTCGC ACTACGCCTA CGGCGGCGAC TTCGGCGAGC GCGTGCACGA CGGCAACTTC GTCGCCGACG GCCTGCTCTT CCCGGACCGC ACGCCCTCGC CGGGTCTGCT GGAATACGCC AAGCTCTGCG AGCCGGTCCG CATCGAGGGC GACACCGTTC GCAACCTGCA CCACAGCCGG 
GACACCGGAT ATCTGCGCTG GCGCTGGCGA TTGGAGATCG ACGGCGACCT TATCGCGCAG GACGAACTCC CCGTCCCACC GATCGCCCCC GGCTCGACCT TCCGCCTCCG TTACCCGGAC GAGCTGACGA AGGCCGCCCA CGCCGCCGGC CCGGGCGAGC GCTGGCTGAC CGTCGAGGCG GTGCTGTCCG ACGGTGAACC ATGGGCGCCG GCCGGGCACG TGGTCGCCTG GAGCCAGCTC GAGCTGGGAG ACGCGCCGTT CACCGACGCC GATCCGCTGG TGGACCAAGC GGTCGCGCTG GCTGCGGACG CGCTGCTGGC GGCGACCGCC GTCGGCGGCA GCGCGATCGA CCGCGGCGAC GCGGCGGCCG ATTACCTGAC GCCGCAACGC CTCGGCGACA CCGTCACCCT GGGACCGGCC TCCTTCGACG CCTCCTCCGG CGAGCTGCTG GGCTTGGCAG GGCTGGCGAT CGACGGCTTC GCCCTGGATC TGTGGCGCGC TCCGATCGAC AACGAACGCT GGTCCTCCTT CACCGCGCCG CCGTTGGTCG AGGCGTGGCG GACGGCCGGC CTGGACCGGC TGGAGCACGA CGTCCTGGCG GTCGAGTCCG AGCCCGACGC GTTCACCGTC ACGACGCGCG TCGGTCCGGT CGGCCGCGAC CACCATCTCG ACGTCGTCTA CGTCTGGTCG GCGACGGACT CCCGGCTGTG CCTGACCGTC CACGTCGCGC CGAACCGGCC TTGGCCGTGC CCGATCCCCC GGCTCGGCGT GTCCTTCCAG CTGCCCGGCG AGCTGAATAC CGTGAGCTGG TACGGACTCG GGCCCGGCGA GGCCTACCGG GACAGCCGGT CGGCGGTGCG GATCGGGCAT TATCAGAGCT CTGCCGCCGA TCTGCAGACG CCCTACCTTT TCCCGCAGGA GAACGGGAAC CGCCACCAGG TGCGCAGAGC CTCGCTCACG CGTCCCGACG GCACCGGGCT GCTGCTCTCC GGCGCGCCGC ACTTCGATCT CGCCGTGCGG CCGTGGAGCA GCGCCGCAAT GGAAGCCGCG CGCCATCCCG ACGAGCTGAT TCCCTCCGGG CGGCTGCACG TCCAGGTGGA CCACGCGCAC CACGGCATCG GCAGTGCGTC GTGCGGCCAC CCCCTGCAGC CCCGCCATCG CCTCGAGGCC GGCCGCGCGA GCTTTGCCTT CACCCTGGAG GCGCTACAGT AG
|
Protein sequence | MPSDTETSYQ RFAYVEDRSP GTGRLAPRAA FASDAAVLGL DGRWRFRLAA GLHDTTEAFQ APDFDDAAWD EIAVPSCWQM DGLPGEPRYG APAYTNVTYP IPLNPPHVPR ENPTGEYRYA FDVPGDFHAS GARLRFEGVD SCFAVWLNGA LLGDGKGSRL PTEFDVSSVL EPGRQNVIAV RVHQWSAGTY LEDQDMWWLS GIFRSAAVLE RPREGISDFF VHADYDPATG AGTLRIDVTG TARLTVPDLG IADADPAGPF VIRRVEPWSD ERPRLYAGEL VSAGERVPIR IGFRRVEAAD GVLRANGKPL KFRGVNRHEW HPLTGRTLSP ETMLEDVLLM KRHNINAVRT SHYPPDSRFL DLCDEYGLWV IDECDLETHG FAVVGWRENP VADPAWREAL LDRAERMVER DKNHPSVVIW SLGNECGSGE NLAAMAAWIR ERNPERLIHY EGDHDSSYVD LYSRMYSDYD HVAAIGVYQE PTTVDPTADA HRRSIPFMLC EFAHAMGNGP GGLLEYRDLF EAHPRLAGGF VWEWIDHGVA QGSHYAYGGD FGERVHDGNF VADGLLFPDR TPSPGLLEYA KLCEPVRIEG DTVRNLHHSR DTGYLRWRWR LEIDGDLIAQ DELPVPPIAP GSTFRLRYPD ELTKAAHAAG PGERWLTVEA VLSDGEPWAP AGHVVAWSQL ELGDAPFTDA DPLVDQAVAL AADALLAATA VGGSAIDRGD AAADYLTPQR LGDTVTLGPA SFDASSGELL GLAGLAIDGF ALDLWRAPID NERWSSFTAP PLVEAWRTAG LDRLEHDVLA VESEPDAFTV TTRVGPVGRD HHLDVVYVWS ATDSRLCLTV HVAPNRPWPC PIPRLGVSFQ LPGELNTVSW YGLGPGEAYR DSRSAVRIGH YQSSAADLQT PYLFPQENGN RHQVRRASLT RPDGTGLLLS GAPHFDLAVR PWSSAAMEAA RHPDELIPSG RLHVQVDHAH HGIGSASCGH PLQPRHRLEA GRASFAFTLE ALQ
|