Gene Information |
Locus tag | Caci_6504 |
Symbol | |
ID | 8337868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7496027 |
End bp | 7497769 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644959602 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003117195 |
Protein GI | 256395631 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
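
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
start_bp, end_bp = 7496027, 7497769

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1743, matching "Gene Length"
print(protein_length_aa(length))  # 580, matching "Protein Length"
```

The strand does not affect this check: a gene on the minus strand covers the same interval on the replicon, and only its reading direction is reversed.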
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.462594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.267228 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCAC CCCCCACCCT GTACCGCACT GGCGTCAGCG GCGTCGTGGC CTTGTCACTC GCCGCTGTCA CCGCCGTCGC AGGATCGATG CTGTTCGCGC ACTCCGCGAC GGCGGCACCC ACCCTGGCCG TCACGGGCTA CGCCGAGGAG GGCACCGCCA ACAGCGCGAT TGACGCGAGT GCGGCGGCGA TGGCCACTGT CGGCGTCGAC GGCATCAACA TAAACTCCGC CGGTACCAGT GCTCCGGCGC CGGACGCCGG GGCGACCTCC CTGCTTGCCA AGGCGCACGC GGACAACCTG CGAGCCGAGT TCTTGGTGGG CAACTATTCC AGCTCGATCG GCGACTTCGA CCCCGCCGCG CTGGACAGAC TGCTGAGCTC GCCGAGCAAC ATCAACAGCG TGGTCACGAC CGTGGTGAAC GCTGTGAATT CGCAAGGCTG GGACGGCGTC ACGATCGACT TCGAGTCGAT CCTCGGCCAG GACGCCCAGG GCCTGGTCGA TTTCAGCACG GCACTGAAGC AGGCGATGCC GGCGGCCAAG ACGGTCAGCA TCGACGTCAC GGCATACCAG ACCGCCGCCG AGTACACCGC CAACGGCTAC AACCTCTCGG GTCTGGGCGG CGCCGTGGAC CGGATCGCGC TGATGGCGTA CGACGAGCAC GGCCCGACCT GGAACGGCGT GGGCCCGATC GGCGGGCTGC CGTGGCAGGA AGCATGTTTG CGGCAGCTGC TCACCCAGGT CCCCGCCGCC AAGGTCGACC TCGGCGTCGC AGGATACGGC TACACCTGGC CGAAGACAGG CACCGGCAGG CAAGTCAGCG ACGCGCAGGC CCGGCAGATG GTGGCGGGCG ACGGGTCGAC CGCGACCTGG GACAGCACCC AGGGCGAGTG GACCGCCACC CTGAAGAACG GCACGGTCAT GTGGTGGTCC GATGCGAAGT CGTGGCCGCT GCGGGCCACA CTCGCTCAGA AGTACGCGGT CCACGGCATG GCGCTGTGGT CGCTGGGCCT GTCCGACCCG CTCCCGGTCA CCGCTCCGGC CAATGGCTTT TCGGTGGCGG CGAGCCCGGC GTCGGGCTCG GTGGCGGCCG GCGCCTCGTC GACGTCGACA GTCAGCACCG CGGTCACGTC CGGTACCGCG CAGTCGGTCG CTCTGACGGC CGGCGGTGTG CCCGCCGGCG CGAGTGTCTC GTTCTCCCCG GCATCGGTGA CCGCGGGTTC GTCCTCGACG ATGACGGTGA CGACGTCCTC TTCCACACCG GTGGGCACGT ATCCGATCAC GGTGACCGGA CGCGCGGCAT CAGGTAGTCA CACCGCGACG TACACACTGA CGGTCACCAC CGCGAGCGGT TCCACCGTCT ACGAAGCGGA AGCTTCCTCC AGCGTCCTGG CCGGCGGCGC GAAGGTCGTC ACCTGCGCGG CGTGTTCGGG TGGAGCCCGG GTCGGATACC TCGGCGGCAC CGGCACGCTG ACCATGAAGA ACATCACCGT GGCCACCGCG GGCAGCTACC AGGTCACGAT CGCGTACACC AACGGCGACA CCGGCAACCT CCGGATCATG CTCAGCGTCA ACGGTGGCGC CAACGCCACC TTCACCGGCG CGCCGACGAC GAACTGGGAC ACCCCCGCGA CCGGCACCAT CACTGTGAGC CTGGCGGTCG GAACCAACAC CATCCTGTTC AGCAACACGG GAACCACCGG CGACGTCCCC GACATCGACA AGATCGCCGT GGTGTCCAAG TGA
|
Protein sequence | MTSPPTLYRT GVSGVVALSL AAVTAVAGSM LFAHSATAAP TLAVTGYAEE GTANSAIDAS AAAMATVGVD GININSAGTS APAPDAGATS LLAKAHADNL RAEFLVGNYS SSIGDFDPAA LDRLLSSPSN INSVVTTVVN AVNSQGWDGV TIDFESILGQ DAQGLVDFST ALKQAMPAAK TVSIDVTAYQ TAAEYTANGY NLSGLGGAVD RIALMAYDEH GPTWNGVGPI GGLPWQEACL RQLLTQVPAA KVDLGVAGYG YTWPKTGTGR QVSDAQARQM VAGDGSTATW DSTQGEWTAT LKNGTVMWWS DAKSWPLRAT LAQKYAVHGM ALWSLGLSDP LPVTAPANGF SVAASPASGS VAAGASSTST VSTAVTSGTA QSVALTAGGV PAGASVSFSP ASVTAGSSST MTVTTSSSTP VGTYPITVTG RAASGSHTAT YTLTVTTASG STVYEAEASS SVLAGGAKVV TCAACSGGAR VGYLGGTGTL TMKNITVATA GSYQVTIAYT NGDTGNLRIM LSVNGGANAT FTGAPTTNWD TPATGTITVS LAVGTNTILF SNTGTTGDVP DIDKIAVVSK
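
The sequence fields are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the record's overall GC content.

```python
def join_blocks(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the "Gene sequence" field, used as a toy input.
example = "ATGACATCAC CCCCCACCCT"

seq = join_blocks(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.6
```

Applying the same two functions to the full gene sequence field would give a value to compare against the listed GC content, provided that figure was computed over the gene rather than a larger region.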