Gene Information |
Locus tag | Caci_3629 |
Symbol | |
ID | 8334982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4059422 |
End bp | 4060888 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644956770 |
Product | glycoside hydrolase family 28 |
Protein accession | YP_003114373 |
Protein GI | 256392809 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
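
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


# Values taken from the record for Caci_3629.
length = gene_length_bp(4059422, 4060888)
print(length)                      # 1467
print(protein_length_aa(length))   # 488
```

Both results match the Gene Length (1467 bp) and Protein Length (488 aa) fields, which is consistent with the stated assumptions.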
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGAAGAA TCACCCTCCC CCCGATCCGC CGCTCGGCTC CCCGCATCAG CGTCCTCGCC GCCGCCGCAC TGGCGATCAG TGGCCTGATG ACGGGCGGTG CGCACGCGGC CAGCGTGGCC ACCGGCGACA GCCGCTCCGT TCACGAGCCG TCACTTCCCT CCGTCTGCCA ACAGCTGAGC GCGAACCTGG CGACCAGCAA CGAGCAGTTC TCCAGCAGCG CCGAAGCCAG CCCGCCGGAC ACCTCCCGGA TCCAGAGCGC GCTCAACGCT TGCAAGAGCA GCGGAAAGTC GGTCGAGCTG AAAAGCAGCG GCGGGAACAC CGCGTTCCTG TCCGCCCCGC TGACCGTGCC CACCGGCGTG TACCTGGTGA TCGACTCCGG CACGACGCTG TACGCGAGCC GCGTCGCATC GCAGTACCAG TCCAGTTCCT CGGACAAGTG CGGCACGATC TCCAGCAGCG ACAACGGCTG CAACCCGTTC ATCAAGGTCT CCGGCGCCAA TGCGGGCATC GAGGGCGTCC GCAGCAGCTC CGGCTCCCAA GGCACGATTG ACGGACGCGG CGGCCAGACC GTCTACGGCA CCAGCACGAC GTGGTGGCAG CTGGGCACCA CCGCCGGCAA CGAGGGCAAG AAGCAGAACG ACCCGCGGCT GATCATCTTC CAGGGCGCCG ACAACGCGAC CCTGTACAAC ATCAACCTGG TCAACTCAGC GTTCTTCCAC GTGACCTACA AGTCCGGCAA CGGCTTCACC GCCTGGGGCG TCCGCATCAA GACCCCGGCC ACCGCCCGCA ACACCGACGG CATCGACCCC GACGCCGCCA CCAACATCAC CGTCGCCAAC TCCTATATCC AGGACGGCGA CGACGGCATC GCCATCAAGG GCGGCAGCGG CGCCGCGTCG AACATCACCA TCGAGAACAA CCACTTCTAC GGCACGCACG GCATCTCCAT CGGCAGCGAG ACCAACGACG GCGTGACCAA CGTGCTGGTC CAGAACAACA CCGTGCAGGG CTCCGACAGC TCGGGCAATG TCAGCACCCT GAACAGCGGC CTGCGGATCA AGAGCTACCC GGGCAAGGGC GGGACCGTCT CGACGGTCAC CTACACCAAC ACCTGCGAGA CCGGCGTGTC CCACCTGATC GAACTGAACC CCCGCTACTC CTCATCGAGC GGTTCGGGCA CGCCGGAGTT CACCAACATC CTGGTGAACG GACTCAAGTC GGTCAGCTCG GTGTCCGGCG CGCAGTCGAT CATCGAGGGC TACGACTCCT CGCACATCAC CGGCCTGACC CTGCAGTACG TGAGCCTGGA CAAGACCGCC TACACCGCCG CGTACGCGAA TGTCGGCGTC TACAACTCCA ACCTGAACCC GTCTTCGGGT TCCGGCGTCC ACCTCACGCA GCTGTCCTCG CCCGTGACGA GCGGCAGCGT CCCCTCCTGC TCCTTCCCCG GCTACCCGAG CCTGTAG
|
Protein sequence | MRRITLPPIR RSAPRISVLA AAALAISGLM TGGAHAASVA TGDSRSVHEP SLPSVCQQLS ANLATSNEQF SSSAEASPPD TSRIQSALNA CKSSGKSVEL KSSGGNTAFL SAPLTVPTGV YLVIDSGTTL YASRVASQYQ SSSSDKCGTI SSSDNGCNPF IKVSGANAGI EGVRSSSGSQ GTIDGRGGQT VYGTSTTWWQ LGTTAGNEGK KQNDPRLIIF QGADNATLYN INLVNSAFFH VTYKSGNGFT AWGVRIKTPA TARNTDGIDP DAATNITVAN SYIQDGDDGI AIKGGSGAAS NITIENNHFY GTHGISIGSE TNDGVTNVLV QNNTVQGSDS SGNVSTLNSG LRIKSYPGKG GTVSTVTYTN TCETGVSHLI ELNPRYSSSS GSGTPEFTNI LVNGLKSVSS VSGAQSIIEG YDSSHITGLT LQYVSLDKTA YTAAYANVGV YNSNLNPSSG SGVHLTQLSS PVTSGSVPSC SFPGYPSL