Gene Information |
Locus tag | Caci_6877 |
Symbol | |
ID | 8338243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 7945280 |
End bp | 7946911 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644959965 |
Product | hypothetical protein |
Protein accession | YP_003117556 |
Protein GI | 256395992 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
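The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 1632 bp), and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(7945280, 7946911)
print(length)                  # 1632
print(protein_length(length))  # 543
```

Both values agree with the Gene Length and Protein Length rows of the record.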
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0249011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCA CACGACGTAC TCTCGGAGTC GGAGCGCTCT CAGCCCTCGC ACTGCCCCTG GTCGACCAGG GCCTGGCCCG GGCCGCCGGA TCCGGCGCCC ACGACGCGGC GGCGCCGGCC GCGACCGGGC AGGCCGCGCC GGCGAAGGCG GCCGCTTCGG CCGTCGTGCC CTACCCGACG GCCCCGTACC TGACCAAGAG CACCGCCTAC ACGCTGAGCG TGAACTCGCA GGCGATCGAC GTGCGCAAGC ACTTCGACTA CTCGATCGCG CAGTTCTCGT ACTCTGGGAC GGCGACCTTC ACGATCACGG CGTCGGAGAC CATCACCTCC TACAACATCA GCCCCCACAG TTACGGCGTC AAAGCCACCA AAAGCGGGAA GACCCTCACC TTCTCGCTCA CCCAGACCCA GTCGCGGTAT CTGGTGATCA AGGTCAACGG CTTGGAGAAC CTCGTCATCG CGGCCGACCC CCTGGAGGCC GGCATCCCGG TGCCCAACGG CGGCAGCGTG AAGAACATCC TGGACTACTC GGGGATCGAC AGAACGGGCA ACACGCTCAT GACGTCGAAG ATCCAGAAGG CCATCGACGA CGCGGCGGCC CGCAGCGGCG GCGGCACGGT CTACGTTCCC GCCGGCGTCT ACAAGTTCAC GCAGATCGAG CTGAAGAGCC ACGTCACCGT CTACCTCGCC GCCGGTGCGG TGCTGCGCGG CTCGTCCAAG GTGGGCGACT ACGACTTCAG CGGCAAACAC TTCCACGCGG CGAACGTCCG CATCGTCGGC GCCGCCAACG CCTCGATCAA GGGCCGGGGG ACCATCGACT CCAACGGCAG TGTGCTGACG TCCGGACCGA GCGGGTCCAA CCGCGAGAAC ATCATCGCCT CGCTGAAGAA CTCGCAGGGA ACGAAGCCGG ACACGCTGGT CTTCGAGGGC ATCACGCTGC GGGACGGCAC GACCTGGAAC TTCAACCTCA AGGACTCGAC CCACGTCACG ATCACCAACG TGAAGATCTT CAACAACGTC CACTGGATCC ACGGGGACGG CTTCGACCTG GTCAACGTCT CCCATGCCGT GGTGGACCAG TGCCTGGCCT GCACCGGCGA CGACGCGTTC GACGCGAAGA GCTCCTCGAC GGAGCCGATG ACGGACCTGG TGTATCAGAA CTCTGTCGCC TACACCCAGG CCGCCGGCAC CAAGCTCGGC ATGCAGGGCG CGGGCGCCAT GAGCGACATC TGGTTCAAGA ACATCGACGT GATCCAGGGA TACCGCGGGG TCAGCGTCTC CCACGACGAG GGCGGCGGTG CCTGGAGCGG CATCCACTTC ACCGACATCC GCACCGAGAA GATCTACAAC AACGGCACGT CCGGCGAGTT CCGGACCGCG CCGATCCTCA TCTGGACCGC GGAGTACTCG GGGTCGACCA GCGGGCCGAT CACCGGCGTG GAACTGGTCC GGGTCAGCTT CGAGAACATC AACGGCTTCC ACTCGATCAT CCAAGGAGAG AACAGCAAGA GCAAGGTCTC CAACGTCAGC TTCACGGATC TGACGATCAA CGGACGGCCG GTCAAGAAGG CCTCCGACGG CCTGATCGAC ATCAACGCGA ACACCTCCGA CATCACGTTC GCCGTCTCCT AG
|
Protein sequence | MNITRRTLGV GALSALALPL VDQGLARAAG SGAHDAAAPA ATGQAAPAKA AASAVVPYPT APYLTKSTAY TLSVNSQAID VRKHFDYSIA QFSYSGTATF TITASETITS YNISPHSYGV KATKSGKTLT FSLTQTQSRY LVIKVNGLEN LVIAADPLEA GIPVPNGGSV KNILDYSGID RTGNTLMTSK IQKAIDDAAA RSGGGTVYVP AGVYKFTQIE LKSHVTVYLA AGAVLRGSSK VGDYDFSGKH FHAANVRIVG AANASIKGRG TIDSNGSVLT SGPSGSNREN IIASLKNSQG TKPDTLVFEG ITLRDGTTWN FNLKDSTHVT ITNVKIFNNV HWIHGDGFDL VNVSHAVVDQ CLACTGDDAF DAKSSSTEPM TDLVYQNSVA YTQAAGTKLG MQGAGAMSDI WFKNIDVIQG YRGVSVSHDE GGGAWSGIHF TDIRTEKIYN NGTSGEFRTA PILIWTAEYS GSTSGPITGV ELVRVSFENI NGFHSIIQGE NSKSKVSNVS FTDLTINGRP VKKASDGLID INANTSDITF AVS
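The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not the database's own method, of how one might strip the spacing, compute a GC fraction, and translate the coding sequence. It builds the codon table from the conventional NCBI TCAG ordering; translation table 11 uses the same amino-acid assignments as the standard code for internal codons. All function and variable names are hypothetical. For brevity it is applied only to the first 30 bases of the gene sequence shown above.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAACATCA CACGACGTAC TCTCGGAGTC"
print(gc_fraction(prefix))  # 0.5
print(translate(prefix))    # MNITRRTLGV
```

The translated prefix matches the first block of the protein sequence in the record. Running the same functions on the full gene sequence would be the natural way to check the reported GC content and protein length.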