Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5225 |
Symbol | |
ID | 8336579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6014274 |
End bp | 6016118 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644958323 |
Product | glycoside hydrolase family 76 |
Protein accession | YP_003115925 |
Protein GI | 256394361 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4833] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0174562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCAGAA TCCTGATCCT CCTCACAGCT CTGGTCACGG CGCTGGTACC GCTGCTCGCC ATGGCGCCGC CGCGTGCGAA CGCGGCGAGC GCCGTGTGCG CGCTGTACTG CGACACGCGT GATCCCTCGC TGGCGCAGCA GGAGACGTTC CCGACCCCGA ACGTCTCCGA GAACGGCCGC GTGATCGCAT TGCACGTGGA CGACGTCGAC GGCATGGCCT GGGCCAGCAT CGACAACGGC CGGCTGAACG ACTCGGTCTG GATCGACCGG TCGTGGGACG CCGGCAGCAG CTGGGACGGC TTGTTGGGCA AGGCGTGGAT CCCGAGCTCG TGGACCGGTA CACGGACCCT GATGTACAAC ATGTACGACC CCTCCGACCA CCGCCGCGCG GTGGTGCGTG CCTGCGGCGA CGCCAGCGGG GTGGTGTGTA CGAACTGGGT CCACCTGCCG GTGTGCGCGG CGCGGTGTGA CGGCGCCGAT TCCAGGACCT CGGTCGGGAA CACCTCACCG GTCCCCGACG CCACGCTGTC CGGGCGCGAC ATCGCGCTGC ACGTCGACTC CGGCGGCATG GCCTGGGCTT CGATCGCCGG CGGAGCGCCC GGCGACGAGG TGTGGCTGGA CCGGTCGTGG GACGGCGGCG CGACGTGGCC GGACGGCTCG AGCAAGGGCC GGGTGAGCGT GCCGTCGGGG GCGTCCGGTA CTCAGACTAT TGAGATCAAC ATCGACGATC CGTTGGGCCG GCTGGCCGGG GGCGCCGTGC GCGCCTGCGG GCGTGCGGTG ACCGGGCAGA ACGGCAGCTG CACGGCGTGG GCGCGCGCCG CCGCGGTCCC GGCGAAGGCT GCCGCCGACG CGCTGATGTG GTCTTATGAC CCCTCCAACG CATGGTGGCC GTCGAGCTGG TGGAATTCGG CGGTCGCACT GACGTCGGTG ATCGACTACA CGCGCGGCTC GGGCGATACG GCATACGAGT GGATCGTCGA CCGCACGTTC CAGGTGAACA AGGTCGCCTT CCCGGCCGGC GCGCGCAGCT CGGACCCCAT CCAGGGCGAC TTCATCAGCC AGGCGACCGA CGACACCGAG TGGTGGGCGC TGGCGTGGAT CGACGCGTAC GACCTGACGG GGAATCGGAC GTACCTGAAC GAGGCCGTCA CCATCACGAA CCATGTCAGT TCCCTGTGGA ACACCAGCAC CTGCGGCGGC GGCGTGTGGT GGAACACGCA GAAGACGTAC AAGAACGCGG TGACCAATGC GCTGTATGTG GATCTGACCG CCGCGCTGCA CAACCGCATC GCGGGCGACA CGGCGTGGCT GGCGCGGGCG ACGACGTCCT GGAACTGGTT CCGCTCCAGC GGACTGATCA ACGGCTCGGG TCTGGTCAAC GACGGCCTGA CGAACGCGTG CACGAACAAC GGCCAGACGG TCTGGACGTA CAACCAAGGG CTGGCCATCG GCGCGGCGCA GGAGATGTAC CGCGCGACCG GCGACAGCGG CGACCTGAGC GAGGCGCGCC ACCTCGCCGA CTCGGCGGTG CACTCCCCCA CACTGGTGAC GAACGGGCTG CTCACGGAGT CGTGCGATGC GCTGACCGCC ACCTGCGACG ACAACCAGAA GCAGTTCAAG GGGATCTTCA TGCGCTTCCT GGGCGAGCTG AACGCCGACG CGTCGGTCGG TGGCGCGTAC AGCACGTTCA TCCAGGCGCA GACGTCGTCG CTGTGGAACG CGGACCGGAA CTCGCTCAAC CAGCTCGGGG AGCGATGGTC GGGGCAGGGC TCGGGGACGA ATCCGAATGT GAGCGATTGG CGGACGCAAG CGAGCGGGTT GGAGGCGCTG GACGCGGGGG TTTGA
|
Protein sequence | MRRILILLTA LVTALVPLLA MAPPRANAAS AVCALYCDTR DPSLAQQETF PTPNVSENGR VIALHVDDVD GMAWASIDNG RLNDSVWIDR SWDAGSSWDG LLGKAWIPSS WTGTRTLMYN MYDPSDHRRA VVRACGDASG VVCTNWVHLP VCAARCDGAD SRTSVGNTSP VPDATLSGRD IALHVDSGGM AWASIAGGAP GDEVWLDRSW DGGATWPDGS SKGRVSVPSG ASGTQTIEIN IDDPLGRLAG GAVRACGRAV TGQNGSCTAW ARAAAVPAKA AADALMWSYD PSNAWWPSSW WNSAVALTSV IDYTRGSGDT AYEWIVDRTF QVNKVAFPAG ARSSDPIQGD FISQATDDTE WWALAWIDAY DLTGNRTYLN EAVTITNHVS SLWNTSTCGG GVWWNTQKTY KNAVTNALYV DLTAALHNRI AGDTAWLARA TTSWNWFRSS GLINGSGLVN DGLTNACTNN GQTVWTYNQG LAIGAAQEMY RATGDSGDLS EARHLADSAV HSPTLVTNGL LTESCDALTA TCDDNQKQFK GIFMRFLGEL NADASVGGAY STFIQAQTSS LWNADRNSLN QLGERWSGQG SGTNPNVSDW RTQASGLEAL DAGV
|
| |