Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5780 |
Symbol | |
ID | 8337141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6679079 |
End bp | 6680140 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644958884 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003116479 |
Protein GI | 256394915 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAGCG CCTGGGCCCT GGTCGTCTCC GCCCTTCTCC TGGCCGCCAA CGCCTTCTTC GTCGCCGCCG AATTCGCCCT CGTCACCAGC AAACGACACC GCCTGGAGGC GGCCGCCGCC GAGGGCAGTC GCGCGGCGCG CGTCGCGGTG GCCGGCACGC GCGAGCTGTC CCTGATGCTG GCGGGCACCC AACTGGGCAT CACGTTGTGC ACGCTGGGTT TGGGCGCCCT GGCCGAGCCC GCGGTCGCGC ACCTGCTGGA CCCGGTGCTC TCCGCGACCG GGCTGCCCGA GGGCGTGTCG TACGGGATCG CGTTCGCCGC GAGCCTGGCG CTGGTCGTGT TCCTGCACAT GGTCGTCGGC GAGATGGCGC CGAAGTCCTG GTCGATCACC CACCCGGAAC GGTCCGCGGC CCTGGTCGCG CTGCCGTTCC GGGCTTTCAC GCAGCTGGTG CGCTGGCCGC TGGTCGCCCT CAACGGCATG ACCAACGGCC TGTTGCGCCT GCTGAAGGTG GAGCCGCAAA GCGAACTGGC CGAAGCGCAC AGTCCCGAAG ACCTGCGGAT GCTGGTCCGA CAGTCCGCCG AGCACGGACT GATCCCCGCA GTGCAGCAAA GGCTTCTGGC CCAAGCGCTG CGCCTACAGA ACACGCCGCT GTCCGAGGTC ATGATCGCCT GGACGGACGC CGTCACCGTT CCCTGCGATT CCACCGCGAG CGCCGTGGAG GACCTGAGCC GCGCCACCGG CCACTCCCGC TTCCCGGTCA CCGGCGCCGA CGGCAACCCG GTCGGCCTGG TCCACGTCCG CGACGCGGTG CGCGCGACCA CCGCCGGTCT CGACCCGGAC GTGTCCGACC TGCGCAGCAC CGCGCTGACG CTGCGCGCGG ACCAGACCGG AGCCGAGGCG GTCAGCGTGA TGCGGCGGTA TCGCTCGCAA CTGGCTCTGG TGAAGAGCGG CGCCGGGGGC GACGAGGGAG CTCAGGACAC TGATGCGGTG GTCGGTGTCG TGGCACTGGA GGATCTGTTG GAAGAGCTCA TCGGCGAATT CCAGGACGAG ACAGATATCT GA
|
Protein sequence | MNSAWALVVS ALLLAANAFF VAAEFALVTS KRHRLEAAAA EGSRAARVAV AGTRELSLML AGTQLGITLC TLGLGALAEP AVAHLLDPVL SATGLPEGVS YGIAFAASLA LVVFLHMVVG EMAPKSWSIT HPERSAALVA LPFRAFTQLV RWPLVALNGM TNGLLRLLKV EPQSELAEAH SPEDLRMLVR QSAEHGLIPA VQQRLLAQAL RLQNTPLSEV MIAWTDAVTV PCDSTASAVE DLSRATGHSR FPVTGADGNP VGLVHVRDAV RATTAGLDPD VSDLRSTALT LRADQTGAEA VSVMRRYRSQ LALVKSGAGG DEGAQDTDAV VGVVALEDLL EELIGEFQDE TDI
|
| |