Gene Information |
Locus tag | Caci_4201 |
Symbol | |
ID | 8335555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4757608 |
End bp | 4759551 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644957304 |
Product | protein of unknown function DUF181 |
Protein accession | YP_003114906 |
Protein GI | 256393342 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain; [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
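The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the reported gene length is consistent with that reading) and that the CDS ends in a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Caci_4201.
length = gene_length(4757608, 4759551)
print(length)                  # 1944
print(protein_length(length))  # 647
```

Both results agree with the Gene Length (1944 bp) and Protein Length (647 aa) fields in the record.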
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACCC TGACTCCCGT CGAGGAGCTG ACCCGGCTGC TCGGCGAGTA CGCGGCGCGC GACGGTCGCG GGCCCGATAA GGTCCCGACG GTCGTCGAAC TCGGCGGCGA GGACCTGCTG GCCTTTGAGC GCTCGCCGCT GCGCGAGGCC GGTGAGGCGC GTCCGGATCT GCGCGACGCG GACATCTTCC TGACCTCGCG GGCCGTTCTC ATCGGCCCGA GAAGGTCGGC GCGATCGGAG CGGTCGGAGC TGTCGGCGCG GTCGGCGCGG TCGCAGAGGT CTGAGCGGGG ATGCGGGCAC TGTCTGGCGT TGCGACTGCA GCGCATGCAG CCGGTGCATG AGCGCGACCT GCTCGAAACC AGCCCCGGCG CGCACCGGGC CGGCGTGTGG CCGGACGTCG GCGACCAGCT CGCCGCCGCC GCGTGGGCCG CTTTCCGGCA CGCCGAGCTG GTCCCGGCCG GCGACCTGCC GAAGGTCTGG CGTATCGACA TCCGAACCCT GATGATGACG GCCGCGGCGC TGCTGCCTGA CTCGCGCTGC CCCAGCTGCG GGGACCTCGA CCCCGAGCCG CCGGACCTCG ATCTCGGCGC ACCGCGCAAG CCGAGCGCCG AGGCGTACCG ACTGCGCCAT CCCGAGGACT TCGCAGTCCC GGTCGAAGCG CTGGTCAACA CGGTCTGCGG AGTGCTCGCA CCGGCGACGT CGGCCGCGCT GGCCTCGCCG ACGACCGCCC CGGTCGCGGG CTCGGCGGCG GTGCGCGGAC CGACCGGGCT GCACGACCTG AGCTGGAGCG GTCAGGCGAA CTCCTACGCG GCGAGCGCCC GCCTGGCGGT GTTCGAGGGC TTGGAACGGC ACGCCGGGAC GATGCGCAGG CGGCCCGGGC TGGTCGTCGG CAGCTACACC GAACTCGCCG ACCGAGCTCT GAATCCGAGT GACTGCACGG CTTACAGCGC CGATTCCTAC CTCCAGGACG AACACCTCAC GCCGTTCGAC CCGGACCGGA CGATCCCGTG GGTCCCCGGC TACTCGCTGC GCGACAAGCG GTCCGTGCTG GTCCCGGTGC GCCTGGCGTT CTACGGCTGG GACGGCGGCG CGGAGCTGTT CTCCTTCGAG TGCTCCAACG GCTGCGCGAG CGGCGGATGC CTGACCGAGG CGGTCCTGTT CGGTCTGCTG GAGCTGATCG AGCGCGACGC CTTCCTGCTC GCCTGGTACG GCGGCGCGCT GCTGCCCGAG ATCGACACCG GCTCGCTGGA CCGCACGGCG CGCGCGATGC TGAGCCGGGC TCGGCTTCAG GGCTACGACG TCCGCCTGTT CGACAACCGG ATCGACCTGC CGGTACCGGT CGTCACCGGC GTGGCGCGCC GGCGGGACGG CGGCGACGGG CTGTTCTCCT TCGCCGCCGG GGCGGGCATG GACCCGGCGG CGGCGGTGGA GGCCGCCCTC GGCGAGATCC TGACCTACAT CCCCTCGATG CGGCATCGGG TGCGTGCGCG CCGTGAGGAG CTGGTCGCGA TGACGCGGGA CTTCTCGCTG GTTAGCGGGC TGGCCGATCA TCCGGCGCTG TTCGGGCTGC CGGAGATGCA GGAGCACGCC TGGCGTTACA CGCGCCACGC CGATCCGATG CCTGTCGCAG AGCTCTACCA CGAGTGGCTG CGGGTCCGTC CGGCGACCGA CGATTTGGCC GACGACGTCC GCTTCCTGGT GGACGAGGTG GCACGCCGGG GATCGGACGT GATCGTGGTC GATCAGACCA GCGCCGAGCA GCAGGCAGCC GGTCTGTCGG GAGTCCGGGT CATCGCGCCG 
GGACTGCTGC CGATCGACTT CGGCTGGGGG CGGCAGCGCG CGCTGCGGGC TCCGCGGATG TTCTCGGCGC TGCGCCACGC GGGTCTGCGC GAGAGCGACC TGACCGCTGA GGAGCTGCAC ATGGTGCCGC ATCCCTTTCC GTGA
|
Protein sequence | MRTLTPVEEL TRLLGEYAAR DGRGPDKVPT VVELGGEDLL AFERSPLREA GEARPDLRDA DIFLTSRAVL IGPRRSARSE RSELSARSAR SQRSERGCGH CLALRLQRMQ PVHERDLLET SPGAHRAGVW PDVGDQLAAA AWAAFRHAEL VPAGDLPKVW RIDIRTLMMT AAALLPDSRC PSCGDLDPEP PDLDLGAPRK PSAEAYRLRH PEDFAVPVEA LVNTVCGVLA PATSAALASP TTAPVAGSAA VRGPTGLHDL SWSGQANSYA ASARLAVFEG LERHAGTMRR RPGLVVGSYT ELADRALNPS DCTAYSADSY LQDEHLTPFD PDRTIPWVPG YSLRDKRSVL VPVRLAFYGW DGGAELFSFE CSNGCASGGC LTEAVLFGLL ELIERDAFLL AWYGGALLPE IDTGSLDRTA RAMLSRARLQ GYDVRLFDNR IDLPVPVVTG VARRRDGGDG LFSFAAGAGM DPAAAVEAAL GEILTYIPSM RHRVRARREE LVAMTRDFSL VSGLADHPAL FGLPEMQEHA WRYTRHADPM PVAELYHEWL RVRPATDDLA DDVRFLVDEV ARRGSDVIVV DQTSAEQQAA GLSGVRVIAP GLLPIDFGWG RQRALRAPRM FSALRHAGLR ESDLTAEELH MVPHPFP
|
| |
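The sequence fields are displayed in space-separated blocks of ten, so they need to be cleaned before any calculation. The sketch below shows one way to strip the spacing, compute GC content, and translate a fragment of the gene sequence. It assumes the gene sequence is already given in coding orientation (it starts with ATG even though the gene lies on the minus strand), and it uses only the amino acid assignments of translation table 11, which match the standard code; the alternative start codons of table 11 are not handled. All names are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments of table 11 are identical to the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon or the end."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAGGACCC TGACTCCCGT CGAGGAGCTG"
print(translate(first_blocks))  # MRTLTPVEEL
```

The translated fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.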