Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4207 |
Symbol | |
ID | 8335561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4768114 |
End bp | 4770000 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957310 |
Product | protein of unknown function DUF181 |
Protein accession | YP_003114912 |
Protein GI | 256393348 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCCCGT GGGCGTCCGC CTGCGCCCGC TTCGAAGAAG GGCTGCGGCA GCGGACCGGC TTGGACGCCG TCGTCGCGCC GCTCGGCGTG CGGGACGAGT TGGACGGCCC GATCTCCGCC CCCGAAACCA GCAGCGGCAT GGACAACAGC TCGATCCCGC TCCACTTCAC CGGACGCCAC ATCATCGCCG GACCGCTCGG CGAGCAGTCC TGCGCACGCT GCCTGGCCCG GCGCTGGCAG CGGATCCGGG ACCCCCTGGT CCGCGACGCG CTGGAGATCG GCGGTCCCAC GCACGCCGCC GGAATGCCGC CGCAGCTCAC CGACTTCGCG CTCGACGCGA CCGCCGCGCT GCTCCTGCAC TTGATCGACG AGGCGCCGAG CACCGGATCG AGCACCGCAG AGCCCTTCGT CTACCAACTC GATCTTGTCG AAGCCAGGAT CCTGCGAACC CAGGTAATGG CCGATCCGGA CTGCCCGGAC TGCCGCAGCC ACGCCAGTTG CGACAACCAG ACCGGTCGCG CCAACCAGAC CGCCTGGACA CCGATCGCCG ATCTGCCGCC GGCTCCGCGC TCCAAGCCCG GCTCGTCCCG GGTCCGGGAC GCCGACGCCT TCGACCTGCC GGCCGACGCG CTGGTCAACC CGGTGTGCGG GGTGCTGGGA CAGGTGAAGA TCGAGGAGCT GGACCTGCCG ACCACGTCCT CGGTCCTCGG CGTCATCGCC GAGCGCGCGG GCGGCCGGCT GTACGAGGTG TTCTGGGGCG GGCATCGCGA CACCTATCGT CGAAGCCTGC GAACCGGGGT CCTGGAGGGC TTCGAGCGCT ACGCCGGGCT GCGTCCGCGC GGCCAGGAAC CGCCGCTGGT CGCCGCTTTC GACGACCTCA CCGTGCCGGC GCTCGATCCG CGCGACTGCG GCATGTACGC GCCGGAGTTC TACGAACAGG ACCCTGAGAC CAAGGCGTTC GCCACGGACC GCAAGATCCC TTGGGTCCGC GGGTACTCGC TGCGGGACGA CCGCGAGATC CTGGTCCCGC TCGTCATGAG CTACTACCAC TGCGAGCCGC ACGCCGAGCG CTTCGTGCAG CAGTGCTCGA ACGGCTGTGC CTCTGGAGGC TCCGTCGCCG AAGCCGTCCT GTCCGGACTG CTGGAGCTGA TCGAACGCGA CGCGTTCCTA CTGACGTGGT TCGGACGCCA GAGCTTGCCC GAGCTCGATC CCCGCACCAG CGAGCACGCC GAGACCAGAC ATCTCGTCGA GCGCCTGGCG ATGTACGGGT ACGAAGCAAG GTTCTTCGAC ACCCGCCTGG CGTTCCCGGT CCCGGTCATC ACCGCGGTCG CGGTACGCCG CACCCCAGGT CTGGGCGCGT TGTGCTTCGG CGCGGGCGCC GCCCTCGACC CGGAGGAGGC GCTGGCCGCC GGGCTGGCCG AGATCGCCAC CGACGCGCTG CACGCCCGGC GCCGAGCCGA GCGCGACGAA GCCGAGCTGC GGGCGATGGC AGCCGATTTC ACGCAGGTCA TCGGGCTGCA CGACCACTCC CGGCTCTACG GACTGCCGGA GATGGCGCAG TATGCCTCCT TCCTGCTCGA CGCGCCGCGC GCCGCTCTAC TGCCGCTGAA GCAGGCGTTC GGTGAAGCGC CGCCGCGTAC CGGCGACCTG CGCGAAGACC TCGCCGCATG CGTCGGGCAC GTCACGGCGG CGGGCTTCGA CACGATCGTC GTCGACCAGA CCGCGCCCGA ACAGCGCCGC CTGGGTCTGT CCACGGTCGG CGTGATCGTG CCGGGGCTCG TGCCGATCGA CTTCGGCTGG ACCCGGCAGC GCGCGCCGCT GATGCCGCGC CTGCACGCCC TCACCGGCGC CACGACGCCG AACCCCGCTC CGCACCCCTT CCCCTGA
|
Protein sequence | MLPWASACAR FEEGLRQRTG LDAVVAPLGV RDELDGPISA PETSSGMDNS SIPLHFTGRH IIAGPLGEQS CARCLARRWQ RIRDPLVRDA LEIGGPTHAA GMPPQLTDFA LDATAALLLH LIDEAPSTGS STAEPFVYQL DLVEARILRT QVMADPDCPD CRSHASCDNQ TGRANQTAWT PIADLPPAPR SKPGSSRVRD ADAFDLPADA LVNPVCGVLG QVKIEELDLP TTSSVLGVIA ERAGGRLYEV FWGGHRDTYR RSLRTGVLEG FERYAGLRPR GQEPPLVAAF DDLTVPALDP RDCGMYAPEF YEQDPETKAF ATDRKIPWVR GYSLRDDREI LVPLVMSYYH CEPHAERFVQ QCSNGCASGG SVAEAVLSGL LELIERDAFL LTWFGRQSLP ELDPRTSEHA ETRHLVERLA MYGYEARFFD TRLAFPVPVI TAVAVRRTPG LGALCFGAGA ALDPEEALAA GLAEIATDAL HARRRAERDE AELRAMAADF TQVIGLHDHS RLYGLPEMAQ YASFLLDAPR AALLPLKQAF GEAPPRTGDL REDLAACVGH VTAAGFDTIV VDQTAPEQRR LGLSTVGVIV PGLVPIDFGW TRQRAPLMPR LHALTGATTP NPAPHPFP
|
| |