Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0268 |
Symbol | |
ID | 8331595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 300047 |
End bp | 303094 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644953435 |
Product | hypothetical protein |
Protein accession | YP_003111062 |
Protein GI | 256389498 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.564116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGCA CGCACATGCC CAGCACCAGC GAGCCCGCCT TCCACCAGAT GACCTGGACC CCCGAGGACC TGCGCGCGGC CCTGGAGGCG AACGAGAACG AGCCGCCGGG CCGAGCCCGA TCGGTCCGCG CCGAGACGCT GTTGGCCGCC GCCGACAAGC TCGGCGACCC CGAGACCCAG ATCTGCGCCC TGCACACCGT CATCGAAGCC TACGAGCGCG GCGGGGAGAG CTTCCGCTCG CCGGTGCTCT TCTCCCGCCT CCTGCGCCTG TGGGACCGCC ACGGCAAGAC CCTGCGCGAC GCGAGCCGTC TGGAATACGA GACGCACTGG GTCTTCAAGT GGATGACCTC CGACCTGCTC TCGGTCCCCG AGGTCCCGCT GGCGACGGTC ACCGGCTTCG TCGATGAGAT GGAGCGCCGC TACCGCCTGG CTGGATACGG CATGCGGGCC GTCCACGCGC AGCGCTTCCG CATAGCGGAG CACCTGGGCG ACACCGCGCA GGCCGAGGTG CACTTCGGAC GCTGGCTGTC GGCCGATCGC GACCTGATGA GCGACTGCCG CGCCTGCGAG CACCTGACGC AGGGCGTGTG GCGGGCGGAG AACGGCGACG ACCTCGCCGC GATGCGGCTG TGGCGGCCGA CCGTCGAGAA CGAGATCTCG TGTCTGGACG AGCCCGCCAG CACTCTTGCC GCGTCCCTGA AGCCACTGCT GCGTCTGCGC CGCTACGACG AAGCACGCTC GAATCACCTG CGTGGCTACC GCTTGCTGCG CGGACACGTC GAGCTGCGCA CCGCCTTCGG ACGCCACATC GAGTTCTGCG TGCTGTCTGG AAACGGCGAA CGCGCCCTGG AGATCCTCGC CGAGAACCGC CGCCTGTTCG ACCCGCCCTA CGAGCCGCTG GACTACCTCG AGTTCCTGTC CTGCGTCGCG TTGCTGCTGC GCAGCCGGGT CGACGCCGGA TCGACCGCGA TCGTCGCCGG ACCCGAGGGA CGGGACTGGC CGGTCGCGGA GTTGCTGGCG CGCGTGCGAT CGGAGATCGA CGACCTCTCC GGACGCTTCG ACCGCCGCAA CGGCACCAAG GTCGTGAGCG CGCGCGTCGC CGCCACGATG GACCAGACGT GGCTGGTCGC GGAGCTGCCG CTGATCGTCT CGCAGCGACC GCGACCCCAG CAGTTCCAGA AGCCGGAGCA GGCGCAGCAG CCCGGCCAGG CGCACGAGCC GGATCAGGCG CCGCTGGCTG CCGTGCCCGC CCAGCGCGAA CCCGTCGCCG AGCCGCCCGA CCCGGACGCG GACACCGACT TCTACGACCT TCTCGCCGAG GCGCGCCGGC TGTTCAACGT CGCGCACCCC AGCGCGATGA AGATGTGGGA ACGCGTCGCG ATCGCGGCCG ATCGCCGAGG CATCGTGCTG GACCTTGAGG CGCAGGCCCA ACTCGCCGAG GAACGCGCCG CCGAAGCGCT CGACCGCGAG GACATGGACC GGGCCGTCAC GCTGCTCGGC GAGGCTGTCG AGCGCTACAA GGCAGGTGGG CTGGAAGGGC GCGCGGTCGC GGTGCAGGCG CGCAGGCTGC TGGCCGAGGC GTTGCAGAAG AAGACCTACG AGCCCATTCC GGAGCAGGCA CTCGCCGCGC TCTACGCGAC AGCTCGAGTC CTGCAAGCAC GCGGACTGGC CGAGCCGGAG GACATCCTCA CCGTCCGCCG CGCGCAGGCC TTCGAAGCGC GCAGGACGAC CGAGGTCGGC GCCGACACCG ACACCGGCAC CGGCACCGAC CGCGACCGCG AATCAGCCTT CGACCACTTC GCAGCAAGCG TCGAAGCGCT CCTCGCCGAT GCCATCGAGT TCGACGTCCC GGCCCGCGCC GCCGCCGCGC ACACCATGCG CGCCGAGATC GCCCAGCGGC GCGGTCGGCC GGAGGAATCG GTCCCGGAGC TGCTGGCGGC CATAGCGCTG TACGACCGCG CCGGGCGTCC GTGGGCGAAT CTGCACCCGA ACATGCTCCT CGCGCAGGCG TACCTCGCCT CCGACCGCGA CGCAGACGCC GAGCGCGCCG GGCTGGCCGC CCTGGACATC GCCGAACGCT GGCCCGAGCA GCGGTTCCCG GCCGGATACA CCCGCCAGGT GCTGGCCAAC GCGACCGGCG GGCAGGGCCG CTACACCGAC AGCGCCGAGC ACGCGCTGCA TGCCGTCGGC TGGGCCGACC GGCACGGCGT CCCGGACCTG GCCGCGAGCG CGCGCCACAG CCTGGCGTTC GCCTACGAGC AGCTCGGCCG CGACGCCGAC GCCGCCGCGA TCCTGGAATC CGCGCTGCCG GAGATGATCC GGCACTTGGA CGACCCGACG GTCGTCAACG CCCGCTGGGC CCTGGCCCGC TGCCTCGGCC GTCTGGAGGA CTACCGGGGC GCCGCCGAGC AGTACCTGCT GGCCGCGTCG ATCGCCGAGC ACTTCCCGCA GCAGGGCGGC CACGCGATGC TCGCGGCGTC CGCCGGACAC GCGCTGCGCG CCGCCGGGCT CGCCGACGAG GCGCGCCGGG CGTTCGACCG CGCCGTGATC CTGCTGCGCG CGCTGCCGGA CCCGATCAAC CTCGCCAAGA CCCTCAGGGC CCTGGCCTGG GTGACCTTCG GCGAGTCGGA GGAGACCGCG CACGACATCG AGCGCGAGGA CGTCCTGGAC CAGGTGCTGC TCCTGTTCGC GGAGGCGGCG CAGGTCCTGG AGACCGCCGA GGCCTCCGGC GCCTATGAGC GCGACGCCCA GGTCATCGCC TACGAACTGG CCGAGACCGA CGACCAGCTG GCCCGCCTGC ACCTGAACGC GGACCTGTCC GACAAGGCGA TGCCGTACGC CGAGCGCGCG GCCGCCGGGT TCCGCGCGCT GCTCCCGCAC AGCGCGATGG ACTACGACTT CTCCGAGCAG ATGGTCGCCT GGCTGCTGGA CCGGTACGGC AGCCGCGACG CGGCGGTGGA GCGGCTGCGC GAGGCGATCG CCGCGTGCAC CGAGGCCGGG GTCGAGGCGG TACGGTGCGT GGCGTTCCTG GAGCAGCTCG GGGACTGA
|
Protein sequence | MSSTHMPSTS EPAFHQMTWT PEDLRAALEA NENEPPGRAR SVRAETLLAA ADKLGDPETQ ICALHTVIEA YERGGESFRS PVLFSRLLRL WDRHGKTLRD ASRLEYETHW VFKWMTSDLL SVPEVPLATV TGFVDEMERR YRLAGYGMRA VHAQRFRIAE HLGDTAQAEV HFGRWLSADR DLMSDCRACE HLTQGVWRAE NGDDLAAMRL WRPTVENEIS CLDEPASTLA ASLKPLLRLR RYDEARSNHL RGYRLLRGHV ELRTAFGRHI EFCVLSGNGE RALEILAENR RLFDPPYEPL DYLEFLSCVA LLLRSRVDAG STAIVAGPEG RDWPVAELLA RVRSEIDDLS GRFDRRNGTK VVSARVAATM DQTWLVAELP LIVSQRPRPQ QFQKPEQAQQ PGQAHEPDQA PLAAVPAQRE PVAEPPDPDA DTDFYDLLAE ARRLFNVAHP SAMKMWERVA IAADRRGIVL DLEAQAQLAE ERAAEALDRE DMDRAVTLLG EAVERYKAGG LEGRAVAVQA RRLLAEALQK KTYEPIPEQA LAALYATARV LQARGLAEPE DILTVRRAQA FEARRTTEVG ADTDTGTGTD RDRESAFDHF AASVEALLAD AIEFDVPARA AAAHTMRAEI AQRRGRPEES VPELLAAIAL YDRAGRPWAN LHPNMLLAQA YLASDRDADA ERAGLAALDI AERWPEQRFP AGYTRQVLAN ATGGQGRYTD SAEHALHAVG WADRHGVPDL AASARHSLAF AYEQLGRDAD AAAILESALP EMIRHLDDPT VVNARWALAR CLGRLEDYRG AAEQYLLAAS IAEHFPQQGG HAMLAASAGH ALRAAGLADE ARRAFDRAVI LLRALPDPIN LAKTLRALAW VTFGESEETA HDIEREDVLD QVLLLFAEAA QVLETAEASG AYERDAQVIA YELAETDDQL ARLHLNADLS DKAMPYAERA AAGFRALLPH SAMDYDFSEQ MVAWLLDRYG SRDAAVERLR EAIAACTEAG VEAVRCVAFL EQLGD
|
| |