Gene Caci_0268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0268 
Symbol 
ID8331595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp300047 
End bp303094 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content72% 
IMG OID644953435 
Producthypothetical protein 
Protein accessionYP_003111062 
Protein GI256389498 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.564116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGCA CGCACATGCC CAGCACCAGC GAGCCCGCCT TCCACCAGAT GACCTGGACC 
CCCGAGGACC TGCGCGCGGC CCTGGAGGCG AACGAGAACG AGCCGCCGGG CCGAGCCCGA
TCGGTCCGCG CCGAGACGCT GTTGGCCGCC GCCGACAAGC TCGGCGACCC CGAGACCCAG
ATCTGCGCCC TGCACACCGT CATCGAAGCC TACGAGCGCG GCGGGGAGAG CTTCCGCTCG
CCGGTGCTCT TCTCCCGCCT CCTGCGCCTG TGGGACCGCC ACGGCAAGAC CCTGCGCGAC
GCGAGCCGTC TGGAATACGA GACGCACTGG GTCTTCAAGT GGATGACCTC CGACCTGCTC
TCGGTCCCCG AGGTCCCGCT GGCGACGGTC ACCGGCTTCG TCGATGAGAT GGAGCGCCGC
TACCGCCTGG CTGGATACGG CATGCGGGCC GTCCACGCGC AGCGCTTCCG CATAGCGGAG
CACCTGGGCG ACACCGCGCA GGCCGAGGTG CACTTCGGAC GCTGGCTGTC GGCCGATCGC
GACCTGATGA GCGACTGCCG CGCCTGCGAG CACCTGACGC AGGGCGTGTG GCGGGCGGAG
AACGGCGACG ACCTCGCCGC GATGCGGCTG TGGCGGCCGA CCGTCGAGAA CGAGATCTCG
TGTCTGGACG AGCCCGCCAG CACTCTTGCC GCGTCCCTGA AGCCACTGCT GCGTCTGCGC
CGCTACGACG AAGCACGCTC GAATCACCTG CGTGGCTACC GCTTGCTGCG CGGACACGTC
GAGCTGCGCA CCGCCTTCGG ACGCCACATC GAGTTCTGCG TGCTGTCTGG AAACGGCGAA
CGCGCCCTGG AGATCCTCGC CGAGAACCGC CGCCTGTTCG ACCCGCCCTA CGAGCCGCTG
GACTACCTCG AGTTCCTGTC CTGCGTCGCG TTGCTGCTGC GCAGCCGGGT CGACGCCGGA
TCGACCGCGA TCGTCGCCGG ACCCGAGGGA CGGGACTGGC CGGTCGCGGA GTTGCTGGCG
CGCGTGCGAT CGGAGATCGA CGACCTCTCC GGACGCTTCG ACCGCCGCAA CGGCACCAAG
GTCGTGAGCG CGCGCGTCGC CGCCACGATG GACCAGACGT GGCTGGTCGC GGAGCTGCCG
CTGATCGTCT CGCAGCGACC GCGACCCCAG CAGTTCCAGA AGCCGGAGCA GGCGCAGCAG
CCCGGCCAGG CGCACGAGCC GGATCAGGCG CCGCTGGCTG CCGTGCCCGC CCAGCGCGAA
CCCGTCGCCG AGCCGCCCGA CCCGGACGCG GACACCGACT TCTACGACCT TCTCGCCGAG
GCGCGCCGGC TGTTCAACGT CGCGCACCCC AGCGCGATGA AGATGTGGGA ACGCGTCGCG
ATCGCGGCCG ATCGCCGAGG CATCGTGCTG GACCTTGAGG CGCAGGCCCA ACTCGCCGAG
GAACGCGCCG CCGAAGCGCT CGACCGCGAG GACATGGACC GGGCCGTCAC GCTGCTCGGC
GAGGCTGTCG AGCGCTACAA GGCAGGTGGG CTGGAAGGGC GCGCGGTCGC GGTGCAGGCG
CGCAGGCTGC TGGCCGAGGC GTTGCAGAAG AAGACCTACG AGCCCATTCC GGAGCAGGCA
CTCGCCGCGC TCTACGCGAC AGCTCGAGTC CTGCAAGCAC GCGGACTGGC CGAGCCGGAG
GACATCCTCA CCGTCCGCCG CGCGCAGGCC TTCGAAGCGC GCAGGACGAC CGAGGTCGGC
GCCGACACCG ACACCGGCAC CGGCACCGAC CGCGACCGCG AATCAGCCTT CGACCACTTC
GCAGCAAGCG TCGAAGCGCT CCTCGCCGAT GCCATCGAGT TCGACGTCCC GGCCCGCGCC
GCCGCCGCGC ACACCATGCG CGCCGAGATC GCCCAGCGGC GCGGTCGGCC GGAGGAATCG
GTCCCGGAGC TGCTGGCGGC CATAGCGCTG TACGACCGCG CCGGGCGTCC GTGGGCGAAT
CTGCACCCGA ACATGCTCCT CGCGCAGGCG TACCTCGCCT CCGACCGCGA CGCAGACGCC
GAGCGCGCCG GGCTGGCCGC CCTGGACATC GCCGAACGCT GGCCCGAGCA GCGGTTCCCG
GCCGGATACA CCCGCCAGGT GCTGGCCAAC GCGACCGGCG GGCAGGGCCG CTACACCGAC
AGCGCCGAGC ACGCGCTGCA TGCCGTCGGC TGGGCCGACC GGCACGGCGT CCCGGACCTG
GCCGCGAGCG CGCGCCACAG CCTGGCGTTC GCCTACGAGC AGCTCGGCCG CGACGCCGAC
GCCGCCGCGA TCCTGGAATC CGCGCTGCCG GAGATGATCC GGCACTTGGA CGACCCGACG
GTCGTCAACG CCCGCTGGGC CCTGGCCCGC TGCCTCGGCC GTCTGGAGGA CTACCGGGGC
GCCGCCGAGC AGTACCTGCT GGCCGCGTCG ATCGCCGAGC ACTTCCCGCA GCAGGGCGGC
CACGCGATGC TCGCGGCGTC CGCCGGACAC GCGCTGCGCG CCGCCGGGCT CGCCGACGAG
GCGCGCCGGG CGTTCGACCG CGCCGTGATC CTGCTGCGCG CGCTGCCGGA CCCGATCAAC
CTCGCCAAGA CCCTCAGGGC CCTGGCCTGG GTGACCTTCG GCGAGTCGGA GGAGACCGCG
CACGACATCG AGCGCGAGGA CGTCCTGGAC CAGGTGCTGC TCCTGTTCGC GGAGGCGGCG
CAGGTCCTGG AGACCGCCGA GGCCTCCGGC GCCTATGAGC GCGACGCCCA GGTCATCGCC
TACGAACTGG CCGAGACCGA CGACCAGCTG GCCCGCCTGC ACCTGAACGC GGACCTGTCC
GACAAGGCGA TGCCGTACGC CGAGCGCGCG GCCGCCGGGT TCCGCGCGCT GCTCCCGCAC
AGCGCGATGG ACTACGACTT CTCCGAGCAG ATGGTCGCCT GGCTGCTGGA CCGGTACGGC
AGCCGCGACG CGGCGGTGGA GCGGCTGCGC GAGGCGATCG CCGCGTGCAC CGAGGCCGGG
GTCGAGGCGG TACGGTGCGT GGCGTTCCTG GAGCAGCTCG GGGACTGA
 
Protein sequence
MSSTHMPSTS EPAFHQMTWT PEDLRAALEA NENEPPGRAR SVRAETLLAA ADKLGDPETQ 
ICALHTVIEA YERGGESFRS PVLFSRLLRL WDRHGKTLRD ASRLEYETHW VFKWMTSDLL
SVPEVPLATV TGFVDEMERR YRLAGYGMRA VHAQRFRIAE HLGDTAQAEV HFGRWLSADR
DLMSDCRACE HLTQGVWRAE NGDDLAAMRL WRPTVENEIS CLDEPASTLA ASLKPLLRLR
RYDEARSNHL RGYRLLRGHV ELRTAFGRHI EFCVLSGNGE RALEILAENR RLFDPPYEPL
DYLEFLSCVA LLLRSRVDAG STAIVAGPEG RDWPVAELLA RVRSEIDDLS GRFDRRNGTK
VVSARVAATM DQTWLVAELP LIVSQRPRPQ QFQKPEQAQQ PGQAHEPDQA PLAAVPAQRE
PVAEPPDPDA DTDFYDLLAE ARRLFNVAHP SAMKMWERVA IAADRRGIVL DLEAQAQLAE
ERAAEALDRE DMDRAVTLLG EAVERYKAGG LEGRAVAVQA RRLLAEALQK KTYEPIPEQA
LAALYATARV LQARGLAEPE DILTVRRAQA FEARRTTEVG ADTDTGTGTD RDRESAFDHF
AASVEALLAD AIEFDVPARA AAAHTMRAEI AQRRGRPEES VPELLAAIAL YDRAGRPWAN
LHPNMLLAQA YLASDRDADA ERAGLAALDI AERWPEQRFP AGYTRQVLAN ATGGQGRYTD
SAEHALHAVG WADRHGVPDL AASARHSLAF AYEQLGRDAD AAAILESALP EMIRHLDDPT
VVNARWALAR CLGRLEDYRG AAEQYLLAAS IAEHFPQQGG HAMLAASAGH ALRAAGLADE
ARRAFDRAVI LLRALPDPIN LAKTLRALAW VTFGESEETA HDIEREDVLD QVLLLFAEAA
QVLETAEASG AYERDAQVIA YELAETDDQL ARLHLNADLS DKAMPYAERA AAGFRALLPH
SAMDYDFSEQ MVAWLLDRYG SRDAAVERLR EAIAACTEAG VEAVRCVAFL EQLGD