Gene Caci_4972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4972 
Symbol 
ID8336326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5685340 
End bp5687238 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content68% 
IMG OID644958071 
ProductXylan 1,4-beta-xylosidase 
Protein accessionYP_003115673 
Protein GI256394109 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.122581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.3032 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTTA CAGCCATGCT CAGACGCAGG ACGCGGCTGG TCGCTGTGTT CCTGCTGGTG 
TTCGCGGCCG CGCTGATGAT GCCTGGCCTC GGGTTGTCGG GGACGGCCAA GGCGGCGGCG
GCCTCGTCCC TGCCGTGCGA CCTCTACTCG GCCGGCGGCA CGCCCTGCGA GGCGGCGTAC
AGCACGACCC GCGCGTTGTC AGTCTCCTAC GCCGGGCCGC TGTACCAGGT CCAGCGTGCC
TCGGACGGCA GCCGCCTCGA CGTCCGTGTG CGGTCGGCCG GCGGGGTCGT CGACGTCGCG
CCCGAAAACA GCTTCTGCGG CGGCACGACC TGCACGATCA CCGAGCTGTA CGACCAGACC
GCCAACGCCA ACCACATGCC GATCTCGCCT GGGACGTCGT GCTCCGGGTG CTCGCACGGG
ATCGCCGGTC CCGGGCCCAA CGGAGCGGAC ATCGGCGCGT CAGCCTTGGC GCTGCCGGTG
ACGGTCGGAG GCCAGCCGGC GTACGGAGCC CTGTTCAACG CCCAGGGGAT CGGCTATCGC
ATCACCAACG CCAAGAACGT GCCGACCGGC TCGCAGCCCG AGGGCGTCTA CATGCTCACC
TCGTCGAACC TGACCAGCAA CGGCTGCTGC TTCGACTTCG GTGCGGGGGA GAGCAACGAC
ACCGACGACG GCAACGCCAC CATGAACGCG ATCTACTACG GCACCGACTG CTGGACCCAG
AACTGCACCG GCCCGGGACC GTGGGTGGGC GGCGATCTGG AGAACGGCAT GTACTTCAGC
AACACCGGCG CCAACCCGAC GAGCATCCCC AGCGAGAAGG GATCGTTCCT GACCGCCTGG
GAGAAGAACA ACGGCACGAC CAACTTCACG CTGAAGTACG GCAACGGCCA GCAAGGCGGC
CTGACCCAGT CCTATTCAGG CGCCTTGCCC AACGGCTACA ACCCGATGAA GGTGCAGCCC
TCGATCGAGT TGGGCACCGG CGGGGACAAC AGCATCTGGG GCGACGGGGA GTTCTTCGAG
GGCGCCGTCC TGGCCGGCTT CCCCTCCGAC GCCACCGAGA ACGCGGTGCA GGCCGGGGTT
GTCGCGGCCG GGTTCGCCAA CAACACCACC TACGTTCCGA GCACCGCCTC GCTGGTGAGC
CTGAAGGCGC ACGCCAACGG CCAGTACGTG GACGCGGCGG GCAGCGGTTC GGCGCTCATC
GCGAACGCGG CCTCGACCGG GAAGGCCGAA ACGTTCGACC TGATCACCAA CCCGGACGGA
ACCGCGAGCC TGCGGGCGCA CTCCGACGGC GAGTACGTCA CCGCCGGTAC CTCGCCGCTC
ATCGCCGACC GCACCACCAT CGGCTCCGCC GAGACCTACG ACCTGATCAC CAATGCGGAC
GGCAGCGTCA GCTTTCGGGC GCATGCCAAC GGTGATTACG TCACTGCGGA GAACGCCGGC
GCCTCGGCTC TGATCGCGAA CCGCACCGCC ATCGGTCCGT GGGAGGAGTT CGACGTCGTG
CGCGACACCG CCGCGGTCAG CTTCCGGGCG CACGCGAACA ACGACTATGT GACCGCGGAG
AACGGCGGCG CCGCCTCGCT CATCGCCAAC CGCACCGCCG TCGGCCCGTG GGAGACCTTC
GACCTGATCA GCAACTCCGA CGGCAGCGTC AGCCTGCGGG CCCACGCGAA CAACGACATC
GTCACGGCGG GCACGGGCAG CACCGCCCTG ATCGCCAGCC GCACCTCCAT CGGTACCGGC
GAGGAGTTCG ACCTCGTCCA GAACGCTGAT GGCAGCGTCG GATTCCGAGC GCACGCGAAC
TACCAGTACG TGACCGCCGA CAACGCCGGC GCCTCGCCGC TGATCCCCAA CCGCAACGTG
ATCGGCCAGT GGGAAGAGTT CGACCTCATC TACGACTGA
 
Protein sequence
MPVTAMLRRR TRLVAVFLLV FAAALMMPGL GLSGTAKAAA ASSLPCDLYS AGGTPCEAAY 
STTRALSVSY AGPLYQVQRA SDGSRLDVRV RSAGGVVDVA PENSFCGGTT CTITELYDQT
ANANHMPISP GTSCSGCSHG IAGPGPNGAD IGASALALPV TVGGQPAYGA LFNAQGIGYR
ITNAKNVPTG SQPEGVYMLT SSNLTSNGCC FDFGAGESND TDDGNATMNA IYYGTDCWTQ
NCTGPGPWVG GDLENGMYFS NTGANPTSIP SEKGSFLTAW EKNNGTTNFT LKYGNGQQGG
LTQSYSGALP NGYNPMKVQP SIELGTGGDN SIWGDGEFFE GAVLAGFPSD ATENAVQAGV
VAAGFANNTT YVPSTASLVS LKAHANGQYV DAAGSGSALI ANAASTGKAE TFDLITNPDG
TASLRAHSDG EYVTAGTSPL IADRTTIGSA ETYDLITNAD GSVSFRAHAN GDYVTAENAG
ASALIANRTA IGPWEEFDVV RDTAAVSFRA HANNDYVTAE NGGAASLIAN RTAVGPWETF
DLISNSDGSV SLRAHANNDI VTAGTGSTAL IASRTSIGTG EEFDLVQNAD GSVGFRAHAN
YQYVTADNAG ASPLIPNRNV IGQWEEFDLI YD