Gene Caci_4225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4225 
Symbol 
ID8335579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4791619 
End bp4793274 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content69% 
IMG OID644957328 
ProductCarbohydrate-binding CenC domain protein 
Protein accessionYP_003114930 
Protein GI256393366 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3469] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0637754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGGA ATCTCAGTTC TCGCCGACGA CAACTTCTCG CCTCCGCCGG GGCGGCGGTC 
GGCGCGCTCG CGCTCGCGGG AGCAGCCGCG GGCCTCGCGA CGAACGCGAC GGCCGCCACG
GGCGCGCCCG CCGCGGCGCC GGCGGCGTCC GGCAACCTGC TGACCAACCC CGGGTTCGAA
ACCGGGGACT TGTCCGGTTG GACGTGTGAC GCCGGTACAG GAGCCGTCGT CGCCTCGCCC
GTGCGCTCCG GAACCCATTC GCTGGCCGGC ACGCCGTCGG GGTCGGCCGA CGCGCAGTGC
ACGCAGACCG TCTCGGTGCA GCCGAACACG GCGTACACGT TCTCAGGGTA CGTCGAAGGG
TCCTACGTCT ACATCGGCGT GACCGGCGGC ACCTCGACGT GGACGCCGTC GGCGACCAGC
TGGCAGCAGC TGTCGGTGGC GTTCACGACC ACCGCGTCGC AGACCTCTGT CCAGGTCTAC
GTCCACGGCT GGTACGGGCA GCCGGTGTAC CACGCCGACG ACCTGGCGCT GACCGGCCCG
GCCGGACCGC CGCCGACGAC TCCGCCGACG ACTCCGACCA CACCGCCCAC GACGCCGTCC
ACGACGCCGA CGACCGTGCC GACGTCGAGC ACGCCCACGA CGCCGACCAG CTCCACGCCG
AGCACCCCGC CGAGCAGCTC CTCCTCCGCG CCGGGCGGCA CGACCTGCCC GGTCAAGTCG
CGCCCGGCGG GCAAGGTGAT CCAGGGCTAC TGGGAGAACT GGGACGGCGC GCTCAACGGC
GTGCACCCCG GGCTCGGCTG GATCCCGATC AACGACCCGC GCATCCAGCA GCACGGATAC
AACGTGATCA ACGCCGCGTT CCCGGTCATC CTGTCCGACG GCACCGCCGA ATGGCAGGAC
GGCATGGACA CCAACGTCAA GGTCGACACC CCGGCCGACT ACTGCGCGGC GAAGGCCTCC
GGCGCGACGA TCCTGATGTC GATCGGCGGC GCGGCCGCGG GCATCGACCT CAACTCCAGC
ACGGTCGCCG ACAAGTTCGT CGCCACCATC GTGCCGATCC TGAAGGCGTA CAACTTCGAC
GGCATCGACA TCGACATCGA GACCGGCCTG ACCGGCAGTG GCAGCATCAA CACCCTGTCG
ACCTCGCAGG CCAACCTCGA GCGCATCATC GACGGCATCC TCGCCCAGAT GCCCTCGAAC
TTCGGCCTGA CCATGGCGCC GGAGACCGCC TACGTGACCG GCGGCAGCGT CACGTACGGC
TCGATCTGGG GCTCGTACCT CCCGATCATC AAGAAGTACA TGGACAACGG CCGCCTGTGG
TGGCTGAACA TGCAGTACTA CAACGGCAGC ATGTACGGCT GCTCCGGCGA CTCCTACCAG
GCCGCGACAG TCCAGGGCTT CCAGGTCCAG ACCAACTGCC TGAACAGCGG CCTGACCATC
CAGGGCACCA CCATCAAGGT CCCCTACGAC CACCAGGTCC CAGGCCTGCC CGCCCAACCC
GGCGCCGGCG GCGGCTACAT GACCCCGTCC CTGGTCTCCC AGGCATGGAG CAGCGTCGGC
GGCCAAGTCA AGGGCCTGAT GACCTGGTCG GTCAACTGGG ACGGCTCCCT GGGCTGGACC
TTCGGGAACA ACGTGAAGGG GTTGGAGGGA CGCTGA
 
Protein sequence
MLRNLSSRRR QLLASAGAAV GALALAGAAA GLATNATAAT GAPAAAPAAS GNLLTNPGFE 
TGDLSGWTCD AGTGAVVASP VRSGTHSLAG TPSGSADAQC TQTVSVQPNT AYTFSGYVEG
SYVYIGVTGG TSTWTPSATS WQQLSVAFTT TASQTSVQVY VHGWYGQPVY HADDLALTGP
AGPPPTTPPT TPTTPPTTPS TTPTTVPTSS TPTTPTSSTP STPPSSSSSA PGGTTCPVKS
RPAGKVIQGY WENWDGALNG VHPGLGWIPI NDPRIQQHGY NVINAAFPVI LSDGTAEWQD
GMDTNVKVDT PADYCAAKAS GATILMSIGG AAAGIDLNSS TVADKFVATI VPILKAYNFD
GIDIDIETGL TGSGSINTLS TSQANLERII DGILAQMPSN FGLTMAPETA YVTGGSVTYG
SIWGSYLPII KKYMDNGRLW WLNMQYYNGS MYGCSGDSYQ AATVQGFQVQ TNCLNSGLTI
QGTTIKVPYD HQVPGLPAQP GAGGGYMTPS LVSQAWSSVG GQVKGLMTWS VNWDGSLGWT
FGNNVKGLEG R