Gene Caci_4957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4957 
Symbol 
ID8336311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5659810 
End bp5661876 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content65% 
IMG OID644958056 
Productcellulose-binding family II 
Protein accessionYP_003115658 
Protein GI256394094 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0505986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGC ATGTACTGAC AAGACGCACT GATCGCCGCA AGATCGTGTC GCTCGCCGCC 
AGCCTGAGCG TCGTGGCCGC GTGCACGCTG GCGGTTGTGG TTCCGTCCGG TGCTGCGTAT
GCGGCTGATA CCGCGACGAT CAACGGAGCG ACGACGTATC AGACGATCGC TGGGTTCGGT
GCCTCGGAGG CGTTTGGTGA GGCGGCGGCG GTGATGAATG CCTCCTCCTC GGTGCAGCAG
CAGGCGCTGG CCGATCTGTA CAGTCCGACG ACCGGTGCGG GTTTGACCAT TCTGCGTAAC
GAGATTGGTG CGACCTCCGG CAACACGATC GAGCCGACCA ATCCTGGCGG CCCTGGTGCG
ACGCCGAACT ATCTTCCGCT GTCGCAGATC AACCAGGACA TGGGCCAGTT GTGGTTCGCC
CAGCAGATCA AGGCCCGCTA CAACGTCACC AACGTCTACG CCGATGCCTG GAGCGCCCCA
GGTTTCATGA AGACCAACAA TTCGGTGTCC GGCGGTGGTC AGGTTTGTGG GTCGGCCGGG
GCTTCGTGCT CCAGTGGTGA TTGGCGTCAG GCGTATTCGA ACTATCTGGT GCAGTACGCC
AGGGACTACG CGGCGGCGGG TGTGCCGCTG ACCTATCTGG GTCCGTCGAA CGAGCCTGAC
TACAGCACCA ACTACGACAG CATGTCGATG AGTCCGGCGC AGATGGCCAG TGTGGTCGAT
GTGCTCGGGC CGACTTTGCG GAGCTCGGGG CTGGCTACCC AGGTCACTTG TTGTGCGGCG
ACTGGTTGGC CGAAGGCTGG GCAGTATGCT GCGGCGATTG AGGCGGATCC GACGGCGTTG
GCTGCGGTGG GCATGGTGGG CGGGCACGGC TACAGTGGGG CGCCGACTTC GCCGTTGCCT
GGTTGGACCA AGCAGTCGTG GGAGACTGAG TGGTCGACTT TTGAGGGCTT CAGCTCTGCC
TGGGATGACG GCTCCGATGC GTCCGGGATG GCGTGGGCTC AGCACATCAA TCAGGGGTTG
ACTGGTGCGA ACCTGAACGC GTTCCTGTAT TGGTGGGGTA GCACCACCCC GTCGGAGAAC
GGTGACAATG AGGGGCTGTT GGAGATCAAC GGCAGCTCGG TGATTCCGAC TGGTCGGTTG
TGGGCGTTCG CGAACTACAG CCGCTACATC CATCCTGGTG CTGTGCGTAT CGGGGCGAGC
AGCTCCAATG GTGCGGTGAA CCTGAGCGCG TACAAGAACA CCGACGGCTC CTTGGCTATC
GTGGCGCTCA ATACTGGTAG CGGTTCGGAC GCGCTCACCT ATTCGCTGGC GAATACCGGC
GTGGCCAACG GCGCCACCGT GACCCCGTAC CTGACCAACA ATGCGAACCA GGTCGCAGCC
CAGGGCACGA CGACGGTCGC CGGCGGCGCC TTCACCGCGA CTGTGCCGGG CCGCTCCCTG
GTGACGTATG TGATCCCAGC CGGCGTGGTG AGCGGCAACA CCGTCACCGT GACCAACCCG
GGCTCCCAGA CCGGGAAGGT CGGTACGGCG ATCAGCGGCC TGCAGATCCA GGGTACTGAC
TCGGGCTCCG GCCAGACGCT GACGTACTCG GCGTCGGGTC TGCCGGCCGG GCTGTCGATC
AGCAGCAGCG GCCTGATCAC CGGTACCCCG ACCACGGCGG GCAGCTCCAC CGTGGCGGTG
ACTGCCACCG ACAGTACCGG CGCGTCCGGC TCGGCTGGCT TCACCTGGAC CGTCACCGGC
GGCACGACGA CAGGCACCTG CCATGTCGCC TACACCAGAA CCAATGAATG GCCAGGTGGC
TTCACCGCCA ACGTCACCAT CACCAACACC GGTACGGCGG CGATCAACGG CTGGACCGTC
GGCTGGAGCT TCCCCGGCGA CCAGAAGATC ACCAACGCCT GGAGCGCCAC CGCCACCCAG
AGCGGCGCCG CGGTCAGTGC GACCAATGCC GCCTACAACA GCACCATCGC CCCCGGAGCC
AACACCTCGT TCGGCTTCCA GGGCACGTTC ACCGCGAACG ACACCTCACC CTCGAGCTTC
ACCGTCAACG GAGCCGCGTG CTCGTAG
 
Protein sequence
MSVHVLTRRT DRRKIVSLAA SLSVVAACTL AVVVPSGAAY AADTATINGA TTYQTIAGFG 
ASEAFGEAAA VMNASSSVQQ QALADLYSPT TGAGLTILRN EIGATSGNTI EPTNPGGPGA
TPNYLPLSQI NQDMGQLWFA QQIKARYNVT NVYADAWSAP GFMKTNNSVS GGGQVCGSAG
ASCSSGDWRQ AYSNYLVQYA RDYAAAGVPL TYLGPSNEPD YSTNYDSMSM SPAQMASVVD
VLGPTLRSSG LATQVTCCAA TGWPKAGQYA AAIEADPTAL AAVGMVGGHG YSGAPTSPLP
GWTKQSWETE WSTFEGFSSA WDDGSDASGM AWAQHINQGL TGANLNAFLY WWGSTTPSEN
GDNEGLLEIN GSSVIPTGRL WAFANYSRYI HPGAVRIGAS SSNGAVNLSA YKNTDGSLAI
VALNTGSGSD ALTYSLANTG VANGATVTPY LTNNANQVAA QGTTTVAGGA FTATVPGRSL
VTYVIPAGVV SGNTVTVTNP GSQTGKVGTA ISGLQIQGTD SGSGQTLTYS ASGLPAGLSI
SSSGLITGTP TTAGSSTVAV TATDSTGASG SAGFTWTVTG GTTTGTCHVA YTRTNEWPGG
FTANVTITNT GTAAINGWTV GWSFPGDQKI TNAWSATATQ SGAAVSATNA AYNSTIAPGA
NTSFGFQGTF TANDTSPSSF TVNGAACS