Gene Caci_4214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4214 
Symbol 
ID8335568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4776901 
End bp4778727 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content68% 
IMG OID644957317 
Productglycoside hydrolase family 5 
Protein accessionYP_003114919 
Protein GI256393355 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.303504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCGG CGTTGATCGC GGCGTCGGTC ATGACCTTCG CCGGGGTGGC CGCGCAGCCG 
GCCACCGCGC GCACCGATGC CGCCGCCGCC CACACCACCG CCGCCACCGC CGGCACCACC
GGCACCGCCC CGGCCGCGAC AGCGGCGGCA GGCGCCGGAT ACTGGCACAC CAGCGGCCGG
GAGATCCTGG ACTCGGCCAA CCAGCCGGTC CGCATCGCCG GCGTCAACTG GTTCGGCTTC
GAGACCAGCA ACGACGTCGT CCACGGCCTG TGGAGCCGGG ACTACAAGTC GATGATGGAC
CAGATGAAAT CGCTGGGCTA CAACACGATC CGGCTGCCCT ACAGCGACGA CATCTTCAAG
CCCGGCACGA TGCCGAACAG CATCAACTTC TACCAGATGA ACACCGACCT GCAGGGTCTG
ACCTCGCTGC AGGTGATGGA CAAGATCGTG GACTACGCCG GCTCGATCGG CCTGAAGGTG
ATTTTGGACC GCCACCGTCC GGACGCCGGC GGGCAGTCGG CGCTGTGGTA CACCTCCACA
GTCCCGGAAT CGACGTGGAT CAACGACCTG AAAGCCATCG CCACGCGCTA CCAGGGCAAC
CCGGCGGTGG TCGGGATCGA CCTGCACAAC GAGCCGCACG ACCCGGCGTG CTGGGGCTGC
GGGGACACCA CGATCGACTG GCGGCTGGCC GCCGAGCGCG GCGGCAACGC GGTGCTGTCC
GTCAACCCCA GCCTGTTGAT CTTCGTCGAG GGTGTTCAGA CCTTCAACGG CAGCTCCTAC
TGGTGGGGCG GCAACCTACA GGGCGCCGGG CAGTATCCGG TGCAGCTGTC GGTGGCCAAC
CGGGTCGTCT ACTCGGCCCA CGACTACGCC ACCAGCGTGG CCAGCCAGCC GTGGTTCACC
GACCCGAGCT TCCCCTCCAA CATGGCCGGG ATCTGGGACA AGAACTGGGG CTACCTGTTC
AACCAGAACA TCGCCCCCGT GTGGGTCGGC GAGTTCGGCA CGACGCTGTC CGCCACGACC
GACCAGGTCT GGCTCAAGAC GCTGGTCTCC TACCTGCGTC CGACCAGCAC CTACGGCGGC
GACTCCTACC AGTGGACGTT CTGGTCCTGG AACCCGGACT CCGGTGACAC CGGCGGCATC
CTGAAGGACG ACTGGAACTC GGTGGACACC GTCAAGGACG GCTATTTGAC CTCGATCAAG
GCACCGGCCT TCGGCGGCGG GGGCGGCGGA GGCGGAGGCG ACAGCTCGCC GCCGACCGCG
CCGACCAACG TGGCGGCGAC CGTCACCACC TCCAGCTCGG TAGCGCTGTC CTGGACCGCC
TCGACCGACA ACGTCGGCGT CACCGGCTAT GACATCTACC GGGGCAGCAC CCTCGCCGGA
ACGTCGGCGG GCACGACGTT CACCGACAGT GGACTGACAC CCTCGACCGC GTACACCTAC
ACGGTCAAGG CGTACGACGC AGCCGGCAAC CTGTCGGCGG CCTCGGCCGC GGTGTCCGCG
ACGACGCAGG GCGGCGGCAG CAACGCCGGC TGCACCGCGG CGTACACGGT GTCGAACCAG
TGGAACAACG GCTTCACCGC CGCGGTCACG ATCACCAACT CCGGAACCGC GGCGACCCAC
AGCTGGAAGG TCACCTGGAC GTGGCCGGGC GGCCAGCAGG TCAGCAACGC CTGGAACGCC
ACCGAAGCCC ACAGCGGGCA GAGCGAGACG TTCACCGCGG TGAGCTCCGA CGCGGTGGTC
GCGCCGGGAG GCACGACGTC GTTCGGATTC CAGGCGAGCT ACAACGGGAC CAACCCGGCA
CCGACTCCAG TGTGCACAGC GAGCTGA
 
Protein sequence
MAAALIAASV MTFAGVAAQP ATARTDAAAA HTTAATAGTT GTAPAATAAA GAGYWHTSGR 
EILDSANQPV RIAGVNWFGF ETSNDVVHGL WSRDYKSMMD QMKSLGYNTI RLPYSDDIFK
PGTMPNSINF YQMNTDLQGL TSLQVMDKIV DYAGSIGLKV ILDRHRPDAG GQSALWYTST
VPESTWINDL KAIATRYQGN PAVVGIDLHN EPHDPACWGC GDTTIDWRLA AERGGNAVLS
VNPSLLIFVE GVQTFNGSSY WWGGNLQGAG QYPVQLSVAN RVVYSAHDYA TSVASQPWFT
DPSFPSNMAG IWDKNWGYLF NQNIAPVWVG EFGTTLSATT DQVWLKTLVS YLRPTSTYGG
DSYQWTFWSW NPDSGDTGGI LKDDWNSVDT VKDGYLTSIK APAFGGGGGG GGGDSSPPTA
PTNVAATVTT SSSVALSWTA STDNVGVTGY DIYRGSTLAG TSAGTTFTDS GLTPSTAYTY
TVKAYDAAGN LSAASAAVSA TTQGGGSNAG CTAAYTVSNQ WNNGFTAAVT ITNSGTAATH
SWKVTWTWPG GQQVSNAWNA TEAHSGQSET FTAVSSDAVV APGGTTSFGF QASYNGTNPA
PTPVCTAS