Gene Caci_0861 details


Gene Information

Locus tag: Caci_0861
Symbol:
ID: 8332191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 999428
End bp: 1001356
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 69%
IMG OID: 644954011
Product: glycoside hydrolase family 5
Protein accession: YP_003111635
Protein GI: 256390071
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
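
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 999428
end_bp = 1001356

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the computed values agree with the listed gene length (1929 bp) and protein length (642 aa).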


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.025002
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCCTCA TCCGTGTGCG TGTCCTGATC GCGCTGCTGG CCCTGCTGAC CTGCGCGGTA 
CTGCTCCCGG GGTCCGCGCG CGCCGCCTCG CGACTCCCCT CAGCCGCTGC CGTTCCCGCC
GGCTCGCTCG CCGCGTCCTG GACCGGTCCG CTGAGCACCA GCGGCCGCTA CGTCGTCGAC
GCGAACGGCA ACCGCTTCAA GCTGATCGGC GGCAACTGGG ACGGCGCGCA GGGCCACTGG
CTCGGCAGCG GCTCGGCGAC CGATCCGGCG CAGAACCACG CCGGCGAGGT GTCCTACAAC
GTCCCGCTGG CCCTGGACCG CAAGCCGATC CCGCAGATCC TGGCGGACTT CCACAGCCTG
GGCATCAACA CCATCCGTCT GCCGTTCGCG GACGCGATGA TCCACGACAC CTCGACCGTC
CCGGACGCCG CCGTCACCGC CAACCCGCAG CTGCGCGGAC TGACCGCGCT CCAGGTCTAC
GACGCGGTCG TCAGCGCCCT GACGGGCGAC GGCTTCGCGG TGATCCTGAA CAACCACACC
ACGAGCTACC GCTGGTGTTG CGGCCTGGAC GGCAACGAGC GCTGGAACAG CGGACAGAGC
ACGCAGCAGT GGGAGTCCGA CTGGCTGTTC ATGGTGAACA GGTACAGGGC GAACAAGCGC
GTCGTCGGCG CGGACCTGCG CAACGAAGTC CGGCGCGACA CCTGGGACGA CCCGAACTGG
GGCTGGTACG ACGCCCACGA CGAATACGCC GCCTTCGAGG AGGCCGGCAA CCAGATCCTG
GCCGCGGACC CGGACATGCT GATCGTCATG GAGGGCATCA ACTGGTACGG CATCCCCGCC
GCCGGCTTCT CCCACGGCCG CCCGATGCTC ACCCCGGCCG CGAACCTCTC CGCCACCCTG
ATCGCCTCGA ACAAGCTGGT GTACTCGGCG CACTTCTACA GCTACACCGG CCCGAACAAC
TCCGGCGCCG CCGCGGGCTC GGCCGGCTCG ACCAGCGACC CGCGCTACGA GGACATGACC
CCGGACCAGC TGGCGTCGGC GGTGAACCAG GAGGCCCTGT TCGTCACCCA ATCCGGCCAG
CACTTCACCG CACCGGTCTG GGTCAGCGAA TTCGGCGCCG CCGGACGCGG CGAGACCGAC
ACCAAGGAAC AGACCTGGCT CGACACCTTC ACCACCATCC TGGCCGCCAA CGACACCGAC
TTCGCCATCT GGCCGCTGAT CGGCTACACC GCCACCAACG GAACCCTCCA GGACAACTGG
GCCCTCCTGT CCTACGACCC CGCCGGCAAC CGCACCAGCA TCACCGACCC CGGCGACTGG
CGCCTCCCCG ACTGGCAGAA GCTGACCTCC GCCCCCACCA CGACCGGCCA CATCCCCGCC
TCCCCCCACT GGAACATGCT CGACCTGGAC CACGCCGACT ACAACGTCTC CACCACGATG
CTGGCCCAAC CCGACTGGTC CCCCGGCAAC CGCAAGGGCA ACTGCCCCGA CACCGAACGC
CTGACAGGCC TAGGCCGCGG CAGCAGCCGA GGACTATGCA CAGACTCCTC AGAACCGACC
AAGAGCACAG CCACCCAGAC CGTCGTCACC AACGAGACCT ACGTAACCGA AGGCGACTGG
GCACCCGGCT ACACCAAGCT CCAATGCCCC GACAACACCT TCGCCACCGG CTACAGCGTC
CACAACAACG CCATGGCAGC CCTACTGTGC GCCCCCGCCG CAGCGCCTCT ACCCACCACC
AGCCACACCA TCTGGTTCAA CCAAGGCGAC AACCGCCCAA CCACCGGCGG CTCCACACCC
TCCGACTGGG CACCCGGCTC CTACAAGGGC CAGTGCCCCG ACAACGAGTA CTTAGCGGGC
ATCGCGTACA CCTGGCAGCG CGCTGAAGGC GGCGTTCCGG ATGCGCTGCT GTGTCGGGCC
CTGGTCTGA
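
A GC content figure such as the one in the Gene Information fields is the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, not the database's own method. The function name `gc_fraction` is hypothetical, and the example input is only the first ten bases of the gene sequence above.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)

# First block of the gene sequence above, used here only as a small example.
first_block = "GTGTCCCTCA"
print(round(gc_fraction(first_block), 2))
```

Because the function strips whitespace, the space-separated blocks of ten bases can be pasted in without editing.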
 
Protein sequence
MSLIRVRVLI ALLALLTCAV LLPGSARAAS RLPSAAAVPA GSLAASWTGP LSTSGRYVVD 
ANGNRFKLIG GNWDGAQGHW LGSGSATDPA QNHAGEVSYN VPLALDRKPI PQILADFHSL
GINTIRLPFA DAMIHDTSTV PDAAVTANPQ LRGLTALQVY DAVVSALTGD GFAVILNNHT
TSYRWCCGLD GNERWNSGQS TQQWESDWLF MVNRYRANKR VVGADLRNEV RRDTWDDPNW
GWYDAHDEYA AFEEAGNQIL AADPDMLIVM EGINWYGIPA AGFSHGRPML TPAANLSATL
IASNKLVYSA HFYSYTGPNN SGAAAGSAGS TSDPRYEDMT PDQLASAVNQ EALFVTQSGQ
HFTAPVWVSE FGAAGRGETD TKEQTWLDTF TTILAANDTD FAIWPLIGYT ATNGTLQDNW
ALLSYDPAGN RTSITDPGDW RLPDWQKLTS APTTTGHIPA SPHWNMLDLD HADYNVSTTM
LAQPDWSPGN RKGNCPDTER LTGLGRGSSR GLCTDSSEPT KSTATQTVVT NETYVTEGDW
APGYTKLQCP DNTFATGYSV HNNAMAALLC APAAAPLPTT SHTIWFNQGD NRPTTGGSTP
SDWAPGSYKG QCPDNEYLAG IAYTWQRAEG GVPDALLCRA LV
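
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which certain alternative start codons are read as methionine when they initiate translation. The sketch below illustrates this on the first and last codons of the sequence above. It is a simplified, hypothetical helper and not a full implementation of table 11: the start codon set is restricted to ATG, GTG and TTG as an assumption, and the codon-to-amino-acid assignments are those of the standard code, which table 11 shares.

```python
# Build the codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_cds(seq: str) -> str:
    """Translate a coding sequence; an initial start codon is read as M."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)

# First 30 bases and last 9 bases of the gene sequence above.
print(translate_cds("GTGTCCCTCA TCCGTGTGCG TGTCCTGATC"))
print(translate_cds("CTGGTCTGA"))
```

Note that the second GTG in the first example is internal and is therefore read as V, not M.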