Gene Caci_5444 details

Gene Information

Locus tag: Caci_5444
Symbol:
ID: 8336798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6268952
End bp: 6271438
Gene Length: 2487 bp
Protein Length: 828 aa
Translation table: 11
GC content: 72%
IMG OID: 644958542
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_003116144
Protein GI: 256394580
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
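
The start, end, and length fields above agree if the coordinates are read as inclusive and 1-based, and if the coding sequence ends in a single stop codon that is not counted in the protein length. The following is a minimal sketch of that arithmetic under those assumptions. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(6268952, 6271438)
print(length, protein_length_aa(length))  # 2487 828
```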


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.615677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCTC GAACACCCTG GTCACGGCCG CTCAGGAGCC GTACACAGGC GCGCTCGACC 
CGGCGCGTGG CGCACGTCGC CATCGCCGCC GCGCTCACCT TGGCGGCCGT CGACGCCGCC
GCCGCGACCG GCGCCGCGGC CAGCACCGCG CAGCCGAGAT ACCTGGACCG GACCGCGCCG
ATCGCCGCGC GGGTCAACGA TCTGCTCGGC CGCATGACCC TGCCGGAGAA GGCCGGGCAG
ATGGATCAGC AGTTGGTCGA CAACGCGACC GCCGCCTCCG GCGGCGCCTG CGGGGCGGCC
GGCTTCAACC TTCCGACCCC CGCCTGTATG CAGTCGGCGC TGATCGACCA GAACGTCGGC
TCGATCCTGG CCGGCGGAAC CGATAACCCT TCTGACACCA CGGGCAGCGG CACCAGCGGC
AACACCGCTC AGGACTGGGC CAACGACTAC AACACCATCC AGCAGTACGC GATCGCCCAC
TCGCGGCTGC ACATCCCGCT CAGCTTCGGC GTGGACGCCG TGCACGGCTT CGGCCACCCC
TGGCAGGCGC CGCTGTTCCC GCAGTCGATC GGCATGGGCG CGACCTGGGA CCCCTCGCAG
GCGAAGGCCG GCGGCGCGAT GACCGCCACC GCGCTGCGTT CGACGGGCTG GACGTGGGCC
TTCGCCCCGG TCCAGGACCT GGCGCGCGAC AACCGCTGGG GACGCACCTA TGAGACGTGG
GCCGAGGAGC CCGCTCTGTC TTCGGCGATG GGAGCCGCAA ACGTTACCGG CCTGCAGACC
CCGGCTCCGG CCGGCGGCCT GGACGTCAGC GCGACCGTCA AGCACTTCGC CGGGTATTCG
GAGTCGGTCA ACGGCCACGA CCGCGACGAA GCGCTGCTGC CGCTGAACTA TCTGCAGAGC
ACGATCCTGC CGTCCTACGC CGGGGCGATC AACGCCGGCG CCGACGCGGT GATGGTCGAC
TCCGGGTCCA TCAACGGCGT CCCAGCCACC TCCTCGCACT ACCTGCTCAC CGACATCCTC
CGCGGCCAGA TGGGCTTCAA GGGCGTCGAG ATCAGCGACT ACCAGGACGT GCAGGCGCTG
CAGACGACCT ACCACATCGC CGCCAGCCTG CCGGACGCCG TCGCCCTGGC GGTGAACGCG
GGTCTGGACA TGAGCATGGA GGTCAACGGC CCGGACCAGT GGCAGAGCGC GATCATCCAG
GACGTCGGCA ACGGCAAGAT CAGGATGTCG ACGATCAACG ACGCCGTGCG CCGCATCCTG
ACCATGAAGT TCCAGCTCGG ACTGTTCGAC CAGCCCTGTG TGGCCGACCC GGGCAAGCCG
TGCCTGAACG CCGGGGCGGC CGACGCGGTC GTGACCTCCG GGCGTGACCA GACCCTGAAG
GCCACGCAGG AGTCGATCAC GCTGCTGCGC AACCAGAACA GCGTGCTGCC GCTGCCCGCC
GGCAGCCGCG TGGTGGTGAC CGGTCCCAGC GCGGACTCGA TGACCAACCA GCTCGGCGGC
TGGAGCGTGA GCTGGCAGGG CGTCGCCGGC GCCGGGCACG TGTGCTGCAT GGGCTCGCCG
GACCAGATCC CGCCGGGGAC CACGGTGCAG ACCGGGGTCC TGGGCGCCGA CACCCACGCC
ACGGCCATCT CCGACCAGGC GGCCGCCGTG GCTGCCGCCC CGAACACCGA CGCCTACGTC
GCGGTGGTGG GGGAGAAGGC GTACGCCGAA GGCCTCGGCG ACAATCCCGC CCCGGCGCTC
CCGGCGGACC AGCAGGCGCT GATCTCCGCG CTGGAGGCGA CCGGCAAGCC GGTGATCGTC
GTGGTCGAGG CGGGCCGTCC GGTCGCTCTG GGCTCGGCGG AGAAGGCCAG CGCGGTCGTG
ATGGCCTACC AGGGCAGCAC CGAGGCCGGG CAGGCCGTGG CCGACGTGCT GTTCGGCAAG
ACCGACCCCA GCGGCCACCT GTCGATCAGC TGGCCGTCCG ACGCGCCGGC GGTCGGCGGC
GACTTCAACA GCACCGCGCC GTCCCCGCTG GGCGACGAGC CGAAGTTCTT CGACCAGCTG
CCGGGTACCG GCTCCGGGCC GGGGAACGCC TACAACCCGC TCTATCCGTT CGGATACGGG
CTCTCCTACA CCACCTTCAG CCACTCGGCG CTGGCGGTGA CGCCGAACGC CTCGGCGCAC
GGCACCCTCA CCGCGACGCT GACCGTCACC AACACCGGAA CGCGCGACGG CACGGACGTG
GTGCCGCTGT ACGTGGCCCA GCCGGTCAGC GCCTCGGCCG AGCCGCCGCA GCGCCTGGTC
GGCTTCACCC GGGTGACGCT GGCCGCCGGG GCGTCGCAGA CCGTGAAGGT CAGCTTCCCG
GCCACGGCGC TGGCCCGGAG CCAGGGCGAC ATCAACGCAT CCGCGCCGCC GACCGTCGAG
CCCGGCGGCT ACGTGCTGCA GGTCGACAAG GAGGCTGACA CCACTCCGTA CGACGTCGAC
CTGTCGGCGC CGTTCACGCT GCGGTGA
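
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction, assuming the sequence contains only plain A, C, G, and T characters. The helper names are hypothetical. Only the first line of the sequence is used here, so its value is not expected to match the 72% reported for the whole gene; paste the full listing to compare.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only (assumption: used as a short example).
first_line = "ATGCGTGCTC GAACACCCTG GTCACGGCCG CTCAGGAGCC GTACACAGGC GCGCTCGACC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```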
 
Protein sequence
MRARTPWSRP LRSRTQARST RRVAHVAIAA ALTLAAVDAA AATGAAASTA QPRYLDRTAP 
IAARVNDLLG RMTLPEKAGQ MDQQLVDNAT AASGGACGAA GFNLPTPACM QSALIDQNVG
SILAGGTDNP SDTTGSGTSG NTAQDWANDY NTIQQYAIAH SRLHIPLSFG VDAVHGFGHP
WQAPLFPQSI GMGATWDPSQ AKAGGAMTAT ALRSTGWTWA FAPVQDLARD NRWGRTYETW
AEEPALSSAM GAANVTGLQT PAPAGGLDVS ATVKHFAGYS ESVNGHDRDE ALLPLNYLQS
TILPSYAGAI NAGADAVMVD SGSINGVPAT SSHYLLTDIL RGQMGFKGVE ISDYQDVQAL
QTTYHIAASL PDAVALAVNA GLDMSMEVNG PDQWQSAIIQ DVGNGKIRMS TINDAVRRIL
TMKFQLGLFD QPCVADPGKP CLNAGAADAV VTSGRDQTLK ATQESITLLR NQNSVLPLPA
GSRVVVTGPS ADSMTNQLGG WSVSWQGVAG AGHVCCMGSP DQIPPGTTVQ TGVLGADTHA
TAISDQAAAV AAAPNTDAYV AVVGEKAYAE GLGDNPAPAL PADQQALISA LEATGKPVIV
VVEAGRPVAL GSAEKASAVV MAYQGSTEAG QAVADVLFGK TDPSGHLSIS WPSDAPAVGG
DFNSTAPSPL GDEPKFFDQL PGTGSGPGNA YNPLYPFGYG LSYTTFSHSA LAVTPNASAH
GTLTATLTVT NTGTRDGTDV VPLYVAQPVS ASAEPPQRLV GFTRVTLAAG ASQTVKVSFP
ATALARSQGD INASAPPTVE PGGYVLQVDK EADTTPYDVD LSAPFTLR