Gene Caci_6865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6865 
Symbol 
ID8338231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7927021 
End bp7930071 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content68% 
IMG OID644959954 
Productglycoside hydrolase family 31 
Protein accessionYP_003117545 
Protein GI256395981 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCCG ACTCTATGGG AGACGTGATG AAAGACTATT CGATCAAGCG CAGAACCGTC 
CTTTCCACGG CCGTCGGCGC GCTGGCCGTG AGCACCGTGA ACGCGGTCCC CGCCTTCGGG
GCGAGCGCGG ATCAGCTCAC GGACTACGCC TCGCACACCA CTGACAGCCG TAGCATCACC
GTGACCAGCA CGACCGGTCA GCAGCTGAGG ATCACCGCGT ACGGAGACCA GATCGTCAGG
GTGCACGCGG TCCGTTCCGG GGAGAGCTTC TTCTCCGACA CCCGGTACGA GATGGTGGTC
CCCGCGAACC ACACCTCCAT GGGCGGCAGC CTGACCGTGA CCGTCACCAC GGACACGATC
GAGATGCACA CCGCCGCGGC GGACGGTCTG CGCATCGTCC TGCACCGCAA GCCCCTGAGG
CTGGAGTTCT ACAACCGGGC CACCGGCGCG CTGCTGGCCA AGGAGGACGC GACCCGGGGC
ATCACGTGGA GCGGCACCAA CTCCACCGTC GTGGCCGAGG CCTTCGTCCC CTCGTCCTCC
GGTGAGCGCT TCCTGAAGGC CGGACACGGC ATCCTCGGGC GCGTGCCGTC ACTGGATCGC
ACCGGTACCA CGGTCTCGGA GAACTACGCC GACGCCAACG CCGCCGCTCA CAACCCTCAG
GAACAGGCGC CGGGCATCGT GCCGTTCTAC CTCTCCAACC TGGGGTATGG GGTGTTCTTC
AACACCACCT TCGACACGAC CTTCACCTTC AACAGCAGCA ACGGGTACGG GTTCTCCGCC
ACCGGGTACG GCGTCAGCGG CATCCGGCCC CAGGTCGACT ACTTCCTGAT CAACGGTCCC
CAGTTCACGC AGCTCTTCGA CCGCTACACC CAGCTCACCG GCCGTCCGCG GCTGCCCCAG
CGGTCCATCT TCGGGCTCCA CATGACCGAC CACAGCTTCC CCGACACCAG CGACGAGAAC
TGGTGGCGTC AGAAGATCAC CCAGCACCGC GCGGCCGGCT TCCCGTTCGA CCACCAGGTC
AACGACAACC GGTGGCGGGC CGGCTCCGGC GCCTGGTCCG GCTCGTATTT CGAGTTCAGC
TCCGTCCGCT GGCCCGACCC CGCCGGCTAC GCGAAATGGG CCGCCACCAA CGGTGTCACC
GTGACGCTGG ACTACAACCG CAACAACTCC GACCTCATGG AGAACTGGAA GGCGGGGCCG
CCCCCCGGCT ACAGCTTCGC GTCGGCCGAC ATTTCCAGCG TGCCGCAGAA CAACGCCGTC
CCCGACTGGT CCTACCCCGC CACCCGCGCC TGGGTGTGGA AGGTCTTCTG GGACAAGGCC
CTCAACCCGA GCCTGAAGTA CCCCTGTGAC GGCCTGTGGA TCGACGAGAC CGACGAGATG
GGCGGGATCC CGTACCCTGC GAAGATGGCC GACGGCCACA CGTGGGCCGA AGGGCGGAAC
GCCTACCTGC TGAACCTGCA CAAGGGCATC GGCGAAGAAG GCTGGGACCC GGCCGGCAGC
GGCCACATCG GCTCCGCGAA GCGCCCGTGG ACCTGGAGCC GCGGCGCCAC CGCGGGCCAG
CAGCGCTACG GCCACTACTG GACCGGCGAC ATCCCCTCGA CCTACGACGA GATGCGCTCC
CAGATCAGGG GCATGCTGAC GGCGGGCCTC GGCGGCTTCC CGTTCGCCAA CATCGACGGC
GGCGGCTACG GCAACGGCAG CGTGATTTCC GACGCTTTCT ACCGCAACTG GCCGGTCGCG
TGGTCCAGCC TCGCGCCGAT CTGGCGCCCG CACACCTCCG CCACGGTCCC GTCGAAGGGC
ACGCTCGCCT CACGCTGGCC GCTCGACCAG GGCACGCAGG CGCAGGCGGA CTTCGCCCGG
TACGGCCGGC TGCGCTACAC CCTGATGCCC TACATCTACT CGCTCGCCCA CCAGTCCGCC
GCAACCGGTA TGCCGATGGC TCGGGCCATG GTGATCGACT ACCAGAGCCG CTCCCAGGCT
TACACCCACG ACCTGCAGTA CATGTGGGGC CCTTCGCTGC TGGTCGCGCC CTGCACCAAC
GACGGCGGGG CCGTCCAGCA GATCTGGCTG CCGGCCGGTT CGACCTGGTA CAACTTCTGG
GCCGACATCA AGCACACCGG TTCCGACTCC GGGGACTTCG CCTACACCAC CCGCACCGGC
GAGACTCCGT TGTTCGTCAA GGCGGGGGCG ATTCTGCCCA AGTACCCGTA CGCGCAGAGC
GCCGCCTACT TCACCAAGCA GCAGCTTGAG ATGGATGTCT ACGCGGGGGC CGACGGCACC
TTCTCAGTCA TCGAAGACGA CGGAGTGACC GAGTCCTATC GGAGCGGCGC CCAGAGCACC
ACGCAGCTCA CCTACACCGA CGCGGCGACC CGCGTCGCTG TCGCCCATCC GCAGGGGACG
TACGCGGGCG CGCCCACCAG CCGCCGCTAC ATCGTCCGCT TCCACGGATT GGCGAATCCG
GTGGGGATGC GGGTCAACGG CGGGGCGACC CTGCCGGCCT TCACCAGCGA AGCCGCAGCG
CTGATCAGCT CGGGTGGAGC CGGCAGCGTG TGGAACGCGT CTACGAAGGT CCTGAGCGTC
GTCACCTCGC AGATAGCCGT GGTCGCGAAC GGCGGCACCG CCGCGACGGT CGAACCGAGC
GGCGCCGCCT TCCCCGCCGT CAGCGGCGGC ACGGTCTACG AGGCCGAGAC GGCCCATCTC
GACAGCGCGT TCATCATCGA CACCAGCCAC CCCGGCTACA CCGGGACCGG CTATGCCGAC
TTCAACGGAT CGTCCTCGGG CCCCGGCATC AGCTGGACGG TCACGGCGGC CGCGGCGGGG
AAGAAGCAAC TCTCGATCCG CTATGCCAAC GGGGGCACCA CGAACCGCCC GATGGCCGTC
GCGGTCAACG GCACCACTGT CGCCACGCTC ACTATGGCGC CCACTGGTGC GTGGGACAGC
TGGGCGACTG TGTCTTGTAC TGCCACGCTT CCGCAGAGTA CGACGATCAC TGTTCGAGCT
ACGGTCACCA CGGCTAATGG GGCGAACATC GACAGCTTGA TTGTGGGGTA G
 
Protein sequence
MGSDSMGDVM KDYSIKRRTV LSTAVGALAV STVNAVPAFG ASADQLTDYA SHTTDSRSIT 
VTSTTGQQLR ITAYGDQIVR VHAVRSGESF FSDTRYEMVV PANHTSMGGS LTVTVTTDTI
EMHTAAADGL RIVLHRKPLR LEFYNRATGA LLAKEDATRG ITWSGTNSTV VAEAFVPSSS
GERFLKAGHG ILGRVPSLDR TGTTVSENYA DANAAAHNPQ EQAPGIVPFY LSNLGYGVFF
NTTFDTTFTF NSSNGYGFSA TGYGVSGIRP QVDYFLINGP QFTQLFDRYT QLTGRPRLPQ
RSIFGLHMTD HSFPDTSDEN WWRQKITQHR AAGFPFDHQV NDNRWRAGSG AWSGSYFEFS
SVRWPDPAGY AKWAATNGVT VTLDYNRNNS DLMENWKAGP PPGYSFASAD ISSVPQNNAV
PDWSYPATRA WVWKVFWDKA LNPSLKYPCD GLWIDETDEM GGIPYPAKMA DGHTWAEGRN
AYLLNLHKGI GEEGWDPAGS GHIGSAKRPW TWSRGATAGQ QRYGHYWTGD IPSTYDEMRS
QIRGMLTAGL GGFPFANIDG GGYGNGSVIS DAFYRNWPVA WSSLAPIWRP HTSATVPSKG
TLASRWPLDQ GTQAQADFAR YGRLRYTLMP YIYSLAHQSA ATGMPMARAM VIDYQSRSQA
YTHDLQYMWG PSLLVAPCTN DGGAVQQIWL PAGSTWYNFW ADIKHTGSDS GDFAYTTRTG
ETPLFVKAGA ILPKYPYAQS AAYFTKQQLE MDVYAGADGT FSVIEDDGVT ESYRSGAQST
TQLTYTDAAT RVAVAHPQGT YAGAPTSRRY IVRFHGLANP VGMRVNGGAT LPAFTSEAAA
LISSGGAGSV WNASTKVLSV VTSQIAVVAN GGTAATVEPS GAAFPAVSGG TVYEAETAHL
DSAFIIDTSH PGYTGTGYAD FNGSSSGPGI SWTVTAAAAG KKQLSIRYAN GGTTNRPMAV
AVNGTTVATL TMAPTGAWDS WATVSCTATL PQSTTITVRA TVTTANGANI DSLIVG