Gene Caci_5007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5007 
Symbol 
ID8336361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5732605 
End bp5735157 
Gene Length2553 bp 
Protein Length850 aa 
Translation table11 
GC content70% 
IMG OID644958106 
ProductAlpha-galactosidase 
Protein accessionYP_003115708 
Protein GI256394144 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.285302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATAC CCAAACTCAT CGCGCTCGCC GCGTCGGCGG CGCTGTGGGC GACAGCGGGA 
CCGGCGTGGG CCGGGTCTCA GGCCCGGCAC GCGCCGGCCA CGGTCCCGAT CAACAACCTG
GCCCGCACGC CGTATCAGGG CTGGAACACC TACTACGGCC TCGGGTCCAC CTTCACCGAG
CAGACCATCA AGGACGAGGC CGACGCGCTG GTGAGCAAGG GTCTGGCCGC CGCGGGCTAC
AACTACGTCT GGATCGACGG CGGCTGGTGG AACGGCGCGC GGGACGCCTC CGGCGCCATC
ACCGTAGACT CGACTCAGTG GCCTGACGGG ATGAAGGCGG TCGCCGACTA CATCCACTCG
CGCGGGCTGA AAGCAGGCAT CTACACCGAC TCCGGCCTCA ACGGCTGCGG CGGCGCCAAC
CAGGGCAGCT ACGGCCGCTA CCAGCAGGAC GTCAACCAGT TCGCCGGGTG GGGCTACGAC
GCGGTGAAGG TCGACTTCTG CGGCAGCGAG CAGATGGGAC TGGACCCGGC CACCGTCTAC
GGCCAGTTCC GCGACGCAGT CCTGAACAAC AGCAGCCACC GCCCGATGCT GTTCAACATC
TGCAACCCCT TCATCCCCGA GACCGGCGCC GCGCCCGGCC GCAGCGCGTT CGACTCCTAC
ACGTTCGGCC CGAGCACCGG GAACTCCTGG CGGACCGACA CCGACATCGG CTTCCCGAAC
GACGTCCGCT ACTCCGACGT GCTGCGCAAC CTGGACGCCG ACGCCGCGCA CCCGGAAGCC
GCCGGTCCCG GGCACTGGAA CGACCCGGAC TATCTGGGCC CTGATCTGGG CATGACCGAC
GCCGAGTCCC GCTCGCAGTT CTCAATGTGG TCGATCGTCG CGGCTCCGCT GATGATCGGC
TCGGATGTAC GCAAGCTCTC CGACAGCGCC GTCGCTATGC TCACCAACGC CGAAGTGCTC
GCGGTAGACC AGGACCGGCT GGGAATTCAG GGCACTGCGT TGTCCGCCCC GACCGCCTCC
GGCGCCCAGG TCTGGACCAA GCCGCTCGCC AACGGCGACG TCGCTGTCGC GCTGCTCAAC
CGCGGCACGA CCCCGCAGCT GATCTCCACG ACCGCCGGCA AGATCGGCCT GTCCACGTCG
GGCAGCTACG CCGTGCGCGA TCTGTGGCAG CACTCGACGA CCGAGTCGGC CGGCACGATC
TCCGCGACCG TCGCCCCGCA CGACGTCGTG CTGTATCGGG TCTCCCGCAA CGGAAACCCG
GCGACCATGA CGCCGGCGAC CACGCTGTCC CCGGCGACCC TGACCGCGAC CGCGCAGGCC
GCGCTTCCCC TGGTCGCCCC CGGCGACTCC TTCCCGGTCT CGGCCACGTT CACCGACAAC
GGCCGGCTCG CCGTCCGGAA CGTGAAGCTC ACGCTCGCCG TCCCGGCGGG CTGGACCGCC
ACGCCGACCA CCCCGGCCGC GAAGGACCGC CTCGACAGCG GGCAATCGAT TGCGGCGACC
TGGCAGGTGA CCGCCGCTCC AGGCGCTCTG CCCGGTACGG ACCAGCTCGC GGTGACTGCG
GGCTACGACT GGCAGGGTGC TTACTCCGGT GCGACATCAA GGCTCCAGAC ACTCAGCGCC
ACCGAGTCGA CGCAAGTCCA GGTCCCCGCT GCGCCGCCGT CGGGCACCGG TCCGCTGAGC
CACCATCCCT GGCTCGACGC GGCGAGCGGC TACCTCGTGC CCCGGGTCGA CCTCGACGAT
GCCGGCGGCG GGCCGCTGAC GATGCACGGC GTCGGGTACC CGACCGGTGT CGGCACCGCG
TCGCCCTCGA CGATCGACTA CTACGTCGGC GGCCAGTGCA GCACGCTCAC CGCCACGGTC
GGCATCGACG ACTCGGCGGA CTTCGACCCG ACCGGCGGGA CGGCGGTGTT CCAGGTCTAC
GGCGACGGCG TGAAGCTGTA TGACAGTGGT CTGGTGACCC GGGCCGCGCC TCAGAGCGCG
TCGGTGAATC TGGGTACTGC GAAGGTGATC AGCCTGGTCG TCGGGGACGG CGGCGACGGC
GGTTACAACG ACCGCACGGA CTGGGGCGGG CTGCGGATCA CCTGCGGCGC GCCGGTCGGC
ACGCAGCCCA GCGGACCCTG GCCGCACTTC GCGCCCTCGT CCTCGGTGTC CGCGACGGCC
ACCAGCGCCA ACGCCGGCTA CCCGGCGGGC AACGCGGTGG ACGGCCAGGT GACCACTTTG
TGGCACTCGC AGTTCAGTCC GGTCCACGAC CCGCTGCCGA TCTCGCTGAC GATGGACCTC
GGCTCGGTGC AGACGGTCAC CGGACTGACC TACCAACCCC GCCTCGACGG CGCGATCACC
GGTACCATCA CCGGTTACAC CGTCGAGGTC AGCAGCGACG GCGTCACCTT CACCCCGGCG
GCAGCGGCGG GGACGTGGAC GCAGGACGCG CTGCTGAAGT CCGTTGAATT CGCTCCGGTG
TCGGCTCGCT ATGTGCGACT GACTGCGACT GCGGCAGCCG ACGGCTACGC CTCGGCGGCT
GACGTCAGCG TGGCGGCGCG ACCGACCGCC TGA
 
Protein sequence
MLIPKLIALA ASAALWATAG PAWAGSQARH APATVPINNL ARTPYQGWNT YYGLGSTFTE 
QTIKDEADAL VSKGLAAAGY NYVWIDGGWW NGARDASGAI TVDSTQWPDG MKAVADYIHS
RGLKAGIYTD SGLNGCGGAN QGSYGRYQQD VNQFAGWGYD AVKVDFCGSE QMGLDPATVY
GQFRDAVLNN SSHRPMLFNI CNPFIPETGA APGRSAFDSY TFGPSTGNSW RTDTDIGFPN
DVRYSDVLRN LDADAAHPEA AGPGHWNDPD YLGPDLGMTD AESRSQFSMW SIVAAPLMIG
SDVRKLSDSA VAMLTNAEVL AVDQDRLGIQ GTALSAPTAS GAQVWTKPLA NGDVAVALLN
RGTTPQLIST TAGKIGLSTS GSYAVRDLWQ HSTTESAGTI SATVAPHDVV LYRVSRNGNP
ATMTPATTLS PATLTATAQA ALPLVAPGDS FPVSATFTDN GRLAVRNVKL TLAVPAGWTA
TPTTPAAKDR LDSGQSIAAT WQVTAAPGAL PGTDQLAVTA GYDWQGAYSG ATSRLQTLSA
TESTQVQVPA APPSGTGPLS HHPWLDAASG YLVPRVDLDD AGGGPLTMHG VGYPTGVGTA
SPSTIDYYVG GQCSTLTATV GIDDSADFDP TGGTAVFQVY GDGVKLYDSG LVTRAAPQSA
SVNLGTAKVI SLVVGDGGDG GYNDRTDWGG LRITCGAPVG TQPSGPWPHF APSSSVSATA
TSANAGYPAG NAVDGQVTTL WHSQFSPVHD PLPISLTMDL GSVQTVTGLT YQPRLDGAIT
GTITGYTVEV SSDGVTFTPA AAAGTWTQDA LLKSVEFAPV SARYVRLTAT AAADGYASAA
DVSVAARPTA