Gene Caci_4439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4439 
Symbol 
ID8335793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5047034 
End bp5049217 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content67% 
IMG OID644957542 
ProductAlpha-galactosidase 
Protein accessionYP_003115144 
Protein GI256393580 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.093 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCA CCGCCCCACG CCGGAGCCTG TTCGGAGCCG TCGCCGCGGC CGTCATGGTG 
TTCGGCGCCG TGACCGCTGC CGCGCCCGCG GTCTCGGCCA GCACCACGGC AGCTACTCCC
GCGATCAACA CCCACCCCTC CTACAACGGC CTGTCGCTGA CCCCGCCGAT GGGTTTCAAC
GACTGGGCCG GCTTTGAATG CAACAGCGAC ATGAACGAGG CGCTGTTCAC CAAGACCGCC
GACGAAATAG TCAAGCTGGG CCTGAACAAA CTCGGCTATG ACTACGTCAA CATCGACGAC
TGCTGGATGC AGAAGACCCG CGACGCAAAC GGCGACTTGC AGGTCGACGC CACCCGCTTC
CCGCACGGTC TGAAATGGCT CGGCGACTAC ATCCACAGCA AGGGCTTGAA GTTCGGCATC
TACGAGGACG CCGGCTACTA CACCTGCCAG GGCGCCGCAG GCAGCTACGG GCACTTCCAG
CAGGACGCGG ACCTCTACGC CTCCTGGGGC GTGGACTACC TCAAGCTCGA CTACTGCTAC
GAGCCGATGG ACCAGTTCCC CGGCAAGACC GAGAGCCAGG TCGCGCAGAT CGTCTACACC
GAGGCCAGCC AGGCGCTGCT GAACACGCAC CGCCCGATGC TGTTCTCCGA GTCCGCCCCG
GCCTACGTCT GCTGCTCCGG CTCGGACTTC ACCGACGAGC TCACCTGGCT CTACCAGCAC
GGCAACCTGT GGCGCTTCGG CTCGGACATC TACGACGCCT GGCCGAGCGT GCTGGAGAAC
TACAGCGAGG ACAACACCCC GGGCTTGGCG CAGTGGGCCG GTCCCGGGCA CTGGAACGAC
GCGGACATGC TGGAGATCGG CAACGGCGGG CTGACCCCCA CCGAGGAGCA GACCCAGATG
ACGCTGTGGG CCGAGATGGC CTCGCCGATC CTGCTGTCCA CCGACCTGTC GAAGCTGACC
CCGGCCGAGG TGGGGATCGT TTCGAACCCC GATGTCGTGG CCGTGGACCA GGACCGGCTC
GGCGCGCAGG GGACGATCGT GCAGTCCGGG ACCGGCTACG ACGTGCTGGC CAAGCCGCTC
GCGGACGGCG ATGTGTCGGT CGTGCTGTTC AACAAGGGCG ACACGGCGCA GACGGTCACC
ACGACCGCGG CGAAGATCGG CCTGCCGAGC CGCGGCGCGC CGTTCCAGCT GACCGATTTG
GTGAGCAAGG CCACGAGCGC CAGCGACGGC ACCATTTCGG CGTCCCTCGC ACCGCACTCG
ACGGTGATCT ACCGCGTGCA CCCCGGAGGC GACAAGCACC TGCCGATCCA CACCGCCGCG
ACGATCAGCA GCGCACCGCT CGCCGCCGGG GTACCCACGA AGGTCAGCGT TTCCTTCGCC
AATCACGGGT ATTTCGACGC GCAGCAGCCC TCTGTCACGC TGAAGCTGCC GACCGGTTGG
ACGGCGACGC CGGCTTCGGT GTCGTTGAAG AACGTCAAGC CGGGGACGTC GGCGACGGCG
ACGTTCGTCG TCACCGCCAG CGCTCCACCG CCGGGCAAGG TGACGACGAC GTTGACCACG
TCCGTCGCCT ACCGCGATCG CGGCAGACCG GCGTCCGACA GCGCGGACCT GTCCAACGTC
ACCAACACGC CGTTCCCGAC GCTGGCCGGC GCGTTCAACA ACACCGCGAT CACCGATGAG
ACCAACACCG CGCCCGGCAA CTTCGACGGG GACGGCGACA GTTACTCGGC GCAGTCGCTC
GCCACAGCCG GCGCCACGCC GGGGGCGACG ATCAGCGCCG GCGGCACGAC GTTCACCTGG
CCGTCGGCAG CGGCCGGGAC GAACGACAAC GTGGCGGGCA GCGGGGTGAT GGTGAACCTC
GCCGGGCAGG GCTCCAAGCT CGGATTCCTC GGCTCGGAGG CGGGTTTCAG CACCGACACC
GTCACGGTGG CGTACACCGA CGGCACGTCC AGCACGGGCA GTCTCGGCTT CCCGAACTGG
TGCTGCTCGT CGCCGACCGG CTACGGCGCC ACGCCGGCGA TCGTCACCGA TCACCGGAAC
ACGCCGAGCG GACCGGCGAA CTTCGGGACC GCTTACGACG TGTTCTACAA CTCGATCGCC
ATTGATGCGA CGAAGACGGT GAAGACCGTT ACTGTTCCGA GCGACCCGGC TATTCATGTC
TTCGCGATGA CGGTTCAGCC CTGA
 
Protein sequence
MSRTAPRRSL FGAVAAAVMV FGAVTAAAPA VSASTTAATP AINTHPSYNG LSLTPPMGFN 
DWAGFECNSD MNEALFTKTA DEIVKLGLNK LGYDYVNIDD CWMQKTRDAN GDLQVDATRF
PHGLKWLGDY IHSKGLKFGI YEDAGYYTCQ GAAGSYGHFQ QDADLYASWG VDYLKLDYCY
EPMDQFPGKT ESQVAQIVYT EASQALLNTH RPMLFSESAP AYVCCSGSDF TDELTWLYQH
GNLWRFGSDI YDAWPSVLEN YSEDNTPGLA QWAGPGHWND ADMLEIGNGG LTPTEEQTQM
TLWAEMASPI LLSTDLSKLT PAEVGIVSNP DVVAVDQDRL GAQGTIVQSG TGYDVLAKPL
ADGDVSVVLF NKGDTAQTVT TTAAKIGLPS RGAPFQLTDL VSKATSASDG TISASLAPHS
TVIYRVHPGG DKHLPIHTAA TISSAPLAAG VPTKVSVSFA NHGYFDAQQP SVTLKLPTGW
TATPASVSLK NVKPGTSATA TFVVTASAPP PGKVTTTLTT SVAYRDRGRP ASDSADLSNV
TNTPFPTLAG AFNNTAITDE TNTAPGNFDG DGDSYSAQSL ATAGATPGAT ISAGGTTFTW
PSAAAGTNDN VAGSGVMVNL AGQGSKLGFL GSEAGFSTDT VTVAYTDGTS STGSLGFPNW
CCSSPTGYGA TPAIVTDHRN TPSGPANFGT AYDVFYNSIA IDATKTVKTV TVPSDPAIHV
FAMTVQP