Gene Caci_7101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7101 
Symbol 
ID8338468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8260216 
End bp8262093 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content70% 
IMG OID644960182 
ProductBeta-galactosidase 
Protein accessionYP_003117772 
Protein GI256396208 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.172989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCCACG AACGCGTACT GACCATCGAC GGCGGCCGGT TCCTGCGGGG CGGGCGGGAG 
CACCGGATCG TCTCCGCGGC GATCCACTAC TTCCGGATCC ATCCGGACCT GTGGCGCGAC
CGGCTGCAGC GGTTGCGCGC CATGGGCTGC AACACCGTCG AGTGCTACAT CGCCTGGAAC
TTCCATCAGC CGACGCCGGC GGCGCCGCGG TTCGACGGCT GGCGGGACGT CGCCGGATTC
GTGCGGCTGG CAGGGGAACT CGGCTTCGAT GTGATCGCGC GTCCCGGCCC TTATATCTGT
GCGGAGTGGG ACTTCGGCGG GCTGCCGGCG TGGCTGCTGG CCGATGAGAA CGTGCGGCTG
CGCACCACCG ATCCGGTCTA TCTGGCCGCC GTGGACGCGT GGTTCGACGA GCTGATCCCG
GTCCTGGCCG AGCTCCAGGC GACGCGCGGC GGACCGGTCG TGGCGGTGCA GATCGAGAAC
GAGTACGGCA GCTTCGGCGC CGATCCCGAC TACCTCGACC ACCTTCGCAA GGGTCTGATC
GAGCGCGGCG TGGACACTTT GCTGTTCACC TCCGACGGCC CGCAGGAGCT GATGCTGGCC
GGCGGCACGG TCCCGGACGT GCTGGCCACC GTGAACTTCG GCTCGCGCGC CGACGAGGCG
TTCGCGACGC TGCGCCGCGT CCGCCCGGAC GACCCGCCGG TGTGCATGGA GTTCTGGAAC
GGCTGGTTCG ATCACTTCGG CGAGCCACAC CACACCCGCA GCGCGCAGGA CGCCGCACGC
TCCCTCGACG AGATCCTCGC CGCCGGCGGC TCGGTCAACT TCTACATGGG GCACGGCGGC
ACCAACTTCG GGTTCTGGGC GGGCGCCAAC CATTCCGGCG TGGGCACCGG CGATCCCGGA
TATCAGCCCA CGATCACCAG CTACGACTAC GACGCGCCGG TCGGCGAGGC CGGCGAGCTG
ACGCCGAAGT TCCACCTGTT CCGCGAGGTC GTCGGGCGAT ACGTCGAACT GCCCGATGCT
CAGCCTCCCG CTCCCCTGCC CCGTTTGATG CCGCAAACCG TTGCCGCGCC TCGGATCGCG
GCGCTGCGAG ACCGCCTGGA CCTGCTGGCG ACGGACCCGA TCCACCACCC GACGCCGCAA
CCGATCGAGA AGCTCGGGCA CGGCTTCGGG CTCGTCCACT ACCGCCGCCG CCTCGACGGT
CCCGCTCGTA CCCACACGCT GCGGATCGAG GGTGTCCGCG ACCGCGCGCA GGTCTTCGCG
GACGGAAAGC TGCTCGGGAT GGTAGAGCGT GACATACCCG AGCGGACGCT GGATCTCCAG
ATCCCGGATG AGGGCCTGGA TCTGGAGCTC CTCGTCGAGC CGCTGGGCCG GGTGAACTAC
GGCCCGCATC TGGCCGATCG CAAGGGCCTG ATCGGCGGCG TGCGGCTGGA CCACCAGTTC
CAGTTCGGAT GGGAGCACCG GGTGCTGCCG CTGGACGATC CGACAGGTGC GTTGGCGCTG
GAGAATCAGG AGGCTGTAAC GGCGAACCAG ACTGCCGGTC CCGCTTTCCA CCGCGCCGCG
ATCACCGTCC GCGAGCCCGC CGACGGCTTC CTCGCCGTCC CCTCCACGGC GCGAAGTCTG
GTCTGGCTCA ACGGATTCCT GCTCGGACGG CTGTGGGACC GGGGACCGCA GGTCACGCTC
TACGCCCCGG CGCCGCTGTG GCGCGCCGGC GCGAACGAGA TCGTGGTGCT GGCGCTGGAG
CCGGATGCCG GTACGCAGAG CCCTGATGCG CAGAGCCCCA GTGCACCGAG CCCTGATGCA
CAGGGCCTGG AGATCGAGCT GCGCGGCGAG CCGGATCTCG GCCCGCTCGC GACGCCCTCC
ACCCACGCGG ACTACTGA
 
Protein sequence
MAHERVLTID GGRFLRGGRE HRIVSAAIHY FRIHPDLWRD RLQRLRAMGC NTVECYIAWN 
FHQPTPAAPR FDGWRDVAGF VRLAGELGFD VIARPGPYIC AEWDFGGLPA WLLADENVRL
RTTDPVYLAA VDAWFDELIP VLAELQATRG GPVVAVQIEN EYGSFGADPD YLDHLRKGLI
ERGVDTLLFT SDGPQELMLA GGTVPDVLAT VNFGSRADEA FATLRRVRPD DPPVCMEFWN
GWFDHFGEPH HTRSAQDAAR SLDEILAAGG SVNFYMGHGG TNFGFWAGAN HSGVGTGDPG
YQPTITSYDY DAPVGEAGEL TPKFHLFREV VGRYVELPDA QPPAPLPRLM PQTVAAPRIA
ALRDRLDLLA TDPIHHPTPQ PIEKLGHGFG LVHYRRRLDG PARTHTLRIE GVRDRAQVFA
DGKLLGMVER DIPERTLDLQ IPDEGLDLEL LVEPLGRVNY GPHLADRKGL IGGVRLDHQF
QFGWEHRVLP LDDPTGALAL ENQEAVTANQ TAGPAFHRAA ITVREPADGF LAVPSTARSL
VWLNGFLLGR LWDRGPQVTL YAPAPLWRAG ANEIVVLALE PDAGTQSPDA QSPSAPSPDA
QGLEIELRGE PDLGPLATPS THADY