Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7101 |
Symbol | |
ID | 8338468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 8260216 |
End bp | 8262093 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644960182 |
Product | Beta-galactosidase |
Protein accession | YP_003117772 |
Protein GI | 256396208 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.172989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCCACG AACGCGTACT GACCATCGAC GGCGGCCGGT TCCTGCGGGG CGGGCGGGAG CACCGGATCG TCTCCGCGGC GATCCACTAC TTCCGGATCC ATCCGGACCT GTGGCGCGAC CGGCTGCAGC GGTTGCGCGC CATGGGCTGC AACACCGTCG AGTGCTACAT CGCCTGGAAC TTCCATCAGC CGACGCCGGC GGCGCCGCGG TTCGACGGCT GGCGGGACGT CGCCGGATTC GTGCGGCTGG CAGGGGAACT CGGCTTCGAT GTGATCGCGC GTCCCGGCCC TTATATCTGT GCGGAGTGGG ACTTCGGCGG GCTGCCGGCG TGGCTGCTGG CCGATGAGAA CGTGCGGCTG CGCACCACCG ATCCGGTCTA TCTGGCCGCC GTGGACGCGT GGTTCGACGA GCTGATCCCG GTCCTGGCCG AGCTCCAGGC GACGCGCGGC GGACCGGTCG TGGCGGTGCA GATCGAGAAC GAGTACGGCA GCTTCGGCGC CGATCCCGAC TACCTCGACC ACCTTCGCAA GGGTCTGATC GAGCGCGGCG TGGACACTTT GCTGTTCACC TCCGACGGCC CGCAGGAGCT GATGCTGGCC GGCGGCACGG TCCCGGACGT GCTGGCCACC GTGAACTTCG GCTCGCGCGC CGACGAGGCG TTCGCGACGC TGCGCCGCGT CCGCCCGGAC GACCCGCCGG TGTGCATGGA GTTCTGGAAC GGCTGGTTCG ATCACTTCGG CGAGCCACAC CACACCCGCA GCGCGCAGGA CGCCGCACGC TCCCTCGACG AGATCCTCGC CGCCGGCGGC TCGGTCAACT TCTACATGGG GCACGGCGGC ACCAACTTCG GGTTCTGGGC GGGCGCCAAC CATTCCGGCG TGGGCACCGG CGATCCCGGA TATCAGCCCA CGATCACCAG CTACGACTAC GACGCGCCGG TCGGCGAGGC CGGCGAGCTG ACGCCGAAGT TCCACCTGTT CCGCGAGGTC GTCGGGCGAT ACGTCGAACT GCCCGATGCT CAGCCTCCCG CTCCCCTGCC CCGTTTGATG CCGCAAACCG TTGCCGCGCC TCGGATCGCG GCGCTGCGAG ACCGCCTGGA CCTGCTGGCG ACGGACCCGA TCCACCACCC GACGCCGCAA CCGATCGAGA AGCTCGGGCA CGGCTTCGGG CTCGTCCACT ACCGCCGCCG CCTCGACGGT CCCGCTCGTA CCCACACGCT GCGGATCGAG GGTGTCCGCG ACCGCGCGCA GGTCTTCGCG GACGGAAAGC TGCTCGGGAT GGTAGAGCGT GACATACCCG AGCGGACGCT GGATCTCCAG ATCCCGGATG AGGGCCTGGA TCTGGAGCTC CTCGTCGAGC CGCTGGGCCG GGTGAACTAC GGCCCGCATC TGGCCGATCG CAAGGGCCTG ATCGGCGGCG TGCGGCTGGA CCACCAGTTC CAGTTCGGAT GGGAGCACCG GGTGCTGCCG CTGGACGATC CGACAGGTGC GTTGGCGCTG GAGAATCAGG AGGCTGTAAC GGCGAACCAG ACTGCCGGTC CCGCTTTCCA CCGCGCCGCG ATCACCGTCC GCGAGCCCGC CGACGGCTTC CTCGCCGTCC CCTCCACGGC GCGAAGTCTG GTCTGGCTCA ACGGATTCCT GCTCGGACGG CTGTGGGACC GGGGACCGCA GGTCACGCTC TACGCCCCGG CGCCGCTGTG GCGCGCCGGC GCGAACGAGA TCGTGGTGCT GGCGCTGGAG CCGGATGCCG GTACGCAGAG CCCTGATGCG CAGAGCCCCA GTGCACCGAG CCCTGATGCA CAGGGCCTGG AGATCGAGCT GCGCGGCGAG CCGGATCTCG GCCCGCTCGC GACGCCCTCC ACCCACGCGG ACTACTGA
|
Protein sequence | MAHERVLTID GGRFLRGGRE HRIVSAAIHY FRIHPDLWRD RLQRLRAMGC NTVECYIAWN FHQPTPAAPR FDGWRDVAGF VRLAGELGFD VIARPGPYIC AEWDFGGLPA WLLADENVRL RTTDPVYLAA VDAWFDELIP VLAELQATRG GPVVAVQIEN EYGSFGADPD YLDHLRKGLI ERGVDTLLFT SDGPQELMLA GGTVPDVLAT VNFGSRADEA FATLRRVRPD DPPVCMEFWN GWFDHFGEPH HTRSAQDAAR SLDEILAAGG SVNFYMGHGG TNFGFWAGAN HSGVGTGDPG YQPTITSYDY DAPVGEAGEL TPKFHLFREV VGRYVELPDA QPPAPLPRLM PQTVAAPRIA ALRDRLDLLA TDPIHHPTPQ PIEKLGHGFG LVHYRRRLDG PARTHTLRIE GVRDRAQVFA DGKLLGMVER DIPERTLDLQ IPDEGLDLEL LVEPLGRVNY GPHLADRKGL IGGVRLDHQF QFGWEHRVLP LDDPTGALAL ENQEAVTANQ TAGPAFHRAA ITVREPADGF LAVPSTARSL VWLNGFLLGR LWDRGPQVTL YAPAPLWRAG ANEIVVLALE PDAGTQSPDA QSPSAPSPDA QGLEIELRGE PDLGPLATPS THADY
|
| |