Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4935 |
Symbol | |
ID | 8336289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5632371 |
End bp | 5634290 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644958034 |
Product | Beta-galactosidase |
Protein accession | YP_003115636 |
Protein GI | 256394072 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0997823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCGC TCGGCGATGC CACCGGCGGC CGCATCCTGT TCGGCGGCGA CTACAACCCC GAACAGTGGC CGCGCGAAGT CTGGGACGAG GACGTGCGCC TGATGCGCGA GGCCGGGGTG AACCTGGCCA CGGTCGGCGT CTTCTCCTGG GCCCTGCTCG AACCCCGGCC CGGCGAGCGC GACTTCGGCT GGCTCACCGA GGTGCTGGAT CTGTTGCACG CCAACGGCAT CGGCGTGGGG CTGGCCACCC CTACCGCGTC GCCGCCGCCG TGGATGGGCC ACCGCTGGCC CGAGACGCTG CCGCGCAACC CGGACGGGAC GATCCGGACG TACGGTTCGC GCAACGCCTA CTGTCCGTCC TCTCCGGTGT ACCGGGCCTT CGCCGACGCG ATCTGCGCCG ACCTGGCATC GGTCTACGCG CACCACCCGG CGCTGCGGAT GTGGCACATC GGCAACGAGT TCGGGACCGT CTGCCACTGC GACCGGTGCG CCGTGCGCTT CCGCCGCTGG CTCCAGGCCA GATACGGGGA CCTCGACCGG CTGAACGAAG CCTGGGGCAC CGCCTTCTGG TCGCAGCGCT ACGGCGACTG GGACGAGGTC ATCCCGCCGC GCCAGGTCCA GTACGTCATC AACCCCACCC AGGACCTGGA CTATCAGCGT TTTGCCTCCG ACCTGCTGCT GGAGGGCTTC ACGACGGAGC GCGACATCGT GCGCGGCGCC AACCCCGCCA TCCCGATCAC GACCAACTCC ATGACCTTCT TCAAGGCCAC GGACTTGTGG CGGTGGGGCG CCGAGGAGGA TTTCACCGCG CTGGACTCCT ACCCGGACCC CAACAGCCCG CGGGCTGCCC GCGACGGGGC GATGGCCCAG GACCTCATCC GCTCCGTGGG CGGCGGCAAG CCGTGGCTGA TGATGGAGCA GACCCCGGGC CGCGCCGGAT TCCGGCACGT CGCGACGCCC AAGCGTCCCG GTCTGAACCG CCTGTGGTCC CTGCAGGCGG TGGCGCGCGG CGCCGACGGG ATCCTGCACT TCCAGTGGCG CGCCTCCCAG CAGGGCGCCG AACGCACCCA CGGAGCGATG CTGCCGCATG CCGGCCCTGA TTCAAGGATC TTCCGCGAGG TGTGCGACCT GGGCAAGGAG CTGGGCGACC TGACCGGCGT CCTGGGCGCC AAGGTGGCCG CCGAGGTCGC CATCGTGCTC GACTGGGACT CCTGGCGCGC CGTCGAGCTG GACCACCAGC CGCACAGCGG ATTCCGCTAC GTCGACCGCA TCCGCGAGTA CTACACGCCG CTGTGGAAGA CGAACGTGAC CGTCGACTTC GTGCATCCGG AAGCCGATCT GAGCCGGTAC AAACTCGTCG TGGCACCGAA CCTTTATCAG GTCTGTGACG CTGCCGCCGA CAACGTCATC GGCTACGTGG AGCGCGGTGG AAACCTGGTC GTCGGCCCGT TCTCCGGCGT GGCCGACCCC GACGAGCGCA TCCGCCTCGG CGGGTACCCG GCGCCGTTCC GGGACCTGCT GGGACTGCGC ATCGAGGAGT ACTGGCCGCT GCCGGACGAC GAACCGCTCA CCCTGCGCTC AGAACTGTTC GGCGACTTCA GCGCGCGATC CTGGGCCGAG TGGCTGGCCA CCGGCGCCGG CACGGCGCTC GCCGAGATCG CCACGGGACC GCTGACAGGA ATCCCCGCGA TCGTGCGCAA CGCCTACGGC GCGGGGACCG CCTGGTACGT GGCCACGCTG CCCGAGGAGC GGGTGATGGC GCGCCTGCTC GCCCTCGCTT GCGACCAAGC CGGCGTCGAA TCGGTTCTCC CCGGTCTGCC GGAAGGAGTC GAGGCGGTGC GGCGGGGCGA ATACGTCTTC GTGCTCGACC ACGGCAGGGG GACCGTCGAG GTCCGGTCGA CGGTCGCGGC CACCCTGTAG
|
Protein sequence | MPSLGDATGG RILFGGDYNP EQWPREVWDE DVRLMREAGV NLATVGVFSW ALLEPRPGER DFGWLTEVLD LLHANGIGVG LATPTASPPP WMGHRWPETL PRNPDGTIRT YGSRNAYCPS SPVYRAFADA ICADLASVYA HHPALRMWHI GNEFGTVCHC DRCAVRFRRW LQARYGDLDR LNEAWGTAFW SQRYGDWDEV IPPRQVQYVI NPTQDLDYQR FASDLLLEGF TTERDIVRGA NPAIPITTNS MTFFKATDLW RWGAEEDFTA LDSYPDPNSP RAARDGAMAQ DLIRSVGGGK PWLMMEQTPG RAGFRHVATP KRPGLNRLWS LQAVARGADG ILHFQWRASQ QGAERTHGAM LPHAGPDSRI FREVCDLGKE LGDLTGVLGA KVAAEVAIVL DWDSWRAVEL DHQPHSGFRY VDRIREYYTP LWKTNVTVDF VHPEADLSRY KLVVAPNLYQ VCDAAADNVI GYVERGGNLV VGPFSGVADP DERIRLGGYP APFRDLLGLR IEEYWPLPDD EPLTLRSELF GDFSARSWAE WLATGAGTAL AEIATGPLTG IPAIVRNAYG AGTAWYVATL PEERVMARLL ALACDQAGVE SVLPGLPEGV EAVRRGEYVF VLDHGRGTVE VRSTVAATL
|
| |