Gene Information |
Locus tag | Caci_4420 |
Symbol | |
ID | 8335774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5020627 |
End bp | 5022381 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957523 |
Product | Beta-galactosidase |
Protein accession | YP_003115125 |
Protein GI | 256393561 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
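
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5020627, 5022381)
print(length)                  # 1755, matches "Gene Length"
print(protein_length(length))  # 584, matches "Protein Length"
```
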
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.653544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.574053 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGTAC TCGACATCAC CGGCGACGGC TTCAGCCTCG ACGGTCAGCC CTTCCGGATC GTCTCCGGCG GCCTGCACTA TTTCCGAGTC CATCCGGCGC AGTGGTCCGA CCGGCTGCGC AAGGCCCGCC TGATGGGCCT GAACACCATC GACACCTACA TCCCGTGGAA CCTGCACGAG CGGCGCCCCG GCACGTTCGA CTTCGGCGGG ATCCTGGACC TGGCGGCGTT CCTGGACGCC GCCGCCGCCG AAGGGCTGCA CGTCCTGCTG CGGCCCGGGC CGTACATCTG CGGGGAGTGG GAGGGCGGCG GGCTGCCGTC GTGGCTGCTC GCCGACCCGG ATCTGGCGCT GCGCAGCACC GATCCGGCGT TCCTGCAGGC GGTCGAGGCG TACCTCGACG CGATCATGCC GATCGTGCTG CCCCGGCTGG GGACGCGCGG CGGACCGGTC ATCGCCGTGC AGGTGGAGAA CGAGTACGGG GCGTACGGCT CCGACACCGC CTATATGGAG CGGCTGTACG AGGCGCTGAC GTCGCGGGGT ATCGACGTAC CCTTCTTCAC CTCCGACCAG CCCAACGACC TGGCGGACGG CGCGCTGCCC GGCGTCCTTG CCACCGCGAA CTTCGGCGGC AAGGTGACCG CCTCGCTCGC GGCACTGCGT GCGCAGCAGC CGACCGGACC GCTGATGTGC GCGGAGTTCT GGAACGGCTG GTTCGACTAC TGGGGCGGCA CGCACGCGCA GCGCTCCGCC GAGGACGCCG GCGCCGCGCT GGAGGAGATG CTGCAAGCCG GCGCTTCGGT GAACTTCTAC ATGTTCCACG GCGGCACCAA CTTCGGATTC ACCAACGGCG CCAACGACAA GGGGACGTAC CGCGCCACGG TCACGTCCTA CGACTACGAC TCGCCGCTGG ACGAAGCCGG GGACCCGACG GAGAAGTACC GGCGCTTCCG CTCCATCATC GGCAAGTACG AGACGGTGCC GGACGAGGAA GTCCCGGAGC CGGGGGAGAA GCTGGCGCCG GTCTCGGTGG CTCTGACCGG GCGCGCGGCG TTGTTCTCCG AGGCGAGTTT GGCTTCCTTG GGCGTGGCGC AGAACTCTGA GACACCGCTG ACGATGGAGC TGCTCGGTCA GGACTTCGGT TTCGTGCTCT ACGAAACCCG GCTTCCCGCG GCGGGTCCGG CGACGCTGAC GTTCGACGAG ATCGGCGACC GCGCGCAGGT GTTCGTCGAC GGTCAGCCGG TCGGCGTGCT GGAGCGCGAG CGGCACGAGC ATGTGCTGTC GTTCCTGGTG CCGCGCGCCG ATGCGCAGCT GCGCGTGCTA GTGGAGAACC AGGGTCGGGT GAACTACGGC CAGAAGCTCG CCGATCGCAA GGGTCTGATA GGCGCGGTCC ATCTCGACGG CGCGCCGCTC ACCGGCTGGA CTTCGCGTCC GCTGCCGCTG GACGACCTGA CCGGGCTGGC CTACGCCGAG CTCGACGGCC CGGCGGTCGG ACCCGGCTTC CACCGAGGCA CGTTCGACCT CGACCGATGC GCGGACACCT ACCTGCACCT GCCCGGCTGG ACCAAGGGCG TGGCCTGGAT CAACGGCTTC AACCTGGGTC GCTACTGGTC GCGCGGCCCG CAGGGGTCGT TGTACGTGCC CGGACCGGTG CTGCGTGCCG GAACGAACGA GCTGGTCGTC CTCGAGCTGC ACGGCGCGCG CGCCGCGGCG GCCGAGCTGC GGCCGGTCCC GGATTTGGGA CCGACGGAGC TGTGA
|
Protein sequence | MAVLDITGDG FSLDGQPFRI VSGGLHYFRV HPAQWSDRLR KARLMGLNTI DTYIPWNLHE RRPGTFDFGG ILDLAAFLDA AAAEGLHVLL RPGPYICGEW EGGGLPSWLL ADPDLALRST DPAFLQAVEA YLDAIMPIVL PRLGTRGGPV IAVQVENEYG AYGSDTAYME RLYEALTSRG IDVPFFTSDQ PNDLADGALP GVLATANFGG KVTASLAALR AQQPTGPLMC AEFWNGWFDY WGGTHAQRSA EDAGAALEEM LQAGASVNFY MFHGGTNFGF TNGANDKGTY RATVTSYDYD SPLDEAGDPT EKYRRFRSII GKYETVPDEE VPEPGEKLAP VSVALTGRAA LFSEASLASL GVAQNSETPL TMELLGQDFG FVLYETRLPA AGPATLTFDE IGDRAQVFVD GQPVGVLERE RHEHVLSFLV PRADAQLRVL VENQGRVNYG QKLADRKGLI GAVHLDGAPL TGWTSRPLPL DDLTGLAYAE LDGPAVGPGF HRGTFDLDRC ADTYLHLPGW TKGVAWINGF NLGRYWSRGP QGSLYVPGPV LRAGTNELVV LELHGARAAA AELRPVPDLG PTEL
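
The sequences above are printed in space-separated blocks of ten characters. The gene starts with GTG while the protein starts with M, which is expected under translation table 11, where GTG can act as an alternative start codon read as methionine. A minimal sketch for cleaning such a block-formatted sequence and running basic checks might look like the following. The helper names and the start-codon set (a common subset, not the full table 11 list) are assumptions made for illustration.

```python
# Common bacterial start codons (a subset chosen for illustration)
# and the standard stop codons used in translation table 11.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def check_cds(seq: str) -> dict:
    """Basic sanity checks for a coding sequence given on its coding strand."""
    return {
        "length_multiple_of_3": len(seq) % 3 == 0,
        "starts_with_start_codon": seq[:3] in COMMON_START_CODONS,
        "ends_with_stop_codon": seq[-3:] in STOP_CODONS,
    }


# Toy input: the first two blocks of the gene sequence shown above.
fragment = clean_sequence("GTGGCAGTAC TCGACATCAC")
print(fragment)  # GTGGCAGTACTCGACATCAC
print(len(fragment))  # 20
```

Passing the full "Gene sequence" field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field, and `check_cds` can confirm that the sequence begins with GTG and ends with the TGA stop codon.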