Gene Information |
Locus tag | Caci_3784 |
Symbol | |
ID | 8335137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4278930 |
End bp | 4279958 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644956924 |
Product | aldo/keto reductase |
Protein accession | YP_003114527 |
Protein GI | 256392963 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
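The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention under which the listed lengths match) and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4278930, 4279958)
print(length, protein_length_aa(length))  # 1029 342
```

Both results match the Gene Length (1029 bp) and Protein Length (342 aa) fields.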
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00710905 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCC CCCTGCGTCC GTTCGGACGT ACCGGCGTCA AGGTCAGCGC GCTCGCGCTG GGCACGATGA TGTTCGGACC CCGCGGCAAC CCGGATCACG ACGACAGCAT CCGGATCGTC CACCGGGCCC TGGACGCCGG GATCAACCTG GTCGACACCG CGGACGTGTA CAGCCAGGGC GAGTCCGAGA CGATCGTCGG CAAGGCCCTG GCCGGCCGCC GCGACAGCGT CTTCCTGGCC ACGAAGTTCC ACGGCCGGAT GGGGGAGGAC GCCAACCGCT TCGGCAACTC CCGCCGGTGG ATCGTCAAGG CGGTCGAGGA ATCGCTCGCG CGTCTCCAGA CCGACCACAT CGACCTGTAC CAGGTGCACC GTCCCGAGGA CGACACCGAC ATCGACGAAA CCCTCGGCGC GCTGTCCGAT CTCGTCCACC AGGGCAAGAT CCGCTACATC GGCACCTCGA CCTTCGAGCC GTCCGGCATC GTGGAGGCCC AGTGGGTGGC CGAGAAGCGC GGCCGGGAGC GGGTGGTGGC CGAACAACCC CCGTACTCGG TCCTCGCCCG CGGCATCGAG CGCGAGGTCC TGCCGGTCGC GCAGAAATAC GGACTCGCGG TGATCCCCTG GAGCCCCCTG GCCGGAGGCT GGTTGACCGG CAAGTTCCGC GTCGGCGCAG CGCAGCCCGA GACCAGCCGC GGCGCCCAGC AGGGCCGCTT CGAGATCGGC GACCCGGCCA ACGCCGACAA ACTCCAGGCG GTCGAAGCCC TGGCCCTCCT CGCCGAGGAA GCAGGCATCT CACTGGTCCA CCTGGCGCTG GCGTTCGTGC TGGAGCACCC GGCCGTCACG GCGCCGATCA TCGGCCCCCG CACGTTCGAG CAGCTCGAAG GACAGCTCGA CGCGGCATCC CTGCGGTTGG CACCGGACGT CCTCGACGAG ATCGACAAGA TCGTGCCCCC GGGCGTCACC CTGTCCGCAC GCGATGCCGG ATATAACCCG CCGTCGGTCA CCGACTCCGC CCGTCGGCGC CGTGCCTGA
|
Protein sequence | MSIPLRPFGR TGVKVSALAL GTMMFGPRGN PDHDDSIRIV HRALDAGINL VDTADVYSQG ESETIVGKAL AGRRDSVFLA TKFHGRMGED ANRFGNSRRW IVKAVEESLA RLQTDHIDLY QVHRPEDDTD IDETLGALSD LVHQGKIRYI GTSTFEPSGI VEAQWVAEKR GRERVVAEQP PYSVLARGIE REVLPVAQKY GLAVIPWSPL AGGWLTGKFR VGAAQPETSR GAQQGRFEIG DPANADKLQA VEALALLAEE AGISLVHLAL AFVLEHPAVT APIIGPRTFE QLEGQLDAAS LRLAPDVLDE IDKIVPPGVT LSARDAGYNP PSVTDSARRR RA
|
| |
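The sequences above are displayed in space-separated blocks of ten characters, so the spaces must be removed before any analysis. The sketch below shows one way to do that, to compute a GC fraction, and to translate the coding sequence. It rests on stated assumptions: the helper names are hypothetical, and the codon table is built by hand rather than taken from a library. Translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in which codons may initiate translation; since this gene starts with ATG, no special start-codon handling is included.

```python
# Codon-to-amino-acid mapping shared by the standard code and table 11,
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGAGCATCC CCCTGCGTCC GTTCGGACGT"))  # MSIPLRPFGR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete Protein sequence and GC content fields.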