Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5013 |
Symbol | |
ID | 8336367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5740804 |
End bp | 5741949 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644958112 |
Product | galactonate dehydratase |
Protein accession | YP_003115714 |
Protein GI | 256394150 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.527728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA CCTCGCTGGA GACGTTCCTG GTCGCGCCCC GGTGGCTGTT CCTGCGCATC GGGACCGACG AGGGCCTGGC CGGCTGGGGC GAGCCGGTCC TGGAGGGGCG CGCCGAGACG GTCCGCGCCG CCGTCGCCGA ACTGTCCGAA TACCTGATCG GCGAAGACCC GCTGCGCCTG GAGCACCACT GGCAGGTGCT GACCAAGGGC GGCTTCTACC GGGGCGGACC GGTGCTGTCC TCGGCCGTGG CCGGTATCGA CCAGGCGCTG TGGGACCTGG CCGGGAAGAC CTACGGCGTG CCGGTGCACC AGCTGCTCGG CGGACCGGTC CGCGAGCGGG CCCGGGTGTA CGCCTGGATC GGCGGCGACC GTCCGACGCA GGTCGCCGAA CTGGCCGCCG AGCAGGTCGA AGCCGGGTTC ACCGCGGTGA AGATGAACGG CTCGGCGCAG ATGCGGCACA TCGACACCCC GGCTGCCACC GCGGCGGTCA CCGCCCGGGT CGCCGCCGTG CGCGAGGTGC TGGGCGCCGA TCGGGACGTC GTCGTCGACT TCCACGGCCG GATGTCCACC GCGATGTCGC GGCGTCTGCT GCGGATGCTC GAACCCTTGC AGCCGTTGTT CGTCGAAGAG CCCGTGCTCC CGGAGAACAG CCGCGATCTG CGGTCCCTGG CCGAGTCTTC GGGCGTGCCG TTGGCTGTCG GCGAGCGCCT GTACTCCCGG TGGGACTTCC GGGACGTGCT GCCCAGCGGG ATCGCCGTGG CGCAGCCCGA CGTCTCGCAC GCCGGCGGCA TCTCCGAACT GCGCCGCATC GCCGCGGCCG CCGAGACCTA CGACGTGGCG ATGGCGCCGC ACTGCCCGCT CGGGCCGATC GCGCTGGCGG CCAGTCTTCA AGTGGACTTC GCCACGCCCA ACTTCCTGAT CCAGGAGCAG AGCCTGGGTC TGCACTACAA CCGCGGCAAC GAGATGCTCG ACTACCTGCT CGACCCGGAG CCGCTGCGGG TCCGCGACGG CCACATCGAC CGGCTGACCG GTCCGGGTCT GGGGATCGAG ATCGACGAGG CCGCGGTGCG CCGCGCCGAC GAGACCGGCC ACCACTGGCG GAACCCGGTG TGGCACCACC AGGACGGGTC GTTCGCGGAA TGGTGA
|
Protein sequence | MKITSLETFL VAPRWLFLRI GTDEGLAGWG EPVLEGRAET VRAAVAELSE YLIGEDPLRL EHHWQVLTKG GFYRGGPVLS SAVAGIDQAL WDLAGKTYGV PVHQLLGGPV RERARVYAWI GGDRPTQVAE LAAEQVEAGF TAVKMNGSAQ MRHIDTPAAT AAVTARVAAV REVLGADRDV VVDFHGRMST AMSRRLLRML EPLQPLFVEE PVLPENSRDL RSLAESSGVP LAVGERLYSR WDFRDVLPSG IAVAQPDVSH AGGISELRRI AAAAETYDVA MAPHCPLGPI ALAASLQVDF ATPNFLIQEQ SLGLHYNRGN EMLDYLLDPE PLRVRDGHID RLTGPGLGIE IDEAAVRRAD ETGHHWRNPV WHHQDGSFAE W
|
| |