Gene Information |
Locus tag | Caci_0994 |
Symbol | |
ID | 8332328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 1133539 |
End bp | 1135413 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644954143 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_003111763 |
Protein GI | 256390199 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | |
|
|
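The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function name `check_cds_lengths` is hypothetical, and it assumes that Start bp and End bp are inclusive coordinates and that the reported gene length includes the stop codon.

```python
def check_cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int, int]:
    """Return (span in bp, leftover bases after whole codons, expected protein length).

    Assumes inclusive coordinates and a CDS whose last codon is a stop codon.
    """
    # Inclusive coordinates: both the start and end positions belong to the gene.
    span = end_bp - start_bp + 1
    # A non-spliced CDS should be a whole number of codons.
    codons, remainder = divmod(span, 3)
    # The final codon is the stop codon and does not encode an amino acid.
    expected_protein_length = codons - 1
    return span, remainder, expected_protein_length


# Values copied from the Gene Information table for Caci_0994.
span, remainder, protein_len = check_cds_lengths(1133539, 1135413)
print(span, remainder, protein_len)  # 1875 0 624
```

Under these assumptions the result agrees with the Gene Length (1875 bp) and Protein Length (624 aa) fields.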
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.165504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00173885 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGGAG CCACGGGGCT CACCGCCGGT GCGAAAGTCC TGATCACCGG CGGCGCCGGC TTCATCGGCT CCACGGTCGC CTCCGCCTGC CTGGACGCCG ACCTGGTCCC GGTGATCCTG GACAACCTGT CCACCGGCCG CAGCGAGTTC ACCGAGGGCC GGATCTTCTA TGAGGGCGAC ATCGCCGACG CCGCGCTCCT CGACCGGGTC TTCGCCGCGC AGCCGGACAT CGCCGCGGCA GTGCACTGCG CGGCGCTGAC GAACGTGCCG GAGTCGGTCG CCAACCCGAT CCGGTACTAC CGCGAGAACG TCACCAAGAC CCTGGAGCTG ATCGAGGGGT TGGTCCGCAA CGGCTGCCGC CGCATGGTGT TCTCCTCCTC CGCGTCCGTC TACGCCGCGG GTTCCTCCGG TCCTTACGCG CGCTCTAAGG CGATCACGGA GTGGGTGCTG GAGGACGTGG CGCGCGCCGG CGATCTGCAA GCCGTCGCCC TGCGCTACTT CACCCCGATC GGCGCCGATC CTGACTTCCA TACCGGTACT CCGAGCCCGG AAGCGCTGCA TGTCCTGGAC AAGGTGACCA CCGCCTACCG GAGCGACGAG CCGTTTCACA TCGCTGGCAC CGACTTCTCG CCCTTCGATA GCCGCCGCGA CGCTCGCGAC TACATCCATG TCTGGGACCT GGCTGAGGCG CACGTCGCGG CGCTGCGGAA CTTCGACGCG ATCGTGGCGC GCCATGCGTC CCACACCGTC CCGTACGAGG TGATCAATCT CGGCGCCGGG GACGGTACGA CGGTCCCCCA GCCGGCTCTC GAGGCGCGTC AGCCGCTCGG CTGGGCGCCG CGGTACTCCG TCGGGACGGG TATTCGGGAC GCGCTGACGT GGGCTCGCGT GCGTGCCGGT CTGCCCGCGC CGGCGCGGGT CCGGGCCGCG AAGCCCGCCT CGGTCGTCGA TGTCCTGATG CCGTACTACG GCGACGTCGG CATGATGCAG GAGGCGGTAC GCAGCGTCCT GGCACAGCGT GACCAGCACT GGCGGCTGAC GGTGGTCGAC GACGGCGCCG AGCCGGGCGT CCCGGAGTGG TTCGCCGGCC TGATCGCCGA GCACGGACCC GACAAGATCC GCTACCAGCG CAACCCGGTC AACCTCGGCA TCACCGAGAA CTTCCAGAAA TGCCTGAGCC TGGTCACCCA TCCGCTGGTC ACCATGATCG GCTGCGACGA CCGCATGCTT CCGGACTACA TCGGAACCGT CCGAGCCCTG ATGCGCGACT ACCCCCGCGT CTCCCTCGCC CAGCCCGGCG TGGAGGTCAT CGACGGAGCC GGCGAGGTCG TCGAACCCTG GGTCGACAAG GTCAAGCGGC GCCTCTACGC CCCCCGCGTC CATGGCGCCC TGGTCCTGAG CGGCGAATCC CTGGCCGTCA GCCTCCTGCG CGGCAACTGG ATGTACTTCC CAGCCATCTG CTGGCGCGCC GACGTCATCA CCGAAGTCGG CTTCGACCCC CACCTCCGCG TGATCCAGGA CCTGGCCCTG ACCCTCGAGT GGGTCCGCGC CGGCGCCCAG ATCGTCGTCA GCGACACCAT CTGCTTCCAG TACCGCCGCC ACGCAGTCAG CCTCTCCAGC GAACAAGCCA CCACCGGCGC CCGCTTCACC GAAGAACGCA CCTTCTTCCT CGACGAAGCC GCCCGCATGG ACCGCCTAGG CTGGCGCCAC GCCGCCCGCA CAGCCCGCCT CCACCTCTCC TCACGCCTCC ACGCGGCCAC GATGCTCCCC AGCGCCCTCC GCCGCGGCAG CCGCGACGGC 
GTCCGAACCC TGGCGGCCTA CGCCTTCGGA CCCTCCCGGC GCCCCGGAGG CTCGAGCGGA GGACCAGCAC GGTGA
|
Protein sequence | MSGATGLTAG AKVLITGGAG FIGSTVASAC LDADLVPVIL DNLSTGRSEF TEGRIFYEGD IADAALLDRV FAAQPDIAAA VHCAALTNVP ESVANPIRYY RENVTKTLEL IEGLVRNGCR RMVFSSSASV YAAGSSGPYA RSKAITEWVL EDVARAGDLQ AVALRYFTPI GADPDFHTGT PSPEALHVLD KVTTAYRSDE PFHIAGTDFS PFDSRRDARD YIHVWDLAEA HVAALRNFDA IVARHASHTV PYEVINLGAG DGTTVPQPAL EARQPLGWAP RYSVGTGIRD ALTWARVRAG LPAPARVRAA KPASVVDVLM PYYGDVGMMQ EAVRSVLAQR DQHWRLTVVD DGAEPGVPEW FAGLIAEHGP DKIRYQRNPV NLGITENFQK CLSLVTHPLV TMIGCDDRML PDYIGTVRAL MRDYPRVSLA QPGVEVIDGA GEVVEPWVDK VKRRLYAPRV HGALVLSGES LAVSLLRGNW MYFPAICWRA DVITEVGFDP HLRVIQDLAL TLEWVRAGAQ IVVSDTICFQ YRRHAVSLSS EQATTGARFT EERTFFLDEA ARMDRLGWRH AARTARLHLS SRLHAATMLP SALRRGSRDG VRTLAAYAFG PSRRPGGSSG GPAR
|
| |
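The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content field and the basic shape of the coding sequence could be checked. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and the start codon check only accepts ATG (the start of this gene), although translation table 11 also permits alternative start codons.

```python
# Stop codons in translation table 11 (same as the standard genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole number of codons, ATG start, stop codon at the end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Only the first two display blocks of the gene sequence, for illustration.
first_blocks = clean_sequence("ATGAGCGGAG CCACGGGGCT")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.7
```

Running `gc_fraction` on the full cleaned gene sequence would give a value to compare against the reported GC content of 70%; the two blocks used here are only a small sample of the gene.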