Gene Caci_0994 details


Gene Information

Locus tag: Caci_0994
Symbol:
ID: 8332328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1133539
End bp: 1135413
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 70%
IMG OID: 644954143
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_003111763
Protein GI: 256390199
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID:
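
The coordinates and lengths listed above are mutually consistent, which a few lines of Python can confirm. This is only a sketch: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 1133539
end_bp = 1135413

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1875 624
```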


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.165504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00173885
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCGGAG CCACGGGGCT CACCGCCGGT GCGAAAGTCC TGATCACCGG CGGCGCCGGC 
TTCATCGGCT CCACGGTCGC CTCCGCCTGC CTGGACGCCG ACCTGGTCCC GGTGATCCTG
GACAACCTGT CCACCGGCCG CAGCGAGTTC ACCGAGGGCC GGATCTTCTA TGAGGGCGAC
ATCGCCGACG CCGCGCTCCT CGACCGGGTC TTCGCCGCGC AGCCGGACAT CGCCGCGGCA
GTGCACTGCG CGGCGCTGAC GAACGTGCCG GAGTCGGTCG CCAACCCGAT CCGGTACTAC
CGCGAGAACG TCACCAAGAC CCTGGAGCTG ATCGAGGGGT TGGTCCGCAA CGGCTGCCGC
CGCATGGTGT TCTCCTCCTC CGCGTCCGTC TACGCCGCGG GTTCCTCCGG TCCTTACGCG
CGCTCTAAGG CGATCACGGA GTGGGTGCTG GAGGACGTGG CGCGCGCCGG CGATCTGCAA
GCCGTCGCCC TGCGCTACTT CACCCCGATC GGCGCCGATC CTGACTTCCA TACCGGTACT
CCGAGCCCGG AAGCGCTGCA TGTCCTGGAC AAGGTGACCA CCGCCTACCG GAGCGACGAG
CCGTTTCACA TCGCTGGCAC CGACTTCTCG CCCTTCGATA GCCGCCGCGA CGCTCGCGAC
TACATCCATG TCTGGGACCT GGCTGAGGCG CACGTCGCGG CGCTGCGGAA CTTCGACGCG
ATCGTGGCGC GCCATGCGTC CCACACCGTC CCGTACGAGG TGATCAATCT CGGCGCCGGG
GACGGTACGA CGGTCCCCCA GCCGGCTCTC GAGGCGCGTC AGCCGCTCGG CTGGGCGCCG
CGGTACTCCG TCGGGACGGG TATTCGGGAC GCGCTGACGT GGGCTCGCGT GCGTGCCGGT
CTGCCCGCGC CGGCGCGGGT CCGGGCCGCG AAGCCCGCCT CGGTCGTCGA TGTCCTGATG
CCGTACTACG GCGACGTCGG CATGATGCAG GAGGCGGTAC GCAGCGTCCT GGCACAGCGT
GACCAGCACT GGCGGCTGAC GGTGGTCGAC GACGGCGCCG AGCCGGGCGT CCCGGAGTGG
TTCGCCGGCC TGATCGCCGA GCACGGACCC GACAAGATCC GCTACCAGCG CAACCCGGTC
AACCTCGGCA TCACCGAGAA CTTCCAGAAA TGCCTGAGCC TGGTCACCCA TCCGCTGGTC
ACCATGATCG GCTGCGACGA CCGCATGCTT CCGGACTACA TCGGAACCGT CCGAGCCCTG
ATGCGCGACT ACCCCCGCGT CTCCCTCGCC CAGCCCGGCG TGGAGGTCAT CGACGGAGCC
GGCGAGGTCG TCGAACCCTG GGTCGACAAG GTCAAGCGGC GCCTCTACGC CCCCCGCGTC
CATGGCGCCC TGGTCCTGAG CGGCGAATCC CTGGCCGTCA GCCTCCTGCG CGGCAACTGG
ATGTACTTCC CAGCCATCTG CTGGCGCGCC GACGTCATCA CCGAAGTCGG CTTCGACCCC
CACCTCCGCG TGATCCAGGA CCTGGCCCTG ACCCTCGAGT GGGTCCGCGC CGGCGCCCAG
ATCGTCGTCA GCGACACCAT CTGCTTCCAG TACCGCCGCC ACGCAGTCAG CCTCTCCAGC
GAACAAGCCA CCACCGGCGC CCGCTTCACC GAAGAACGCA CCTTCTTCCT CGACGAAGCC
GCCCGCATGG ACCGCCTAGG CTGGCGCCAC GCCGCCCGCA CAGCCCGCCT CCACCTCTCC
TCACGCCTCC ACGCGGCCAC GATGCTCCCC AGCGCCCTCC GCCGCGGCAG CCGCGACGGC
GTCCGAACCC TGGCGGCCTA CGCCTTCGGA CCCTCCCGGC GCCCCGGAGG CTCGAGCGGA
GGACCAGCAC GGTGA
 
Protein sequence
MSGATGLTAG AKVLITGGAG FIGSTVASAC LDADLVPVIL DNLSTGRSEF TEGRIFYEGD 
IADAALLDRV FAAQPDIAAA VHCAALTNVP ESVANPIRYY RENVTKTLEL IEGLVRNGCR
RMVFSSSASV YAAGSSGPYA RSKAITEWVL EDVARAGDLQ AVALRYFTPI GADPDFHTGT
PSPEALHVLD KVTTAYRSDE PFHIAGTDFS PFDSRRDARD YIHVWDLAEA HVAALRNFDA
IVARHASHTV PYEVINLGAG DGTTVPQPAL EARQPLGWAP RYSVGTGIRD ALTWARVRAG
LPAPARVRAA KPASVVDVLM PYYGDVGMMQ EAVRSVLAQR DQHWRLTVVD DGAEPGVPEW
FAGLIAEHGP DKIRYQRNPV NLGITENFQK CLSLVTHPLV TMIGCDDRML PDYIGTVRAL
MRDYPRVSLA QPGVEVIDGA GEVVEPWVDK VKRRLYAPRV HGALVLSGES LAVSLLRGNW
MYFPAICWRA DVITEVGFDP HLRVIQDLAL TLEWVRAGAQ IVVSDTICFQ YRRHAVSLSS
EQATTGARFT EERTFFLDEA ARMDRLGWRH AARTARLHLS SRLHAATMLP SALRRGSRDG
VRTLAAYAFG PSRRPGGSSG GPAR
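
Both sequences above are printed in blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one way to strip the spacing, compute the GC fraction, and translate the gene into protein. The function names are hypothetical helpers, not part of any database tool. The codon table is the standard one, and NCBI translation table 11 assigns the same amino acids to all 64 codons. The sketch does not handle alternative start codons, which does not matter here because the gene starts with ATG.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
# Translation table 11 differs from table 1 only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence as printed above.
first_row = "ATGAGCGGAG CCACGGGGCT CACCGCCGGT GCGAAAGTCC TGATCACCGG CGGCGCCGGC"
print(translate(clean_sequence(first_row)))  # MSGATGLTAGAKVLITGGAG
```

Applying the same functions to the full gene sequence should give a cleaned string of 1875 bases and a translation that can be compared against the protein sequence listed above.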