Gene Caci_5157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5157 
Symbol 
ID8336511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5923086 
End bp5924051 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content72% 
IMG OID644958255 
Product5-dehydro-4-deoxyglucarate dehydratase 
Protein accessionYP_003115857 
Protein GI256394293 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0414541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0087046 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTACCC TGACAGACCG CCTCGACGGC CTGCTCTTCT TCCCTGTCAC GCCGTTCACC 
CCGGACCAGG GGCACGTCGA CCTCGACGCC TTCGCGGCGC ATCTGGAGAG CCGGCTGGCG
CTGCTGGATC CGGCGCGTCC TGGGCTGTCG GCGGTGTTCG CGGCGTGCGG GACCGGCGAG
TTCTTCTCGC TGGACCAGCG CGAGTACGCC GAGGTGCTGC GGGTCGCGGT GCAGGTCACG
GCGGGGCGGG CGCCGGTGCT CGGCGGTGTC GGCTACGGGG CGCCGCTGGC CGCGTCGTTC
GTGCGGGCCG CTGAGCAGGC CGGGGTCGAC GGGCTGCTGG TCCTGCCGCC CTACCTGGTC
TCCGGCAGCC AGCAGGGGCT CGCCGACCAC TATCGGAGCA TCGCCGCCTC CACCGAGCTG
GACCTGATCA TCTACCAGCG CGACAACGTC ACCTTCGCCC CGGAGACCGT CGCGGACCTG
GCCGAGGTGC CGAACATCAT CGGGTTCAAG GACGGCCGGG GCGACCTGGA CCTGATGCAG
CGCATCGTCG CGGCGGTCCG CATGCGGCAC GGCGCCGACC GCCTGTTGTT CCTCAACGGC
CTGCCCACCG CCGAGATGAC GCAGCTCGCC TACCGCGGGA TCGGCGTCCC GCTGTACTCC
TCGGCGGTGT TCTGCTTCGC CCCGGACATC GCGCTGGCGT TCTACCACGC CTGCCGCGAG
GGCGACGGGG CGCTCGCCGA CGCGCTGATC GACCGCTTCT ACAAGCCGCT GGTCGAGCTG
CGCAACAAGG GCGCCGGCTA CGCGGTCTCG CTGGTGAAGG CCGGTGTGCG CCTGGACGGG
CTGGACGCCG GACCGGTGCG CTCACCGCTG ACCGAGCCGG CTCCGGAGCA CCTGGAGCAG
CTCGAGCAGC TGATCGCCGA CGGCCGCGCG GTCCTGGCCG AGCACAAGGT CGGAGCCGCG
GCGTGA
 
Protein sequence
MSTLTDRLDG LLFFPVTPFT PDQGHVDLDA FAAHLESRLA LLDPARPGLS AVFAACGTGE 
FFSLDQREYA EVLRVAVQVT AGRAPVLGGV GYGAPLAASF VRAAEQAGVD GLLVLPPYLV
SGSQQGLADH YRSIAASTEL DLIIYQRDNV TFAPETVADL AEVPNIIGFK DGRGDLDLMQ
RIVAAVRMRH GADRLLFLNG LPTAEMTQLA YRGIGVPLYS SAVFCFAPDI ALAFYHACRE
GDGALADALI DRFYKPLVEL RNKGAGYAVS LVKAGVRLDG LDAGPVRSPL TEPAPEHLEQ
LEQLIADGRA VLAEHKVGAA A