Gene Caci_6048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6048 
Symbol 
ID8337411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6970178 
End bp6971542 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content72% 
IMG OID644959152 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003116746 
Protein GI256395182 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.066301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAG TCGTCAGGAA TCCGGCCGGG ACCGGATCCA TCGAAGACAC GTGGTCCGCG 
CTCCCCGCGG GCCAGCAGCC GGAGTGGCCG GACGCCGCCG AGCTGTCCTC CGTCGTCTCC
GAGCTCCGCT CGTATCCGCC CCTGGTCTTC GCCGGGGAGG CCGACCAGCT CAAGGACCGC
ATCGCCGCGG TGTCCCGGGG CGAGGCCTTC CTGCTCACCG GCGGGGACTG CGCCGAGACC
TTCGCCGGGG TCACCGCCGA CGCGATCCGC GCCAAGCTCA AGACGCTGCT GCAGATGGCC
GTCGTGCTCA CCTACGCCGC CTCGCTGCCG GTGGTGAAGG TCGGGCGCAT CGCCGGGCAG
TACTCCAAGC CGCGCTCCAA GCCGACCGAG ACCCGCGACG GGGTCACCCT TCCGGCGTAC
CGCGGCGACT CCGTCAACGG CTTCGACTTC ACCCCCGAGG CCCGCACCCC GGACCCCAAG
CGCCTGCTGC GCCTGTACCA GGCCAGCGCC TCCACGCTGA ACCTGGTGCG CGCGTTCACC
ACCGGCGGCT ACGCCGACAT GCGCCAGGTG CACGCCTGGA ACCAGGACTT CGTGGCCGGC
TCGCCCTCCG GGGAGCGCTA CGAGGCGCTG GCCGGGGAGA TCGACCGCGC CCTGCAGTTC
ATGCGCGCCT GCGGCACCGA GCCGGAGGAA CTGCGCACCG TCGAGTTCTA CGCCGCGCAC
GAGGGCCTGG TGCTCCCGTA CGAGTCGGCG CTGACCCGCA TCGACTCGCG CACCGGCAAC
CCCTACGCGA CCAGCGGCCA CTACATCTGG ATCGGCGAGC GCACCCGGGA CCTGGACGGC
GCGCACGTGG AGTACTTCAG CCGGATCAGC AACCCGATCG GCATCAAGCT GGGCCCGGGC
ACCGCCCCGG ACGACGCGCT GTCCTACCTG GACCGGCTGG ACCCCGACCG CGAGCCCGGC
CGGCTGTCGT TCATCGTCCG CATGGGCGCC GGGCAGGTGC GCGAGAAGCT GCCGGCCCTG
GTGGAGAAGG TGCGCGGCGA GGGCCACCAG GTGGCGTGGA TCTGCGACCC GATGCACGGC
AACACCTTCG AGGCGCCCTC GGGCCACAAG ACCCGCCGCT TCGACGACGT GCTCGACGAG
GTCAAGGGCT TCTTCGAGGT GCACCACGGC CTCGGCTCGC ACCCCGGCGG CATCCACGTC
GAGCTCACCG GCGAGGACGT CACCGAGTGC GTCGGCGGCG GCACGGAGAT CGCCCTGGAC
GCCCTGCACC AGCGCTACGA GACGCTGTGC GACCCCCGGC TGAACCGCAG CCAGTCGCTG
GACCTGGCGT TCCTCGTCGC GGAGATGCTG CGCGCCCGGC GCTGA
 
Protein sequence
MSEVVRNPAG TGSIEDTWSA LPAGQQPEWP DAAELSSVVS ELRSYPPLVF AGEADQLKDR 
IAAVSRGEAF LLTGGDCAET FAGVTADAIR AKLKTLLQMA VVLTYAASLP VVKVGRIAGQ
YSKPRSKPTE TRDGVTLPAY RGDSVNGFDF TPEARTPDPK RLLRLYQASA STLNLVRAFT
TGGYADMRQV HAWNQDFVAG SPSGERYEAL AGEIDRALQF MRACGTEPEE LRTVEFYAAH
EGLVLPYESA LTRIDSRTGN PYATSGHYIW IGERTRDLDG AHVEYFSRIS NPIGIKLGPG
TAPDDALSYL DRLDPDREPG RLSFIVRMGA GQVREKLPAL VEKVRGEGHQ VAWICDPMHG
NTFEAPSGHK TRRFDDVLDE VKGFFEVHHG LGSHPGGIHV ELTGEDVTEC VGGGTEIALD
ALHQRYETLC DPRLNRSQSL DLAFLVAEML RARR