Gene Caci_3635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3635 
Symbol 
ID8334988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4067112 
End bp4068218 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content71% 
IMG OID644956776 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003114379 
Protein GI256392815 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0253276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCGC CCGTTCCCTC CTCGCCCGCA GAGCCCGCCG AGACCACCGA AACCACCAGT 
GACCTGCGTG TGACCAGCTT CCAGCCCCTC ATCCCACCGG CCGACCTGCG GGCCGAGTTG
CCGCTGGGCG AGAAACGCGC CGCGTTGGTG CGTGAGAGCC GGCGTACCGT GCGCGACATC
CTCGCCGGCG CCGACGACCG GCTGCTGGTC GTCGTCGGGC CGTGCTCGGT CCATGACCCC
GCCGCCGCCC TGGAATACGC GCACCGGCTC GCCGCGGCCG CCGCCGAGCA CCGCGACGAC
GTGTTCGTGG TCATGCGCGT CTACTTCGAG AAGCCGCGCA CCACGGTGGG CTGGAAGGGC
CTGATCAACG ACCCGGGCAT GGACGGGACC CACGACGTCC CCCGAGGACT GCGCCTGGCG
CGTCAGGTCC TGCTCGACGT ACTGGACGCC GGCCTGCCGA CCGGCTGCGA ATTCCTGGAG
CCCACCAGCC CTCAGTACAT CGCCGACACC GTGTCCTGGG GCGCGATCGG CGCGCGAACG
CCCGAAAGCC AAGTCCACAG GCAGCTCGCC TCCGGCATGT CGATGCCGGT CGGCTTCAAG
AACGCCACCG ACGGCGCCAT CCAGCCCGCC ATCGACGGCT GCCGAGCCGC CGCCAGCGCG
CAGTCCTTCT TCGGCATGGA CGAGCAAGGC CGCGGCGCGG TCGTCTCCAC CACCGGCAAC
CCCGACTGCC ACATCATCCT GCGCGGCGGA CGCACCGGAC CCAACTACAG CACCGAAGAC
GTGCGAGCCG CCCTGGACCT CGTCCGCGAG GCAGGCAAGC CGGAGCACCT GATCATCGAC
GCCAGCCACG GCAACAGCGG CAAGGACCAC ACCCGCCAGA GCCTCGCCGT CCGCGAGATC
GCGAACCGCC TCGCGGCCGG AGACACCGGC GTCGCGGGCA TGATGCTCGA GAGCTTCCTG
GTCCCGGGAC GCCAGGAGCC GGGCCCGCTC GAGGGACTGC GCTACGGGCA GAGCGTGACG
GACGCGTGTA TCGGCTGGGA GGAGACCGAG GAACTGCTCC AGGTCATGGC CACGGCGGTC
CGCGACCGGC GCACGGCGCG GAGCTGA
 
Protein sequence
MPSPVPSSPA EPAETTETTS DLRVTSFQPL IPPADLRAEL PLGEKRAALV RESRRTVRDI 
LAGADDRLLV VVGPCSVHDP AAALEYAHRL AAAAAEHRDD VFVVMRVYFE KPRTTVGWKG
LINDPGMDGT HDVPRGLRLA RQVLLDVLDA GLPTGCEFLE PTSPQYIADT VSWGAIGART
PESQVHRQLA SGMSMPVGFK NATDGAIQPA IDGCRAAASA QSFFGMDEQG RGAVVSTTGN
PDCHIILRGG RTGPNYSTED VRAALDLVRE AGKPEHLIID ASHGNSGKDH TRQSLAVREI
ANRLAAGDTG VAGMMLESFL VPGRQEPGPL EGLRYGQSVT DACIGWEETE ELLQVMATAV
RDRRTARS